earticle

논문검색

Interaural Time Difference Estimation Using Generalized Cross-correlation with Maximum Likelihood Weighting in Reverberant Environments

초록

영어

In this paper, an interaural time difference (ITD) estimation method is proposed for binaural speech separation in reverberant environments. First, the auditory signals are represented in the time-frequency (T-F) domain, and the ITD for each T-F bin is then estimated using generalized cross-correlation (GCC) with a maximum likelihood (ML) weighting function. In particular, the ML weighting function is designed to reduce the reverberation effect. Then, a mask is estimated by comparing the estimated ITD with the ITD corresponding to the location of the pre-defined target speech source. Finally, the target speech is separated by applying the mask to the auditory signals. It is shown that the proposed ITD estimation method outperforms a conventional cross-correlation-based ITD estimation method under reverberant conditions in terms of the signal-to-noise ratio (SNR) and signal-to-distortion ratio (SDR) of the separated speech signals.

목차

Abstract
 1. Introduction
 2. Binaural Speech Separation
  2.1. Gammatone Analysis
  2.2. ITD Estimation
  2.3. Mask Estimation
  2.4. Speech Reconstruction
 3. Proposed ML-GCC Based ITD Estimation
 4. Performance Evaluation
  4.1. Database
  4.2. SNR and SDR Measurements
 5. Conclusion
 Acknowledgements
 References

저자정보

  • Ji Hun Park Visual Display R&D Office, Samsung Electronics, Gyeonggi-do 443-742, Korea
  • Seung Ho Choi Dept. of Electronic and IT Media Engineering Seoul National University of Science and Technology, Seoul 139-743, Korea

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.