earticle

논문검색

Poster Session Ⅳ

음성 향상를 위한 STDCT 와 STFT 의 비교

원문정보

초록

영어

Speech enhancement is the task of improving the quality of the speech by reducing the noise. The magnitude of the short-time Fourier transform(STFT) or spectrogram is widely used for speech enhancement. However, this approach neglects the noisy phase and limits the quality of enhancement. Recently, short-time discrete cosine transform(STDCT) has been introduced to overcome the limitation of the STFT. STDCT is a real value representation; thus, it does not require phase information to reconstruct the audio. This paper compares the two approaches and analyzes the importance of phase information in speech enhancement. Our experiment shows that when trained under similar condition STFT performs better than STDCT in low noise scenarios, however, for high noise situations, STDCT has better performance than STFT.

목차

Abstract
1. Introduction
2. Related Works
3. Methods
3.1. Unet
4. Experiments
4.1. Experimental setup and dataset
4.2. Experimental result
5. Conclusions
Acknowledgement
References

저자정보

  • Nisan Aryal Department of IT Convergence Engineering Gachon University Gyeonggi-do, South Korea
  • Sung-Hwan Park Dept. of Nano Science and Technology Gachon University
  • Sung-yoon Ahn Department of Software Gachon University
  • Sang-Woong Lee Department of Software Gachon University

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      0개의 논문이 장바구니에 담겼습니다.