Speaker Recognition of Noisy Short Utterance Based on Speech Frame Quality Discrimination and Three-stage Classification Model

Ying Chen; Zhenmin Tang

Speaker Recognition of Noisy Short Utterance Based on Speech Frame Quality Discrimination and Three-stage Classification Model

원문정보

Ying Chen, Zhenmin Tang

보안공학연구지원센터(IJCA) International Journal of Control and Automation Vol.8 No.3 2015.03 pp.135-146 SCOPUS

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

The noisy short utterance is polluted by noise and corpus is less, so the recognition rate significantly decreased. For improving recognition rate, we proposed the dual information quality discrimination algorithm to classify the speech frames: one is differences detection and discrimination algorithm (DDADA), another is the improved SNR discrimination algorithm (ISNRDA). Based on the above two algorithms, the speech frames are classified to three classes: high quality, medium quality and low quality. We proposed GMM-UBM three-stage classification model, and we combine the dual information quality discrimination algorithm with GMM-UBM three-stage classification model. Experiments show that, the dual discrimination quality algorithms can be more precise to classify speech frame, and combining it with GMM-UBM three-stage classification model can make full use of limited corpus of short utterance and can improve the speaker recognition rate of the noisy short utterance.

Abstract
1. Introduction
2. Speech Frames Quality Discrimination
  2.1 Extract Spectral Feature of Noisy Speech
  2.2 NMF-SNRDA(Nonnegative Matrix Factorization-SNR Discrimination Algorithm)
  2.3 Differences Detection and Discrimination Algorithm (DDADA)
3. GMM-UBM Three-stage Classification Model
  3.2 Differences Detection and Discrimination Algorithm (DDADA)
4. GMM-UBM Three-stage Classification Model
5. Experiments and Results
  5.1 Speech Database and Noise
  5.2 The Relationship between Classification Methods and Recognition Rate
  5.3 Research on the Dual Information Quality Discrimination Algorithm
6. Conclusion
References

키워드

저자정보

Ying Chen School of Computer Seience & Engineering, Nanjing University of Science and Technology, Jiangsu Nanjing, China
Zhenmin Tang School of Computer Seience & Engineering, Nanjing University of Science and Technology, Jiangsu Nanjing, China

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle