Non-Intrusive Speech Intelligibility Estimation Using Autoencoder Features with Background Noise Information

Yue Ri Jeong; Seung Ho Choi

The Institute of Internet, Broadcasting and Communication (한국인터넷방송통신학회), International Journal of Internet, Broadcasting and Communication, Vol. 12, No. 3, August 2020, pp. 220-225

Abstract

This paper investigates non-intrusive speech intelligibility estimation in noisy environments when the bottleneck feature of an autoencoder is used as the input to a neural network. The bottleneck feature-based method suffers from severe performance degradation when the noise environment changes. To overcome this problem, we propose a novel non-intrusive speech intelligibility estimation method that adds noise environment information, along with the bottleneck feature, to the input of a long short-term memory (LSTM) neural network. The network's output is a short-time objective intelligibility (STOI) score, a standard intrusive measure of speech intelligibility that requires a reference speech signal. In experiments across various noise environments, the proposed method showed improved performance when the noise environment was the same. In particular, performance improved significantly over that of the conventional methods when the environments differed. We therefore conclude that the proposed method can be successfully used for non-intrusive speech intelligibility estimation in various noise environments.
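The input construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the noise environment information is encoded, so the one-hot code, the feature sizes, the sigmoid read-out, and the names `build_inputs` and `TinyLSTM` are all assumptions made for illustration. The network is untrained (random weights), so the score it returns is meaningless; the point is only how each frame's bottleneck feature is extended with the noise code before it reaches the LSTM.

```python
import math
import random


def one_hot(index, size):
    """Return a one-hot list of length `size` with 1.0 at `index`."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def build_inputs(bottleneck_frames, noise_index, num_noise_types):
    """Append the noise-environment code to every frame's bottleneck feature.

    The one-hot encoding is an assumption; the abstract only says that
    noise environment information is added to the LSTM input.
    """
    noise_code = one_hot(noise_index, num_noise_types)
    return [list(frame) + noise_code for frame in bottleneck_frames]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTM:
    """Single-layer LSTM with a scalar sigmoid read-out (hypothetical, untrained)."""

    def __init__(self, input_dim, hidden_dim, seed=0):
        rng = random.Random(seed)
        cols = input_dim + hidden_dim  # each gate sees [x_t, h_{t-1}]
        self.hidden_dim = hidden_dim
        # One weight matrix and bias per gate: input, forget, output, candidate.
        self.w = {
            gate: [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                   for _ in range(hidden_dim)]
            for gate in "ifog"
        }
        self.b = {gate: [0.0] * hidden_dim for gate in "ifog"}
        self.w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden_dim)]

    def _affine(self, gate, z):
        return [sum(w * v for w, v in zip(row, z)) + b
                for row, b in zip(self.w[gate], self.b[gate])]

    def predict(self, frames):
        """Run the sequence through the LSTM and map the last state to (0, 1)."""
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        for x in frames:
            z = list(x) + h
            i = [sigmoid(v) for v in self._affine("i", z)]
            f = [sigmoid(v) for v in self._affine("f", z)]
            o = [sigmoid(v) for v in self._affine("o", z)]
            g = [math.tanh(v) for v in self._affine("g", z)]
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        # Sigmoid read-out is an assumption: STOI scores lie roughly in [0, 1].
        return sigmoid(sum(w * v for w, v in zip(self.w_out, h)))


# Two frames of a 3-dimensional bottleneck feature, noise type 1 of 4 (toy values).
frames = [[0.2, -0.1, 0.4], [0.0, 0.3, -0.2]]
inputs = build_inputs(frames, noise_index=1, num_noise_types=4)
model = TinyLSTM(input_dim=len(inputs[0]), hidden_dim=5)
estimated_stoi = model.predict(inputs)
```

In a trained system the target for `estimated_stoi` would be the STOI score computed intrusively from the clean reference, so that the reference is needed only at training time.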

Author information

Yue Ri Jeong, Undergraduate Student, Dept. of Electronic and IT Media Engineering, Seoul National University of Science and Technology, Seoul, Korea
Seung Ho Choi, Professor, Dept. of Electronic and IT Media Engineering, Seoul National University of Science and Technology, Seoul, Korea
