Subword Neural Language Generation with Unlikelihood Training

Abstract

English

A language model based on neural networks is commonly trained with a likelihood loss so that the model learns the sequence of human text. State-of-the-art results have been achieved in various language generation tasks, e.g., text summarization, dialogue response generation, and text generation, by utilizing the language model’s next-token output probabilities. Monotonous and boring outputs are a well-known problem of these models, yet only a few solutions have been proposed to address it. Several decoding techniques have been proposed to suppress repetitive tokens. Unlikelihood training approaches this problem by penalizing the probabilities of candidate tokens that have already been seen in previous steps. While the method successfully generates less repetitive text, it has a large memory footprint because training requires a big vocabulary. We effectively reduce the memory footprint by encoding words as sequences of subword units. Finally, we report results competitive with token-level unlikelihood training in several automatic evaluations compared to the previous work.
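The token-level unlikelihood objective described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `unlikelihood_loss`, the weight `alpha`, and the `pad_id` handling are assumptions, and the logits are taken to come from a model over a subword vocabulary (which is where the reduced memory footprint comes from).

```python
import torch
import torch.nn.functional as F

def unlikelihood_loss(logits, targets, alpha=1.0, pad_id=0):
    """Likelihood + token-level unlikelihood loss for one sequence (illustrative sketch).

    logits  : (T, V) next-token logits from a language model
    targets : (T,)   gold next tokens (subword ids)
    At step t, the negative candidates are the tokens already seen in
    targets[:t], excluding the current gold token; their probabilities
    are pushed down via -log(1 - p). `alpha` weights that penalty.
    """
    T, V = logits.shape
    log_probs = F.log_softmax(logits, dim=-1)                   # (T, V)
    mle = F.nll_loss(log_probs, targets, ignore_index=pad_id)   # standard likelihood term

    # candidates[t, v] = 1 if token v occurred anywhere in targets[:t]
    prev = targets.unsqueeze(0).expand(T, T)                    # row t = full target sequence
    seen = torch.ones(T, T, device=logits.device).tril(diagonal=-1).bool()
    candidates = torch.zeros(T, V, device=logits.device)
    candidates.scatter_(1, prev.masked_fill(~seen, pad_id), 1.0)
    candidates[:, pad_id] = 0.0                                 # padding is never a candidate
    candidates.scatter_(1, targets.unsqueeze(1), 0.0)           # exclude the gold token itself

    # unlikelihood term: -log(1 - p(c)) summed over negative candidates, averaged over steps
    one_minus_p = torch.clamp(1.0 - log_probs.exp(), min=1e-5)
    ul = -(torch.log(one_minus_p) * candidates).sum(dim=-1).mean()

    return mle + alpha * ul
```

In training, one would compute this loss per sequence in a batch and back-propagate `mle + alpha * ul` in place of the plain cross-entropy loss, so that tokens repeated from the context are explicitly penalized.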

Table of Contents

Abstract
1. INTRODUCTION
2. METHODOLOGIES
2.1 Subword Tokenization
2.2 Unlikelihood Training
3. EXPERIMENTS
3.1 Experiment Setups
3.2 Evaluation Metrics
4. RESULTS AND DISCUSSIONS
5. CONCLUSION
Acknowledgment
References

Author Information

  • Salahuddin Muhammad Iqbal, Master's Student, Department of Computer Engineering, Dongseo University, Korea
  • Dae-Ki Kang, Professor, Department of Computer Engineering, Dongseo University, Korea
