Abstract
As many LLMs have been released, modified transformer-based network layers have been studied to improve performance. However, LLMs must be designed at a very large scale to achieve high performance; as a result, current LLMs can only be executed on large servers, and various attempts have been made to reduce their computational load. In this paper, we present a method to reduce the amount of computation by exploiting the data characteristics of the SwiGLU layer used by Meta and Google. Since SwiGLU contains an activation function, it generates a large number of near-zero values, and we reduce the amount of computation by skipping the resulting unnecessary operations. Our experiments show that our algorithm can reduce the computation by 13.3% when 20% of the activation function outputs are zero.
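To make the idea concrete, below is a minimal NumPy sketch of a SwiGLU feed-forward pass that skips channels whose gate activation is near zero, so the corresponding columns of the up-projection and rows of the down-projection are never computed. The threshold `eps`, the function name `swiglu_skip`, and the dense-matrix formulation are illustrative assumptions; the paper's actual skipping mechanism (presented as a hardware architecture) is not reproduced here.

```python
import numpy as np

def silu(z):
    # Swish/SiLU activation: z * sigmoid(z)
    return z / (1.0 + np.exp(-z))

def swiglu_skip(x, w_gate, w_up, w_down, eps=1e-3):
    """SwiGLU feed-forward for a single token vector x, skipping
    computation for near-zero gate activations.

    eps is a hypothetical threshold for treating an activation as zero;
    the paper's exact skipping criterion is not given in the abstract.
    """
    gate = silu(x @ w_gate)                  # full gate projection
    keep = np.nonzero(np.abs(gate) > eps)[0] # channels worth computing
    # Only the kept channels are multiplied through: the skipped columns
    # of w_up and rows of w_down contribute (near) zero and are omitted.
    hidden = gate[keep] * (x @ w_up[:, keep])
    return hidden @ w_down[keep, :]

# Usage with random weights (shapes d_model=8, d_ff=32 chosen for the demo):
rng = np.random.default_rng(0)
d_model, d_ff = 8, 32
x = rng.standard_normal(d_model)
w_gate = rng.standard_normal((d_model, d_ff))
w_up = rng.standard_normal((d_model, d_ff))
w_down = rng.standard_normal((d_ff, d_model))
y = swiglu_skip(x, w_gate, w_up, w_down)
```

In this sketch the fraction of skipped channels determines the savings in the up- and down-projections, which is consistent with the abstract's observation that more activation zeros yield a larger computation reduction.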
Table of Contents
I. INTRODUCTION
II. BACKGROUND
III. PROPOSED ARCHITECTURE
IV. RESULT & CONCLUSION
ACKNOWLEDGMENT
REFERENCES
