Oral Session Ⅱ: Medical, Multimedia Content

Emotion Recognition in Speech Signals through Graph Neural Networks Integration: A GCN-GAT Hybrid Model

Abstract

Our research aims to improve the modeling of speech signals so that node features can be extracted, and relationships between nodes analyzed, more effectively. To this end, we model speech signals as cyclic or linear graphs. Our model combines Graph Convolutional Network (GCN) and Graph Attention Network (GAT) layers to leverage their respective strengths in processing graph data. Specifically, GCN layers aggregate information from neighboring nodes, capturing local relationships among nodes, while GAT layers assign varying attention weights to different neighboring nodes, which helps capture complex global relationships between nodes. We validate our approach on the IEMOCAP dataset and demonstrate performance comparable to state-of-the-art models on emotion recognition tasks. These results provide new insights and methodologies for further exploration in speech signal processing.
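
The pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: treating each node as one speech frame, the feature sizes, the random weights, the single attention head, and the mean pooling at the end are all assumptions made for illustration, and the helper names (`frame_graph`, `gcn_layer`, `gat_layer`) are hypothetical.

```python
import numpy as np

def frame_graph(num_frames, cyclic=True):
    """Adjacency matrix linking each frame to its temporal neighbours.
    cyclic=True also links the last frame back to the first."""
    A = np.zeros((num_frames, num_frames))
    idx = np.arange(num_frames - 1)
    A[idx, idx + 1] = 1.0
    A[idx + 1, idx] = 1.0
    if cyclic and num_frames > 2:
        A[0, -1] = A[-1, 0] = 1.0
    return A

def gcn_layer(A, X, W):
    """One GCN step: ReLU(D^-1/2 (A + I) D^-1/2 X W)."""
    A_hat = A + np.eye(A.shape[0])              # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    A_norm = A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(A_norm @ X @ W, 0.0)

def gat_layer(A, X, W, a):
    """Single-head GAT step without the output nonlinearity.
    Returns the new node features and the attention weights."""
    H = X @ W
    f_out = H.shape[1]
    # e_ij = LeakyReLU(a^T [h_i || h_j]), split into source and target terms
    e = (H @ a[:f_out])[:, None] + (H @ a[f_out:])[None, :]
    e = np.where(e > 0, e, 0.2 * e)
    # attend only to graph neighbours (and the node itself)
    mask = (A + np.eye(A.shape[0])) > 0
    e = np.where(mask, e, -np.inf)
    e = e - e.max(axis=1, keepdims=True)        # numerically stable softmax
    alpha = np.exp(e)
    alpha = alpha / alpha.sum(axis=1, keepdims=True)
    return alpha @ H, alpha

# Toy input: 6 frames with 4 features each (assumed sizes, random values).
rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))
A = frame_graph(6, cyclic=True)
H1 = gcn_layer(A, X, rng.normal(size=(4, 8)))
H2, alpha = gat_layer(A, H1, rng.normal(size=(8, 8)), rng.normal(size=16))
utterance_embedding = H2.mean(axis=0)           # assumed pooling choice
```

In this sketch the GCN step weights neighbours only by node degree, whereas the GAT step learns a separate weight for each neighbour. A classifier over emotion labels would then take `utterance_embedding` as input.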

Table of Contents

Abstract
1. Introduction
2. Related works
3. Methods
3.1. Dataset
3.2. Experiment setup
4. Experiment result
5. Conclusions
Acknowledgement
References

Author Information

  • WangHan, Department of Electrical and Computer Engineering, Inha University
  • Deok-Hwan Kim, Department of Electrical and Computer Engineering, Inha University
