A New Tempo Feature Extraction Based on Modulation Spectrum Analysis for Music Information Retrieval Tasks

Hyoung-Gook Kim

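The Korea Institute of Intelligent Transport Systems (한국ITS학회), The Journal of The Korea Institute of Intelligent Transport Systems (한국ITS학회논문지), Vol. 6, No. 2 (Serial No. 13), August 2007, pp. 95-106. KCI candidate journal.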

Abstract

English

This paper proposes an effective tempo feature extraction method for music information retrieval. The tempo information is modeled by narrow-band temporal modulation components, which are decomposed into a modulation spectrum via joint frequency analysis. In the implementation, the tempo feature is extracted directly from the modified discrete cosine transform (MDCT) coefficients, which are the output of a partial MP3 (MPEG-1 Layer 3) decoder. Different features are then extracted from the amplitudes of the modulation spectrum and applied to different music information retrieval tasks. The logarithmic-scale modulation frequency coefficients are employed in automatic music emotion classification and music genre classification, and the classification precision of both systems improves significantly. The bit vectors derived from the adaptive modulation spectrum are used in the audio fingerprinting task, where they are shown to achieve high robustness. The experimental results in these tasks validate the effectiveness of the proposed tempo feature.

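Korean (English translation)

This paper proposes an effective tempo feature extraction method for music information retrieval. The proposed tempo information is formed by temporal modulation components in narrow bands. These modulation components are represented as a modulation spectrum, obtained by first computing a spectrum from the time-domain music signal and then applying frequency-domain analysis to each spectral component. In the actual implementation, the modulation spectrum is obtained by applying a Fourier transform to the modified discrete cosine transform coefficients output by partial decoding of MP3 music files. The music tempo features, extracted at high speed from the amplitudes of the resulting modulation spectrum, are applied to a variety of music information retrieval tasks. In music mood and genre classification, log modulation frequency coefficients improve classification performance, and the bit vectors derived from the adaptive modulation spectrum, applied to audio fingerprinting, greatly improve retrieval performance even in noisy environments.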
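The pipeline summarized in the abstract (subband magnitudes per frame, a Fourier transform along the time axis of each subband, then log-scale pooling of the modulation frequencies) might be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the function names, the synthetic input standing in for MDCT magnitudes from a partial MP3 decoder, the frame rate of 50 frames per second, the Hann window, and the choice of 8 geometrically spaced modulation bands are all hypothetical.

```python
import numpy as np


def modulation_spectrum(subband_frames, frame_rate):
    """Magnitude modulation spectrum of each subband envelope.

    subband_frames: array of shape (n_frames, n_bands), for example MDCT
    magnitudes per frame (assumed input layout, not taken from the paper).
    Returns (modulation frequencies in Hz, spectrum of shape (n_freqs, n_bands)).
    """
    env = np.abs(subband_frames)
    env = env - env.mean(axis=0, keepdims=True)   # remove the DC offset per band
    window = np.hanning(env.shape[0])[:, None]    # taper along the time axis
    spectrum = np.abs(np.fft.rfft(env * window, axis=0))
    freqs = np.fft.rfftfreq(env.shape[0], d=1.0 / frame_rate)
    return freqs, spectrum


def log_modulation_coefficients(freqs, spectrum, n_coeffs=8, f_min=0.5):
    """Pool modulation energy into geometrically spaced bands, then compress.

    The band layout (n_coeffs, f_min) is an illustrative assumption.
    """
    energy = spectrum.sum(axis=1)                 # sum over acoustic subbands
    edges = np.geomspace(f_min, freqs[-1], n_coeffs + 1)
    coeffs = np.zeros(n_coeffs)
    for i in range(n_coeffs):
        mask = (freqs >= edges[i]) & (freqs < edges[i + 1])
        coeffs[i] = energy[mask].sum()
    return np.log1p(coeffs)


# Synthetic stand-in for decoder output: 16 subbands whose envelopes pulse
# at 2 Hz (120 beats per minute), sampled at a hypothetical 50 frames/s.
frame_rate = 50.0
t = np.arange(500) / frame_rate
gains = np.linspace(1.0, 0.2, 16)
frames = (1.0 + 0.5 * np.cos(2 * np.pi * 2.0 * t))[:, None] * gains[None, :]

freqs, spectrum = modulation_spectrum(frames, frame_rate)
peak_hz = freqs[np.argmax(spectrum.sum(axis=1))]
coeffs = log_modulation_coefficients(freqs, spectrum)
print("dominant modulation frequency (Hz):", peak_hz)
print("log-scale modulation coefficients:", coeffs)
```

In this sketch the dominant modulation frequency plays the role of a tempo estimate, and the pooled coefficients form a short feature vector of the kind a classifier could consume. How the paper derives its fingerprint bit vectors from the adaptive modulation spectrum is not stated in the abstract, so that step is left out.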

Korean Abstract
Abstract
I. Introduction
II. Tempo Feature Extraction
  1. Tempo Characterization
  2. Feature Extraction
III. Music Emotion Classification
  1. Feature Extraction
  2. Classification of Music Emotion
  3. Database for Music Emotion Classification
  4. Experimental Results for Music Emotion Classification
IV. Music Genre Classification
V. Audio Fingerprinting
  1. Audio Fingerprinting Design
  2. Robustness Evaluation
  3. Discussion
VI. Conclusion
References

Author Information

Hyoung-Gook Kim (김형국), Professor, Department of Radio Engineering (전파공학과), Kwangwoon University (광운대학교)
