2020 Korea Computer Congress (KCC 2020)
Korean Title |
A Hybrid approach for speech emotion recognition using 1D-CNN LSTM |
English Title |
A Hybrid approach for speech emotion recognition using 1D-CNN LSTM |
Author |
Sanghoon Lee
Taeho Choi
Minjae Joo
Sunkyu Kim
Inggeol Lee
Junseok Choi
Jaewoo Kang
Gayrat Tangriberganov
Tosin A. Adesuyi
Byeong Ma
|
Citation |
Vol. 47, No. 1, pp. 833-835 (2020.07) |
Korean Abstract |
|
English Abstract |
Speech is an important aspect of human interaction through which relationships and emotions can be expressed. The ability to learn and recognize human emotions from speech has become an area of interest in the fields of human-machine interaction and machine learning. Using private and publicly available datasets, several studies on speech emotion recognition (SER) have been carried out successfully. However, their recognition results require improvement because important speech emotion features are not well captured during training. Hence, we propose a 1D-CNN LSTM for speech emotion recognition to assist in capturing global and local features. The 1D-CNN part is responsible for learning the salient features required for recognition, while the LSTM performs classification based on the features extracted by the 1D-CNN. We evaluated our hybrid approach on raw audio from the Berlin EmoDB dataset and achieved an average accuracy of 95.5% and a validation accuracy of 63.7%. Our result is an improvement over existing studies. |
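The abstract describes a hybrid architecture in which a 1D-CNN extracts salient features from the raw waveform and an LSTM classifies the emotion from those features. The following is a minimal PyTorch sketch of that pattern; the layer counts, kernel sizes, and hidden dimensions are illustrative assumptions, not the configuration used in the paper (only the seven-class setup follows Berlin EmoDB's seven emotion labels).

```python
import torch
import torch.nn as nn

class CNN_LSTM(nn.Module):
    """Sketch of a hybrid 1D-CNN + LSTM for speech emotion recognition.

    The Conv1d stack learns local acoustic features from raw audio;
    the LSTM models their temporal structure, and a final linear layer
    maps the last hidden state to emotion classes.
    """
    def __init__(self, n_classes=7):  # Berlin EmoDB has 7 emotion classes
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv1d(1, 64, kernel_size=8, stride=2), nn.ReLU(),
            nn.MaxPool1d(4),
            nn.Conv1d(64, 128, kernel_size=8, stride=2), nn.ReLU(),
            nn.MaxPool1d(4),
        )
        self.lstm = nn.LSTM(input_size=128, hidden_size=64, batch_first=True)
        self.fc = nn.Linear(64, n_classes)

    def forward(self, x):                 # x: (batch, 1, samples)
        feats = self.cnn(x)               # (batch, 128, time)
        feats = feats.transpose(1, 2)     # (batch, time, 128) for the LSTM
        _, (h, _) = self.lstm(feats)      # h: (1, batch, 64)
        return self.fc(h[-1])             # (batch, n_classes)

# One second of dummy raw audio at 16 kHz, batch of 2
model = CNN_LSTM()
logits = model(torch.randn(2, 1, 16000))
print(logits.shape)  # torch.Size([2, 7])
```

In this sketch the classifier reads only the LSTM's final hidden state; pooling over all time steps is an equally common alternative.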
Keyword |
|