Aural-visual two-stream 기반의 아기 울음소리 식별

박철; 이종욱; 오스만; 박대희; 정용화; Zhao Bo; Jonguk Lee; Othmane Atif; Daihee Park; Yongwha Chung

연구문헌

홈 > 연구문헌 >

Current Result Document :

한글제목(Korean Title)	Aural-visual two-stream 기반의 아기 울음소리 식별
영문제목(English Title)	Aural-visual two-stream based infant cry recognition
저자(Author)	박철 이종욱 오스만 박대희 정용화 Zhao Bo Jonguk Lee Othmane Atif Daihee Park Yongwha Chung
원문수록처(Citation)	VOL 28 NO. 01 PP. 0354 ~ 0357 (2021. 05)
한글내용 (Korean Abstract)
영문내용 (English Abstract)	Infants communicate their feelings and needs to the outside world through non-verbal methods such as crying and displaying diverse facial expressions. However, inexperienced parents tend to decode these non-verbal messages incorrectly and take inappropriate actions, which might affect the bonding they build with their babies and the cognitive development of the newborns. In this paper, we propose an aural-visual two-stream based infant cry recognition system to help parents comprehend the feelings and needs of crying babies. The proposed system first extracts the features from the pre-processed audio and video data by using the VGGish model and 3D-CNN model respectively, fuses the extracted features using a fully connected layer, and finally applies a SoftMax function to classify the fused features and recognize the corresponding type of cry. The experimental results show that the proposed system classification exceeds 0.92 in F1-score, which is 0.08 and 0.10 higher than the single-stream aural model and single-stream visual model.
키워드(Keyword)
파일첨부	PDF 다운로드

사이트맵

연구문헌

교육정보

심화정보

컴퓨터iN

연구자료

알림마당

CSERIC 광장

서비스 바로가기

Please wait....

연구문헌

Current Result Document :