

Korean Title: Aural-visual two-stream 기반의 아기 울음소리 식별
English Title: Aural-visual two-stream based infant cry recognition
Author: 박철, 이종욱, 오스만, 박대희, 정용화 (Zhao Bo, Jonguk Lee, Othmane Atif, Daihee Park, Yongwha Chung)
Citation: VOL 28 NO. 01 PP. 0354 ~ 0357 (2021. 05)
Korean Abstract
English Abstract
Infants communicate their feelings and needs to the outside world through non-verbal means such as crying and displaying diverse facial expressions. However, inexperienced parents tend to decode these non-verbal messages incorrectly and take inappropriate actions, which might affect the bond they build with their babies and the cognitive development of the newborns. In this paper, we propose an aural-visual two-stream based infant cry recognition system to help parents comprehend the feelings and needs of crying babies. The proposed system first extracts features from the pre-processed audio and video data using the VGGish model and a 3D-CNN model, respectively, then fuses the extracted features using a fully connected layer, and finally applies a SoftMax function to classify the fused features and recognize the corresponding type of cry. The experimental results show that the proposed system achieves an F1-score above 0.92, which is 0.08 and 0.10 higher than the single-stream aural model and the single-stream visual model, respectively.
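The fusion and classification stage described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that the two feature vectors are concatenated before the fully connected layer, uses random weights in place of trained ones, and picks a hypothetical video feature size (`VIDEO_DIM`) and number of cry types (`NUM_CLASSES`), none of which are given in the abstract. Only the 128-dimensional audio embedding size follows from the published VGGish model. The function name `fuse_and_classify` is likewise illustrative.

```python
import math
import random

AUDIO_DIM = 128   # VGGish emits 128-dimensional embeddings
VIDEO_DIM = 64    # hypothetical 3D-CNN feature size (not stated in the abstract)
NUM_CLASSES = 3   # hypothetical number of cry types (not stated in the abstract)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_and_classify(audio_feat, video_feat, weights, biases):
    """Concatenate both streams, apply one fully connected layer, then softmax."""
    fused = list(audio_feat) + list(video_feat)  # assumed fusion input: concatenation
    logits = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(logits)


# Random stand-ins for extracted features and trained parameters.
rng = random.Random(0)
audio_feat = [rng.gauss(0, 1) for _ in range(AUDIO_DIM)]
video_feat = [rng.gauss(0, 1) for _ in range(VIDEO_DIM)]
weights = [
    [rng.gauss(0, 0.05) for _ in range(AUDIO_DIM + VIDEO_DIM)]
    for _ in range(NUM_CLASSES)
]
biases = [0.0] * NUM_CLASSES

probs = fuse_and_classify(audio_feat, video_feat, weights, biases)
predicted_class = max(range(NUM_CLASSES), key=probs.__getitem__)
```

Here `probs` holds one probability per assumed cry type and `predicted_class` is the index of the most likely one. In a real system the weights would be learned jointly with, or on top of, the two feature extractors.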
Keyword