• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

Ȩ Ȩ > ¿¬±¸¹®Çå >

Current Result Document :

ÇѱÛÁ¦¸ñ(Korean Title) Æ®·£½ºÆ÷¸ÓÀÇ È¿°úÀûÀÎ ½Ã°£ Ư¡ Á¤º¸ ÇнÀÀ» À§ÇÑ ÇÕ¼º°ö ±â¹ý
¿µ¹®Á¦¸ñ(English Title) Convolutional Approach to Learning Temporal Feature Effectively in Transformer
ÀúÀÚ(Author) ¹ÚÇؼº   Á¤Çõö   ÃÖ¿ë¼®   Hae Sung Park   Hyuck Chul Jung   Yong Suk Choi  
¿ø¹®¼ö·Ïó(Citation) VOL 49 NO. 02 PP. 0517 ~ 0519 (2022. 12)
Çѱ۳»¿ë
(Korean Abstract)
¿µ¹®³»¿ë
(English Abstract)
In the video classification task, a well-performing deep learning model is likely to extract proper temporal features to classify the data. However, we found out several problems of attention-based TimeSformer[2] related to extracting temporal features, and replaced the time attention module in the TimeSformer with the 3D convolution module for better temporal feature processing. Through several experiments and visualization results, we demonstrate that the 3D convolution module can extract more accurate temporal features of video data than the time-attention module.
Å°¿öµå(Keyword)
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå