JIPS (Çѱ¹Á¤º¸Ã³¸®ÇÐȸ)
Current Result Document :
ÇѱÛÁ¦¸ñ(Korean Title) |
Multimodal Context Embedding for Scene Graph Generation |
¿µ¹®Á¦¸ñ(English Title) |
Multimodal Context Embedding for Scene Graph Generation |
ÀúÀÚ(Author) |
Gayoung Jung
Incheol Kim
|
¿ø¹®¼ö·Ïó(Citation) |
VOL 16 NO. 06 PP. 1250 ~ 1260 (2020. 12) |
Çѱ۳»¿ë (Korean Abstract) |
|
¿µ¹®³»¿ë (English Abstract) |
This study proposes a novel deep neural network model that can accurately detect objects and their relationships in an image and represent them as a scene graph. The proposed model utilizes several multimodal features, including linguistic features and visual context features, to accurately detect objects and relationships. In addition, in the proposed model, context features are embedded using graph neural networks to depict the dependencies between two related objects in the context feature vector. This study demonstrates the effectiveness of the proposed model through comparative experiments using the Visual Genome benchmark dataset.
|
Å°¿öµå(Keyword) |
Deep Neural Network
Multimodal Context
Relationship Detection
Scene Graph Generation
|
ÆÄÀÏ÷ºÎ |
PDF ´Ù¿î·Îµå
|