Publications

2024

TrackIME: Enhanced Video Point Tracking via Instance Motion Estimation spotlight
Seong Hyeon Park, Huiwon Jang, Byungwoo Jeon, Sukmin Yun, Paul Hongsuck Seo, Jinwoo Shin
In NeurIPS 2024

Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
Heeseong Shin, Chaehyun Kim, Sunghwan Hong, Seokju Cho, Anurag Arnab, +Paul Hongsuck Seo, +Seungryong Kim (+ corresponding authors)
In NeurIPS 2024

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation
Seonghoon Yu, +Paul Hongsuck Seo, +Jeany Son (+ corresponding authors)
In ECCV 2024

CAT-Seg: Cost Aggregation for Open-vocabulary Semantic Segmentation highlight
*Seokju Cho, *Heeseong Shin, Sunghwan Hong, Anurag Arnab, +Paul Hongsuck Seo, +Seungryong Kim (* equal contribution; + corresponding authors)
In CVPR 2024

Learning Correlation Structures for Vision Transformers
Manjin Kim, +Paul Hongsuck Seo, Cordelia Schmid, +Minsu Cho (+ corresponding authors)
In CVPR 2024

2023

AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid
In CVPR 2023

IFSeg: Image-free Semantic Segmentation via Vision-Language Model
Sukmin Yun, Seong Hyeon Park, Paul Hongsuck Seo, Jinwoo Shin
In CVPR 2023

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Antoine Yang, Arsha Nagrani, Paul Hongsuck Seo, Antoine Miech, Jordi Pont-Tuset, Ivan Laptev, Josef Sivic, Cordelia Schmid
In CVPR 2023

Zero-shot Referring Image Segmentation with Global-Local Context Features
Seonghoon Yu, Paul Hongsuck Seo, Jeany Son
In CVPR 2023

2022

Learning Audio-Video Modalities from Image Captions
Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid
In ECCV 2022

AVATAR: Unconstrained Audiovisual Speech Recognition oral
*Valentin Gabeur, *Paul Hongsuck Seo, *Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid (* equal contribution)
In Interspeech 2022

End-to-end Generative Pretraining for Multimodal Video Captioning
Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid
In CVPR 2022

2021

Look Before you Speak: Visually Contextualized Utterances
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid
In CVPR 2021

2020

Reinforcing an Image Caption Generator by Off-line Human Feedback oral
Paul Hongsuck Seo, Piyush Sharma, Tomer Levinboim, Bohyung Han, Radu Soricut
In AAAI 2020

2019

Combinatorial Inference against Label Noise
Paul Hongsuck Seo, Geeho Kim, Bohyung Han
In NeurIPS 2019

Regularizing Neural Networks via Stochastic Branch Layers oral
*Wonpyo Park, *Paul Hongsuck Seo, Bohyung Han, Minsu Cho (* equal contribution)
In ACML 2019

Learning for Single-Shot Confidence Calibration in Deep Neural Networks through Stochastic Inferences
*Seonguk Seo, *Paul Hongsuck Seo, Bohyung Han (* equal contribution)
In CVPR 2019

2018

CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps
Paul Hongsuck Seo, Tobias Weyand, Jack Sim, Bohyung Han
In ECCV 2018

Attentive Semantic Alignment with Offset-Aware Correlation Kernels
Paul Hongsuck Seo, Jongmin Lee, Deunsol Jung, Bohyung Han, Minsu Cho
In ECCV 2018

Progressive Attention Networks for Visual Attribute Prediction
Paul Hongsuck Seo, Zhe Lin, Scott Cohen, Xiaohui Shen, Bohyung Han
In BMVC 2018

2017

Visual Reference Resolution using Attention Memory for Visual Dialog
Paul Hongsuck Seo, Andreas Lehrmann, Bohyung Han, Leonid Sigal
In NIPS 2017

MarioQA: Answering Questions by Watching Gameplay Videos
*Jonghwan Mun, *Paul Hongsuck Seo, Ilchae Jung, Bohyung Han (* equal contribution)
In ICCV 2017

2016

Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction oral
Hyeonwoo Noh, Paul Hongsuck Seo, Bohyung Han
In CVPR 2016

A Corpus for a Multimodal Dialog System for Presentation Controls
Paul Hongsuck Seo, Gary Geunbae Lee
In Proceedings of the International Workshop Series on Multimodal Corpora (MMC 2016)

2015

Conversational Knowledge Teaching Agent that Uses a Knowledge Base
Kyusong Lee, Paul Hongsuck Seo, Junhwi Choi, Sangjun Koo, Gary Geunbae Lee
In SIGDIAL 2015

2014

Grammatical Error Correction based on Learner Comprehension Model in Oral Conversation
Kyusong Lee, Seonghan Ryu, Paul Hongsuck Seo, Seokhwan Kim, Gary Geunbae Lee
In Proceedings of the IEEE Workshop on Spoken Language Technology (SLT 2014)

2012

Generating Grammar Questions using Corpus Data in L2 Learning
Kyusong Lee, Soo-ok Kweon, Hongsuck Seo, Gary Geunbae Lee
In Proceedings of the IEEE Workshop on Spoken Language Technology (SLT 2012)

A Meta-Learning Approach to Grammatical Error Correction
Hongsuck Seo, Jonghoon Lee, Seokhwan Kim, Kyusong Lee, Sechun Kang, Gary Geunbae Lee
In ACL 2012

Grammatical Error Annotation for Korean Learners of Spoken English
Hongsuck Seo, Kyusong Lee, Gary Geunbae Lee, Soo-ok Kweon
In LREC 2012