Google Scholar

User profiles for Junbin Xiao

Junbin Xiao

National University of Singapore

Verified email at comp.nus.edu.sg

Cited by 1998

NExT-QA: Next phase of question-answering to explaining temporal actions

J Xiao, X Shang, A Yao… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

We introduce NExT-QA, a rigorously designed video question answering (VideoQA)
benchmark to advance video understanding from describing to explaining the temporal actions. …

Cited by 580

Can I trust your answer? Visually grounded video question answering

J Xiao, A Yao, Y Li, TS Chua - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com

We study visually grounded VideoQA in response to the emerging trends of utilizing
pretraining techniques for video-language understanding. Specifically by forcing vision-language …

Cited by 99

Invariant grounding for video question answering

Y Li, X Wang, J Xiao, W Ji… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Video Question Answering (VideoQA) is the task of answering questions about a video. At
its core is understanding the alignments between visual scenes in video and linguistic …

Cited by 164

Video graph transformer for video question answering

J Xiao, P Zhou, TS Chua, S Yan - European Conference on Computer …, 2022 - Springer

This paper proposes a Video Graph Transformer (VGT) model for Video Question Answering
(VideoQA). VGT’s uniqueness are two-fold: 1) it designs a dynamic graph transformer …

Cited by 124

Video as conditional graph hierarchy for multi-granular question answering

J Xiao, A Yao, Z Liu, Y Li, W Ji, TS Chua - Proceedings of the AAAI …, 2022 - ojs.aaai.org

Video question answering requires the models to understand and reason about both the
complex video and language data to correctly derive the answers. Existing efforts have been …

Cited by 147

Annotating objects and relations in user-generated videos

X Shang, D Di, J Xiao, Y Cao, X Yang… - Proceedings of the 2019 …, 2019 - dl.acm.org

Understanding the objects and relations between them is indispensable to fine-grained
video content analysis, which is widely studied in recent research works in multimedia and …

Cited by 207

Video question answering: Datasets, algorithms and challenges

Y Zhong, J Xiao, W Ji, Y Li, W Deng… - arXiv preprint arXiv …, 2022 - arxiv.org

Video Question Answering (VideoQA) aims to answer natural language questions according
to the given videos. It has earned increasing attention with recent research trends in joint …

Cited by 124

FakeSV: A multimodal benchmark with rich social context for fake news detection on short video platforms

P Qi, Y Bu, J Cao, W Ji, R Shui, J Xiao… - Proceedings of the …, 2023 - ojs.aaai.org

Short video platforms have become an important channel for news sharing, but also a new
breeding ground for fake news. To mitigate this problem, research of fake news video …

Cited by 92

Abductive ego-view accident video understanding for safe driving perception

J Fang, L Li, J Zhou, J Xiao, H Yu… - Proceedings of the …, 2024 - openaccess.thecvf.com

We present MM-AU a novel dataset for Multi-Modal Accident video Understanding. MM-AU
contains 11727 in-the-wild ego-view accident videos each with temporally aligned text …

Cited by 29

Preparation, characterization, and thermal properties of the microencapsulation of a hydrated salt as phase change energy storage materials

J Huang, T Wang, P Zhu, J Xiao - Thermochimica acta, 2013 - Elsevier

Microcapsules loaded by disodium hydrogen phosphate heptahydrate (Na₂HPO₄·7H₂O)
were prepared by means of the suspension copolymerization-solvent volatile method, with …

Cited by 153
