User profiles for Junbin Xiao

Junbin Xiao

National University of Singapore
Verified email at comp.nus.edu.sg
Cited by 1998

NExT-QA: Next phase of question-answering to explaining temporal actions

J Xiao, X Shang, A Yao… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We introduce NExT-QA, a rigorously designed video question answering (VideoQA)
benchmark to advance video understanding from describing to explaining the temporal actions. …

Can I trust your answer? Visually grounded video question answering

J Xiao, A Yao, Y Li, TS Chua - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
We study visually grounded VideoQA in response to the emerging trends of utilizing
pretraining techniques for video-language understanding. Specifically, by forcing vision-language …

Invariant grounding for video question answering

Y Li, X Wang, J Xiao, W Ji… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Video Question Answering (VideoQA) is the task of answering questions about a video. At
its core is understanding the alignments between visual scenes in video and linguistic …

Video graph transformer for video question answering

J Xiao, P Zhou, TS Chua, S Yan - European Conference on Computer …, 2022 - Springer
This paper proposes a Video Graph Transformer (VGT) model for Video Question Answering
(VideoQA). VGT’s uniqueness is two-fold: 1) it designs a dynamic graph transformer …

Video as conditional graph hierarchy for multi-granular question answering

J Xiao, A Yao, Z Liu, Y Li, W Ji, TS Chua - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Video question answering requires the models to understand and reason about both the
complex video and language data to correctly derive the answers. Existing efforts have been …

Annotating objects and relations in user-generated videos

X Shang, D Di, J Xiao, Y Cao, X Yang… - Proceedings of the 2019 …, 2019 - dl.acm.org
Understanding the objects and relations between them is indispensable to fine-grained
video content analysis, which is widely studied in recent research works in multimedia and …

Video question answering: Datasets, algorithms and challenges

Y Zhong, J Xiao, W Ji, Y Li, W Deng… - arXiv preprint arXiv …, 2022 - arxiv.org
Video Question Answering (VideoQA) aims to answer natural language questions according
to the given videos. It has earned increasing attention with recent research trends in joint …

FakeSV: A multimodal benchmark with rich social context for fake news detection on short video platforms

P Qi, Y Bu, J Cao, W Ji, R Shui, J Xiao… - Proceedings of the …, 2023 - ojs.aaai.org
Short video platforms have become an important channel for news sharing, but also a new
breeding ground for fake news. To mitigate this problem, research of fake news video …

Abductive ego-view accident video understanding for safe driving perception

J Fang, L Li, J Zhou, J Xiao, H Yu… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present MM-AU, a novel dataset for Multi-Modal Accident video Understanding. MM-AU
contains 11,727 in-the-wild ego-view accident videos, each with temporally aligned text …

Preparation, characterization, and thermal properties of the microencapsulation of a hydrated salt as phase change energy storage materials

J Huang, T Wang, P Zhu, J Xiao - Thermochimica acta, 2013 - Elsevier
Microcapsules loaded by disodium hydrogen phosphate heptahydrate (Na₂HPO₄·7H₂O)
were prepared by means of the suspension copolymerization-solvent volatile method, with …