User profiles for Junbin Xiao
![]() | Junbin XiaoNational University of Singapore Verified email at comp.nus.edu.sg Cited by 1998 |
Next-qa: Next phase of question-answering to explaining temporal actions
We introduce NExT-QA, a rigorously designed video question answering (VideoQA)
benchmark to advance video understanding from describing to explaining the temporal actions. …
benchmark to advance video understanding from describing to explaining the temporal actions. …
Can i trust your answer? visually grounded video question answering
We study visually grounded VideoQA in response to the emerging trends of utilizing
pretraining techniques for video-language understanding. Specifically by forcing vision-language …
pretraining techniques for video-language understanding. Specifically by forcing vision-language …
Invariant grounding for video question answering
Video Question Answering (VideoQA) is the task of answering questions about a video. At
its core is understanding the alignments between visual scenes in video and linguistic …
its core is understanding the alignments between visual scenes in video and linguistic …
Video graph transformer for video question answering
This paper proposes a Video Graph Transformer (VGT) model for Video Question Answering
(VideoQA). VGT’s uniqueness are two-fold: 1) it designs a dynamic graph transformer …
(VideoQA). VGT’s uniqueness are two-fold: 1) it designs a dynamic graph transformer …
Video as conditional graph hierarchy for multi-granular question answering
Video question answering requires the models to understand and reason about both the
complex video and language data to correctly derive the answers. Existing efforts have been …
complex video and language data to correctly derive the answers. Existing efforts have been …
Annotating objects and relations in user-generated videos
Understanding the objects and relations between them is indispensable to fine-grained
video content analysis, which is widely studied in recent research works in multimedia and …
video content analysis, which is widely studied in recent research works in multimedia and …
Video question answering: Datasets, algorithms and challenges
Video Question Answering (VideoQA) aims to answer natural language questions according
to the given videos. It has earned increasing attention with recent research trends in joint …
to the given videos. It has earned increasing attention with recent research trends in joint …
Fakesv: A multimodal benchmark with rich social context for fake news detection on short video platforms
Short video platforms have become an important channel for news sharing, but also a new
breeding ground for fake news. To mitigate this problem, research of fake news video …
breeding ground for fake news. To mitigate this problem, research of fake news video …
Abductive ego-view accident video understanding for safe driving perception
We present MM-AU a novel dataset for Multi-Modal Accident video Understanding. MM-AU
contains 11727 in-the-wild ego-view accident videos each with temporally aligned text …
contains 11727 in-the-wild ego-view accident videos each with temporally aligned text …
Preparation, characterization, and thermal properties of the microencapsulation of a hydrated salt as phase change energy storage materials
J Huang, T Wang, P Zhu, J Xiao - Thermochimica acta, 2013 - Elsevier
Microcapsules loaded by disodium hydrogen phosphate heptahydrate (Na 2 HPO 4 ·7H 2 O)
were prepared by means of the suspension copolymerization-solvent volatile method, with …
were prepared by means of the suspension copolymerization-solvent volatile method, with …