default search action

combined dblp search
author search
venue search
publication search

ask others

Difei Gao

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tip/BaiWGC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tip/BaiWGC24
Ziyi Bai, Ruiping Wang, Difei Gao, Xilin Chen:
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering. IEEE Trans. Image Process. 33: 1109-1121 (2024)
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/Gao0BOLMWZWGWZS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/Gao0BOLMWZWGWZS24
Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou:
AssistGUI: Task-Oriented PC Graphical User Interface Automation. CVPR 2024: 13289-13298
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ChenLWLSGLGMS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ChenLWLSGLGMS24
Joya Chen, Zhaoyang Lv, Shiwei Wu, Kevin Qinghong Lin, Chenan Song, Difei Gao, Jia-Wei Liu, Ziteng Gao, Dongxing Mao, Mike Zheng Shou:
VideoLLM-online: Online Video Large Language Model for Streaming Video. CVPR 2024: 18407-18418
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/LeiGYZGSGSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LeiGYZGSGSS24
Weixian Lei, Yixiao Ge, Kun Yi, Jianfeng Zhang, Difei Gao, Dylan Sun, Yuying Ge, Ying Shan, Mike Zheng Shou:
VIT-LENS: Towards Omni-modal Representations. CVPR 2024: 26637-26647
[c17]
- view
  - electronic edition @ ijcai.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/ijcai/HuLGTW0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/HuLGTW0S24
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Qian Wang, Zheng Qin, Mike Zheng Shou:
Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces. IJCAI 2024: 5862-5871
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GaoHBLS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GaoHBLS24
Difei Gao, Siyuan Hu, Zechen Bai, Qinghong Lin, Mike Zheng Shou:
AssistEditor: Multi-Agent Collaboration for GUI Workflow Automation in Video Creation. ACM Multimedia 2024: 11255-11257
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-13516
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-13516
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Qian Wang, Zheng Qin, Mike Zheng Shou:
Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces. CoRR abs/2401.13516 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-14974
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14974
Henry Hengyuan Zhao, Pan Zhou, Difei Gao, Mike Zheng Shou:
LOVA3: Learning to Visual Question Answering, Asking and Assessment. CoRR abs/2405.14974 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10227
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10227
Kevin Qinghong Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou:
VideoGUI: A Benchmark for GUI Automation from Instructional Videos. CoRR abs/2406.10227 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-11816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-11816
Joya Chen, Zhaoyang Lv, Shiwei Wu, Kevin Qinghong Lin, Chenan Song, Difei Gao, Jia-Wei Liu, Ziteng Gao, Dongxing Mao, Mike Zheng Shou:
VideoLLM-online: Online Video Large Language Model for Streaming Video. CoRR abs/2406.11816 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13719
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13719
Qinchen Wu, Difei Gao, Kevin Qinghong Lin, Zhuoyu Wu, Xiangwu Guo, Peiran Li, Weichen Zhang, Hengxu Wang, Mike Zheng Shou:
GUI Action Narrator: Where and When Did That Action Take Place? CoRR abs/2406.13719 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-21757
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-21757
Kevin Qinghong Lin, Pengchuan Zhang, Difei Gao, Xide Xia, Joya Chen, Ziteng Gao, Jinheng Xie, Xuhong Xiao, Mike Zheng Shou:
Learning Video Context as Interleaved Multimodal Sequences. CoRR abs/2407.21757 (2024)
2023
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/GaoWSC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/GaoWSC23
Difei Gao, Ruiping Wang, Shiguang Shan, Xilin Chen:
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 5561-5578 (2023)
[c15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LeiGWWLZS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LeiGWWLZS23
Stan Weixian Lei, Difei Gao, Jay Zhangjie Wu, Yuxuan Wang, Wei Liu, Mengmi Zhang, Mike Zheng Shou:
Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task. AAAI 2023: 1250-1259
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HouZ0GYCNSD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HouZ0GYCNSD23
Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, Wing Kwong Chan, Chong-Wah Ngo, Mike Zheng Shou, Nan Duan:
CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding. ACL (1) 2023: 8013-8028
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ChenGLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ChenGLS23
Joya Chen, Difei Gao, Kevin Qinghong Lin, Mike Zheng Shou:
Affordance Grounding from Demonstration Video to Target Image. CVPR 2023: 6799-6808
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/GaoZ0ZYS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/GaoZ0ZYS23
Difei Gao, Luowei Zhou, Lei Ji, Linchao Zhu, Yi Yang, Mike Zheng Shou:
MIST : Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering. CVPR 2023: 14773-14783
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/IlaslanSCGLXLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/IlaslanSCGLXLS23
Muhammet Ilaslan, Chenan Song, Joya Chen, Difei Gao, Weixian Lei, Qianli Xu, Joo Lim, Mike Zheng Shou:
GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations. EMNLP 2023: 10462-10479
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/LinZCPGWYS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/LinZCPGWYS23
Kevin Qinghong Lin, Pengchuan Zhang, Joya Chen, Shraman Pramanick, Difei Gao, Alex Jinpeng Wang, Rui Yan, Mike Zheng Shou:
UniVTG: Towards Unified Video-Language Temporal Grounding. ICCV 2023: 2782-2792
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/SinghLSLGT0SKZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/SinghLSLGT0SKZ23
Parantak Singh, You Li, Ankur Sikarwar, Weixian Lei, Difei Gao, Morgan B. Talbot, Ying Sun, Mike Zheng Shou, Gabriel Kreiman, Mengmi Zhang:
Learning to Learn: How to Continuously Teach Humans and Machines. ICCV 2023: 11674-11685
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01740
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01740
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Zheng Qin, Mike Zheng Shou:
DeepfakeMAE: Facial Part Consistency Aware Masked Autoencoder for Deepfake Video Detection. CoRR abs/2303.01740 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-14644
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-14644
Joya Chen, Difei Gao, Kevin Qinghong Lin, Mike Zheng Shou:
Affordance Grounding from Demonstration Video to Target Image. CoRR abs/2303.14644 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-05943
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-05943
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Qian Wang, Zheng Qin, Mike Zheng Shou:
Mover: Mask and Recovery based Facial Part Consistency Aware Method for Deepfake Video Detection. CoRR abs/2305.05943 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-08640
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-08640
Difei Gao, Lei Ji, Luowei Zhou, Kevin Qinghong Lin, Joya Chen, Zihan Fan, Mike Zheng Shou:
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn. CoRR abs/2306.08640 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-15255
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-15255
Zhijian Hou, Lei Ji, Difei Gao, Wanjun Zhong, Kun Yan, Chao Li, Wing-Kwong Chan, Chong-Wah Ngo, Nan Duan, Mike Zheng Shou:
GroundNLQ @ Ego4D Natural Language Queries Challenge 2023. CoRR abs/2306.15255 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-16715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-16715
Kevin Qinghong Lin, Pengchuan Zhang, Joya Chen, Shraman Pramanick, Difei Gao, Alex Jinpeng Wang, Rui Yan, Mike Zheng Shou:
UniVTG: Towards Unified Video-Language Temporal Grounding. CoRR abs/2307.16715 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-09921
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-09921
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Qian Wang, Zheng Qin, Mike Zheng Shou:
Recap: Detecting Deepfake Video with Unpredictable Tampered Traces via Recovering Faces and Mapping Recovered Faces. CoRR abs/2308.09921 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-15818
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15818
David Junhao Zhang, Jay Zhangjie Wu, Jia-Wei Liu, Rui Zhao, Lingmin Ran, Yuchao Gu, Difei Gao, Mike Zheng Shou:
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation. CoRR abs/2309.15818 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-16003
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-16003
Jay Zhangjie Wu, Xiuyu Li, Difei Gao, Zhen Dong, Jinbin Bai, Aishani Singh, Xiaoyu Xiang, Youzeng Li, Zuwei Huang, Yuanxi Sun, Rui He, Feng Hu, Junhua Hu, Hai Huang, Hanyu Zhu, Xu Cheng, Jie Tang, Mike Zheng Shou, Kurt Keutzer, Forrest N. Iandola:
CVPR 2023 Text Guided Video Editing Competition. CoRR abs/2310.16003 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-16081
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-16081
Weixian Lei, Yixiao Ge, Kun Yi, Jianfeng Zhang, Difei Gao, Dylan Sun, Yuying Ge, Ying Shan, Mike Zheng Shou:
ViT-Lens-2: Gateway to Omni-modal Intelligence. CoRR abs/2311.16081 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-13108
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-13108
Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou:
ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation. CoRR abs/2312.13108 (2023)
2022
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/WongCWLMGS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/WongCWLMGS22
Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou:
AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant. ECCV (36) 2022: 485-501
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/WangGYLFS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/WangGYLFS22
Yuxuan Wang, Difei Gao, Licheng Yu, Weixian Lei, Matt Feiszli, Mike Zheng Shou:
GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval. ECCV (35) 2022: 709-725
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/LeiGWMLRS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LeiGWMLRS22
Weixian Lei, Difei Gao, Yuxuan Wang, Dongxing Mao, Zihan Liang, Lingmin Ran, Mike Zheng Shou:
AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant. EMNLP (Findings) 2022: 319-338
[c5]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/LinWSWYXGTZKCWD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LinWSWYXGTZKCWD22
Kevin Qinghong Lin, Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rong-Cheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou:
Egocentric Video-Language Pretraining. NeurIPS 2022
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-04203
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-04203
Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou:
AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant. CoRR abs/2203.04203 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-00486
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-00486
Yuxuan Wang, Difei Gao, Licheng Yu, Stan Weixian Lei, Matt Feiszli, Mike Zheng Shou:
GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval. CoRR abs/2204.00486 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-01670
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-01670
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rong-Cheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou:
Egocentric Video-Language Pretraining. CoRR abs/2206.01670 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-01622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-01622
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rong-Cheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou:
Egocentric Video-Language Pretraining @ Ego4D Challenge 2022. CoRR abs/2207.01622 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-12037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-12037
Stan Weixian Lei, Difei Gao, Jay Zhangjie Wu, Yuxuan Wang, Wei Liu, Mengmi Zhang, Mike Zheng Shou:
Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task. CoRR abs/2208.12037 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-10918
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-10918
Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, Wing Kwong Chan, Chong-Wah Ngo, Zheng Shou, Nan Duan:
CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding. CoRR abs/2209.10918 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08776
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08776
Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, Wing Kwong Chan, Chong-Wah Ngo, Zheng Shou, Nan Duan:
An Efficient COarse-to-fiNE Alignment Framework @ Ego4D Natural Language Queries Challenge 2022. CoRR abs/2211.08776 (2022)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-09522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-09522
Difei Gao, Luowei Zhou, Lei Ji, Linchao Zhu, Yi Yang, Mike Zheng Shou:
MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering. CoRR abs/2212.09522 (2022)
2021
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/Gao0B021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/Gao0B021
Difei Gao, Ruiping Wang, Ziyi Bai, Xilin Chen:
Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments. ICCV 2021: 1655-1665
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-15050
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-15050
Stan Weixian Lei, Yuxuan Wang, Dongxing Mao, Difei Gao, Mike Zheng Shou:
AssistSR: Affordance-centric Question-driven Video Segment Retrieval. CoRR abs/2111.15050 (2021)
2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/GaoWSC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/GaoWSC20
Difei Gao, Ruiping Wang, Shiguang Shan, Xilin Chen:
Learning to Recognize Visual Concepts for Visual Question Answering With Structural Label Space. IEEE J. Sel. Top. Signal Process. 14(3): 494-505 (2020)
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/GaoL0SC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/GaoL0SC20
Difei Gao, Ke Li, Ruiping Wang, Shiguang Shan, Xilin Chen:
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text. CVPR 2020: 12743-12753
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2003-13962
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-13962
Difei Gao, Ke Li, Ruiping Wang, Shiguang Shan, Xilin Chen:
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text. CoRR abs/2003.13962 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1908-02962
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-02962
Difei Gao, Ruiping Wang, Shiguang Shan, Xilin Chen:
From Two Graphs to N Questions: A VQA Dataset for Compositional Reasoning on Vision and Commonsense. CoRR abs/1908.02962 (2019)
2017
[c2]
- view
  - electronic edition @ dropbox.com (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/bmvc/Gao0SC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/Gao0SC17
Difei Gao, Ruiping Wang, Shiguang Shan, Xilin Chen:
Visual Textbook Network: Watch Carefully before Answering Visual Questions. BMVC 2017
2015
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/GaoPLCX15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/GaoPLCX15
Difei Gao, Lili Pan, Risheng Liu, Rui Chen, Mei Xie:
Correlated warped Gaussian processes for gender-specific age estimation. ICIP 2015: 133-137

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.