


Zijia Zhao
2020 – today
- 2025
- [j3] Zijia Zhao, Longteng Guo, Tongtian Yue, Erdong Hu, Shuai Shao, Zehuan Yuan, Hua Huang, Jing Liu:
  ChatSearch: A dataset and a generative retrieval model for general conversational image retrieval. Pattern Recognit. 167: 111696 (2025)
- [j2] Liang Zhao, Zijia Zhao, Ammar Hawbani, Zhi Liu, Zhiyuan Tan, Keping Yu:
  Dynamic Caching Dependency-Aware Task Offloading in Mobile Edge Computing. IEEE Trans. Computers 74(5): 1510-1523 (2025)
- [c11] Zijia Zhao, Yuqi Huo, Tongtian Yue, Longteng Guo, Haoyu Lu, Bingning Wang, Weipeng Chen, Jing Liu:
  Efficient Motion-Aware Video MLLM. CVPR 2025: 24159-24168
- [c10] Yifan Du, Yuqi Huo, Kun Zhou, Zijia Zhao, Haoyu Lu, Han Huang, Xin Zhao, Bingning Wang, Weipeng Chen, Ji-Rong Wen:
  Exploring the Design Space of Visual Context Representation in Video MLLMs. ICLR 2025
- [c9] Zijia Zhao, Haoyu Lu, Yuqi Huo, Yifan Du, Tongtian Yue, Longteng Guo, Bingning Wang, Weipeng Chen, Jing Liu:
  Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs. ICLR 2025
- [i17] Zijia Zhao, Yuqi Huo, Tongtian Yue, Longteng Guo, Haoyu Lu, Bingning Wang, Weipeng Chen, Jing Liu:
  Efficient Motion-Aware Video MLLM. CoRR abs/2503.13016 (2025)
- [i16] Wenxuan Wang, Zijia Zhao, Yisi Zhang, Yepeng Tang, Erdong Hu, Xinlong Wang, Jing Liu:
  Image Difference Grounding with Natural Language. CoRR abs/2504.01952 (2025)
- [i15] Angang Du, Bohong Yin, Bowei Xing, Bowen Qu, Bowen Wang, Cheng Chen, Chenlin Zhang, Chenzhuang Du, Chu Wei, Congcong Wang, Dehao Zhang, Dikang Du, Dongliang Wang, Enming Yuan, Enzhe Lu, Fang Li, Flood Sung, Guangda Wei, Guokun Lai, Han Zhu, Hao Ding, Hao Hu, Hao Yang, Hao Zhang, Haoning Wu, Haotian Yao, Haoyu Lu, Heng Wang, Hongcheng Gao, Huabin Zheng, Jiaming Li, Jianlin Su, Jianzhou Wang, Jiaqi Deng, Jiezhong Qiu, Jin Xie, Jinhong Wang, Jingyuan Liu, Junjie Yan, Kun Ouyang, Liang Chen, Lin Sui, Longhui Yu, Mengfan Dong, Mengnan Dong, Nuo Xu, Pengyu Cheng, Qizheng Gu, Runjie Zhou, Shaowei Liu, Sihan Cao, Tao Yu, Tianhui Song, Tongtong Bai, Wei Song, Weiran He, Weixiao Huang, Weixin Xu, Xiaokun Yuan, Xingcheng Yao, Xingzhe Wu, Xinxing Zu, Xinyu Zhou, Xinyuan Wang, Y. Charles, Yan Zhong, Yang Li, Yangyang Hu, Yanru Chen, Yejie Wang, Yibo Liu, Yibo Miao, Yidao Qin, Yimin Chen, Yiping Bao, Yiqin Wang, Yongsheng Kang, Yuanxin Liu, Yulun Du, Yuxin Wu, Yuzhi Wang, Yuzi Yan, Zaida Zhou, Zhaowei Li, Zhejun Jiang, Zheng Zhang, Zhilin Yang, Zhiqi Huang, Zihao Huang, Zijia Zhao, Ziwei Chen, Zongyu Lin:
  Kimi-VL Technical Report. CoRR abs/2504.07491 (2025)
- [i14] Tongtian Yue, Longteng Guo, Yepeng Tang, Zijia Zhao, Xinxin Zhu, Hua Huang, Jing Liu:
  LaVi: Efficient Large Vision-Language Models via Internal Feature Modulation. CoRR abs/2506.16691 (2025)
- 2024
- [c8] Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu:
  OneDiff: A Generalist Model for Image Difference Captioning. ACCV (3) 2024: 114-130
- [c7] Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu:
  Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions. ACL (Findings) 2024: 762-776
- [c6] Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu:
  SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models. CVPR 2024: 13073-13083
- [c5] Shichen Lu, Longteng Guo, Wenxuan Wang, Zijia Zhao, Tongtian Yue, Jing Liu, Si Liu:
  Collaborative Training of Tiny-Large Vision Language Models. ACM Multimedia 2024: 4928-4937
- [i13] Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu:
  Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions. CoRR abs/2402.11265 (2024)
- [i12] Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu:
  SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models. CoRR abs/2403.13263 (2024)
- [i11] Yanyuan Qiao, Zheng Yu, Longteng Guo, Sihan Chen, Zijia Zhao, Mingzhen Sun, Qi Wu, Jing Liu:
  VL-Mamba: Exploring State Space Models for Multimodal Learning. CoRR abs/2403.13600 (2024)
- [i10] Zijia Zhao, Haoyu Lu, Yuqi Huo, Yifan Du, Tongtian Yue, Longteng Guo, Bingning Wang, Weipeng Chen, Jing Liu:
  Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs. CoRR abs/2406.09367 (2024)
- [i9] Yifan Du, Kun Zhou, Yuqi Huo, Yifan Li, Wayne Xin Zhao, Haoyu Lu, Zijia Zhao, Bingning Wang, Weipeng Chen, Ji-Rong Wen:
  Towards Event-oriented Long Video Understanding. CoRR abs/2406.14129 (2024)
- [i8] Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu:
  OneDiff: A Generalist Model for Image Difference Captioning. CoRR abs/2407.05645 (2024)
- [i7] Yifan Du, Yuqi Huo, Kun Zhou, Zijia Zhao, Haoyu Lu, Han Huang, Wayne Xin Zhao, Bingning Wang, Weipeng Chen, Ji-Rong Wen:
  Exploring the Design Space of Visual Context Representation in Video MLLMs. CoRR abs/2410.13694 (2024)
- [i6] Han Huang, Yuqi Huo, Zijia Zhao, Haoyu Lu, Shu Wu, Bingning Wang, Qiang Liu, Weipeng Chen, Liang Wang:
  Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining. CoRR abs/2410.16166 (2024)
- [i5] Zijia Zhao, Longteng Guo, Tongtian Yue, Erdong Hu, Shuai Shao, Zehuan Yuan, Hua Huang, Jing Liu:
  ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval. CoRR abs/2410.18715 (2024)
- 2023
- [j1] Liang Zhao, Zijia Zhao, Enchao Zhang, Ammar Hawbani, Ahmed Yassin Al-Dubai, Zhiyuan Tan, Amir Hussain:
  A Digital Twin-Assisted Intelligent Partial Offloading Approach for Vehicular Edge Computing. IEEE J. Sel. Areas Commun. 41(11): 3386-3400 (2023)
- [c4] Sihan Chen, Handong Li, Qunbo Wang, Zijia Zhao, Mingzhen Sun, Xinxin Zhu, Jing Liu:
  VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset. NeurIPS 2023
- [c3] Wenbo Jia, Zijia Zhao, Wenzhuo Huang, Yangyang Li, Jie Ling, Bai Chen, Yayi Shen:
  Snake-inspired Swarm Robot Design for Distributed Underwater Search and Rescue. ROBIO 2023: 1-6
- [c2] Zijia Zhao, Longteng Guo, Xingjian He, Shuai Shao, Zehuan Yuan, Jing Liu:
  MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling. SIGIR 2023: 1528-1538
- [i4] Zijia Zhao, Longteng Guo, Tongtian Yue, Sihan Chen, Shuai Shao, Xinxin Zhu, Zehuan Yuan, Jing Liu:
  ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst. CoRR abs/2305.16103 (2023)
- [i3] Sihan Chen, Handong Li, Qunbo Wang, Zijia Zhao, Mingzhen Sun, Xinxin Zhu, Jing Liu:
  VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset. CoRR abs/2305.18500 (2023)
- 2022
- [i2] Zijia Zhao, Longteng Guo, Xingjian He, Shuai Shao, Zehuan Yuan, Jing Liu:
  MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning. CoRR abs/2210.04183 (2022)
- 2021
- [c1] Sihan Chen, Xinxin Zhu, Dongze Hao, Wei Liu, Jiawei Liu, Zijia Zhao, Longteng Guo, Jing Liu:
  MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques. ACM Multimedia 2021: 4853-4857
- [i1] Jing Liu, Xinxin Zhu, Fei Liu, Longteng Guo, Zijia Zhao, Mingzhen Sun, Weining Wang, Hanqing Lu, Shiyu Zhou, Jiajun Zhang, Jinqiao Wang:
  OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation. CoRR abs/2107.00249 (2021)
last updated on 2025-07-22 19:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license