


Guo Chen 0006
Person information
- affiliation: Nanjing University, State Key Laboratory for Novel Software Technology, China
Other persons with the same name
- Guo Chen: disambiguation page
- Guo Chen 0001: Hunan University, China
- Guo Chen 0002: University of Newcastle, School of Electrical Engineering and Computing, NSW, Australia (and 3 more)
- Guo Chen 0003: University of Ottawa, Canada (and 1 more)
- Guo Chen 0004: Wuhan University, Wuhan, Hubei, China
- Guo Chen 0005: Central South University, School of Automation, China
2020 – today
- 2025
- [c13] Guo Chen, Yicheng Liu, Yifei Huang, Baoqi Pei, Jilan Xu, Yuping He, Tong Lu, Yali Wang, Limin Wang:
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding. ICLR 2025
- [c12] Baoqi Pei, Yifei Huang, Jilan Xu, Guo Chen, Yuping He, Lijin Yang, Yali Wang, Weidi Xie, Yu Qiao, Fei Wu, Limin Wang:
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning. ICLR 2025
- [c11] Jilan Xu, Yifei Huang, Baoqi Pei, Junlin Hou, Qingqiu Li, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie:
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos. ICLR 2025
- [i30] Zhiqi Li, Guo Chen, Shilong Liu, Shihao Wang, Vibashan VS, Yishen Ji, Shiyi Lan, Hao Zhang, Yilin Zhao, Subhashree Radhakrishnan, Nadine Chang, Karan Sapra, Amala Sanjay Deshmukh, Tuomas Rintamaki, Matthieu Le, Ilia Karmanov, Lukas Voegtle, Philipp Fischer, De-An Huang, Timo Roman, Tong Lu, José M. Álvarez, Bryan Catanzaro, Jan Kautz, Andrew Tao, Guilin Liu, Zhiding Yu:
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models. CoRR abs/2501.14818 (2025)
- [i29] Baoqi Pei, Yifei Huang, Jilan Xu, Guo Chen, Yuping He, Lijin Yang, Yali Wang, Weidi Xie, Yu Qiao, Fei Wu, Limin Wang:
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning. CoRR abs/2503.00986 (2025)
- [i28] Jindong Jiang, Xiuyu Li, Zhijian Liu, Muyang Li, Guo Chen, Zhiqi Li, De-An Huang, Guilin Liu, Zhiding Yu, Kurt Keutzer, Sungjin Ahn, Jan Kautz, Hongxu Yin, Yao Lu, Song Han, Wonmin Byeon:
Token-Efficient Long Video Understanding for Multimodal LLMs. CoRR abs/2503.04130 (2025)
- [i27] Yifei Huang, Jilan Xu, Baoqi Pei, Yuping He, Guo Chen, Mingfang Zhang, Lijin Yang, Zheng Nie, Jinyao Liu, Guoshun Fan, Dechen Lin, Fang Fang, Kunpeng Li, Chang Yuan, Xinyuan Chen, Yaohui Wang, Yali Wang, Yu Qiao, Limin Wang:
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant. CoRR abs/2503.04250 (2025)
- [i26] Aaron Blakeman, Aarti Basant, Abhinav Khattar, Adithya Renduchintala, Akhiad Bercovich, Aleksander Ficek, Alexis Bjorlin, Ali Taghibakhshi, Amala Sanjay Deshmukh, Ameya Sunil Mahabaleshwarkar, Andrew Tao, Anna Shors, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Bobby Chen, Boris Ginsburg, Boxin Wang, Brandon Norick, Brian Butterfield, Bryan Catanzaro, Carlo del Mundo, Chengyu Dong, Christine Harvey, Christopher Parisien, Dan Su, Daniel Korzekwa, Danny Yin, Daria Gitman, David Mosallanezhad, Deepak Narayanan, Denys Fridman, Dima Rekesh, Ding Ma, Dmytro Pykhtar, Dong Ahn, Duncan Riach, Dusan Stosic, Eileen Long, Elad Segal, Ellie Evans, Eric Chung, Erick Galinkin, Evelina Bakhturina, Ewa Dobrowolska, Fei Jia, Fuxiao Liu, Gargi Prasad, Gerald Shen, Guilin Liu, Guo Chen, Haifeng Qian, Helen Ngo, Hongbin Liu, Hui Li, Igor Gitman, Ilia Karmanov, Ivan Moshkov, Izik Golan, Jan Kautz, Jane Polak Scowcroft, Jared Casper, Jarno Seppänen, Jason Lu, Jason Sewall, Jiaqi Zeng, Jiaxuan You, Jimmy Zhang, Jing Zhang, Jining Huang, Jinze Xue, Jocelyn Huang, Joey Conway, John Kamalu, Jon Barker, Jonathan M. Cohen, Joseph Jennings, Jupinder Parmar, Karan Sapra, Kari Briski, Kateryna Chumachenko, Katherine Luna, Keshav Santhanam, Kezhi Kong, Kirthi Sivamani, Krzysztof Pawelec, Kumar Anik, Kunlun Li, Lawrence McAfee, Leon Derczynski, Lindsey Pavao, Luis Vega, Lukas Voegtle, Maciej Bala, Maer Rodrigues de Melo, Makesh Narsimhan Sreedhar, Marcin Chochowski, Markus Kliegl:
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models. CoRR abs/2504.03624 (2025)
- [i25] Jilan Xu, Yifei Huang, Baoqi Pei, Junlin Hou, Qingqiu Li, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie:
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos. CoRR abs/2504.11732 (2025)
- [i24] Guo Chen, Zhiqi Li, Shihao Wang, Jindong Jiang, Yicheng Liu, Lidong Lu, De-An Huang, Wonmin Byeon, Matthieu Le, Tuomas Rintamaki, Tyler Poon, Max Ehrlich, Tong Lu, Limin Wang, Bryan Catanzaro, Jan Kautz, Andrew Tao, Zhiding Yu, Guilin Liu:
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models. CoRR abs/2504.15271 (2025)
- [i23] Lidong Lu, Guo Chen, Zhiqi Li, Yicheng Liu, Tong Lu:
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs. CoRR abs/2506.05328 (2025)
- [i22] Yuping He, Yifei Huang, Guo Chen, Lidong Lu, Baoqi Pei, Jilan Xu, Tong Lu, Yoichi Sato:
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision. CoRR abs/2506.06253 (2025)
- [i21] Shihao Wang, Guo Chen, De-An Huang, Zhiqi Li, Minghan Li, Guilin Li, José M. Álvarez, Lei Zhang, Zhiding Yu:
VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding. CoRR abs/2507.13353 (2025)
- [i20] Yuping He, Yifei Huang, Guo Chen, Baoqi Pei, Jilan Xu, Tong Lu, Jiangmiao Pang:
EgoExoBench: A Benchmark for First- and Third-person View Video Understanding in MLLMs. CoRR abs/2507.18342 (2025)
- 2024
- [j2] Yifei Huang, Lijin Yang, Guo Chen, Hongjie Zhang, Feng Lu, Yoichi Sato:
Matching Compound Prototypes for Few-Shot Action Recognition. Int. J. Comput. Vis. 132(9): 3977-4002 (2024)
- [c10] Jilan Xu, Yifei Huang, Junlin Hou, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie:
Retrieval-Augmented Egocentric Video Captioning. CVPR 2024: 13525-13536
- [c9] Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, Limin Wang, Yu Qiao:
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World. CVPR 2024: 22072-22086
- [c8] Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao:
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. CVPR 2024: 22195-22206
- [c7] Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CVPR 2024: 24185-24198
- [c6] Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Jilan Xu, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang:
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding. ECCV (85) 2024: 396-416
- [c5] Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinhao Li, Guo Chen, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. ICLR 2024
- [i19] Jilan Xu, Yifei Huang, Junlin Hou, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie:
Retrieval-Augmented Egocentric Video Captioning. CoRR abs/2401.00789 (2024)
- [i18] Guo Chen, Yifei Huang, Jilan Xu, Baoqi Pei, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang:
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding. CoRR abs/2403.09626 (2024)
- [i17] Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang:
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding. CoRR abs/2403.15377 (2024)
- [i16] Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, Limin Wang, Yu Qiao:
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World. CoRR abs/2403.16182 (2024)
- [i15] Baoqi Pei, Guo Chen, Jilan Xu, Yuping He, Yicheng Liu, Kanghua Pan, Yifei Huang, Yali Wang, Tong Lu, Limin Wang, Yu Qiao:
EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation. CoRR abs/2406.18070 (2024)
- [i14] Guo Chen, Yicheng Liu, Yifei Huang, Yuping He, Baoqi Pei, Jilan Xu, Yali Wang, Tong Lu, Limin Wang:
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding. CoRR abs/2412.12075 (2024)
- [i13] Yifei Huang, Jilan Xu, Baoqi Pei, Yuping He, Guo Chen, Lijin Yang, Xinyuan Chen, Yaohui Wang, Zheng Nie, Jinyao Liu, Guoshun Fan, Dechen Lin, Fang Fang, Kunpeng Li, Chang Yuan, Yali Wang, Yu Qiao, Limin Wang:
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model. CoRR abs/2412.21080 (2024)
- 2023
- [j1] Min Yang, Guo Chen, Yin-Dong Zheng, Tong Lu, Limin Wang:
BasicTAD: An astounding RGB-Only baseline for temporal action detection. Comput. Vis. Image Underst. 232: 103692 (2023)
- [c4] Jiahao Wang, Guo Chen, Yifei Huang, Limin Wang, Tong Lu:
Memory-and-Anticipation Transformer for Online Action Understanding. ICCV 2023: 13778-13789
- [c3] Guo Chen, Yin-Dong Zheng, Zhe Chen, Jiahao Wang, Tong Lu:
ELAN: Enhancing Temporal Action Detection with Location Awareness. ICME 2023: 1020-1025
- [c2] Yin-Dong Zheng, Guo Chen, Minglei Yuan, Tong Lu:
MRSN: Multi-Relation Support Network for Video Action Detection. ICME 2023: 1026-1031
- [i12] Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu:
Champion Solution for the WSDM2023 Toloka VQA Challenge. CoRR abs/2301.09045 (2023)
- [i11] Yin-Dong Zheng, Guo Chen, Minglei Yuan, Tong Lu:
MRSN: Multi-Relation Support Network for Video Action Detection. CoRR abs/2304.11975 (2023)
- [i10] Guo Chen, Yin-Dong Zheng, Jiahao Wang, Jilan Xu, Yifei Huang, Junting Pan, Yi Wang, Yali Wang, Yu Qiao, Tong Lu, Limin Wang:
VideoLLM: Modeling Video Sequence with Large Language Models. CoRR abs/2305.13292 (2023)
- [i9] Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu:
AVSegFormer: Audio-Visual Segmentation with Transformer. CoRR abs/2307.01146 (2023)
- [i8] Jiahao Wang, Guo Chen, Yifei Huang, Limin Wang, Tong Lu:
Memory-and-Anticipation Transformer for Online Action Understanding. CoRR abs/2308.07893 (2023)
- [i7] Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao:
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. CoRR abs/2311.17005 (2023)
- [i6] Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CoRR abs/2312.14238 (2023)
- 2022
- [c1] Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu:
DCAN: Improving Temporal Action Detection via Dual Context Aggregation. AAAI 2022: 248-257
- [i5] Min Yang, Guo Chen, Yin-Dong Zheng, Tong Lu, Limin Wang:
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection. CoRR abs/2205.02717 (2022)
- [i4] Yin-Dong Zheng, Guo Chen, Jiahao Wang, Tong Lu, Limin Wang:
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022. CoRR abs/2211.08728 (2022)
- [i3] Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao:
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges. CoRR abs/2211.09529 (2022)
- [i2] Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang, Zhiyu Zhao, Hongjie Zhang, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Jiashuo Yu, Yali Wang, Limin Wang, Yu Qiao:
InternVideo: General Video Foundation Models via Generative and Discriminative Learning. CoRR abs/2212.03191 (2022)
- 2021
- [i1] Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu:
DCAN: Improving Temporal Action Detection via Dual Context Aggregation. CoRR abs/2112.03612 (2021)
last updated on 2025-08-22 23:26 CEST by the dblp team
all metadata released as open data under CC0 1.0 license