Li Bo

Ph.D Student, Computer Science
Nanyang Technological University, Singapore

About Me

I am a third-year PhD student and luckily advised by Prof. Ziwei Liu. My research focuses on multimodal models and building true intelligence.

I am lucky to work with many brilliant researchers in a non-profit research-oriented organization, LMMs-Lab, we share the sincere passion for developing multimodal intelligence.

Email: drluodian[at]gmail[dot]com

Selected Publications

[17]

Bo Li, Yuanhan Zhang, Dong Guo, Renrui Zhang, Feng Li, Hao Zhang, Kaichen Zhang, Yanwei Li, Ziwei Liu, Chunyuan Li LLaVA-OneVision: Easy Visual Task Transfer arXiv preprint arXiv:2408.03326.
[16]

Peiyuan Zhang*, Kaichen Zhang*, Bo Li*, Guangtao Zeng, Jingkang Yang, Yuanhan Zhang, Ziyue Wang, Haoran Tan, Chunyuan Li, Ziwei Liu Long Context Transfer from Language to Vision arXiv preprint arXiv:2406.16852.
[12]

Haotian Liu, Chunyuan Li, Yuheng Li, Bo Li, Yuanhan Zhang, Sheng Shen, Yong Jae Lee LLaVA-NeXT: Improved reasoning, OCR, and world knowledge Technical Blog. [code]
[11]

Bo Li*, Yuanhan Zhang*, Liangyu Chen, Jinghao Wang, Fanyi Pu, Jingkang Yang, Chunyuan Li, Ziwei Liu MIMIC-IT: Multi-modal In-Context Instruction Tuning ArXiv Preprint. [code]
[10]

Bo Li*, Yuanhan Zhang*, Liangyu Chen, Jinghao Wang, Jingkang Yang, Ziwei Liu Otter: A multi-modal model with in-context instruction tuning ArXiv Preprint. [code]
[9]

Liangyu Chen*, Bo Li*, Sheng Shen, Jingkang Yang, Chunyuan Li, Kurt Keutzer, Trevor Darrell, Ziwei Liu Coordinating Multiple Vision-Language Models for Visual Reasoning NeurIPS 2023, In Conference on Neural Information Processing Systems. Short version in ICLR 2023 Workshop on Mathematical and Empirical Understanding of Foundation Models (ME-FoMo).
[8]

Bo Li*, Yifei Shen*, Jingkang Yang, Yezhen Wang, Jiawei Ren, Tong Che, Jun Zhang, Ziwei Liu Sparse Mixture-of-Experts are Domain Generalizable Learners ICLR 2023 (Oral), In International Conference on Representation Learning 2023. [code] Short version in NeurIPS 2022 Workshop on Distribution Shift.
[7]

Yezhen Wang, Tong Che, Bo Li, Kaitao Song, Hengzhi Pei, Yoshua Bengio, Dongsheng Li Your Autoregressive Generative Model Can be Better If You Treat It as an Energy-Based One Under Review.
[6]

Bo Li, Yifei Shen, Yezhen Wang, Wenzhen Zhu, Dongsheng Li, Kurt Keutzer, Han Zhao Invariant information bottleneck for domain generalization AAAI 2022, In Proceedings of the AAAI Conference on Artificial Intelligence. [code]
[5]

Yezhen Wang, Bo Li, Tong Che, Kaiyang Zhou, Ziwei Liu, Dongsheng Li Energy-Based Open-World Uncertainty Modeling for Confidence Calibration ICCV 2021, In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). [code]
[4]

Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, Han Zhao Learning invariant representations and risks for semi-supervised domain adaptation CVPR 2021, In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [code]
[3]

Sicheng Zhao, Bo Li, Pengfei Xu, Xiangyu Yue, Guiguang Ding, Kurt Keutzer MADAN: multi-source adversarial domain aggregation network for domain adaptation IJCV 2021, International Journal of Computer Vision.
[2]

Bo Li, Yezhen Wang, Tong Che, Shanghang Zhang, Yoshua Bengio, Kurt Keutzer Rethinking distributional matching based domain adaptation arXiv preprint arXiv:2006.13352.
[1]

Sicheng Zhao*, Bo Li*, Xiangyu Yue*, Yang Gu, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer Multi-source domain adaptation for semantic segmentation NeurIPS 2019, In Neural Information Processing Systems. [code]

Experiences

I have been fortunately collaborating and doing research at/with

Sep. 2020 - Dec. 2021: Microsoft Research, Shanghai

Supervised by Dr. Dongsheng Li in the beautiful and relaxing WestBud office, with chill and smart colleagues.
Oct. 2019 - Aug. 2020 (remote till May 2021): Berkeley AI Research, CA, USA

Supervised by Prof. Kurt Keutzer and Prof. Sicheng Zhao, Prof. Xiangyu Yue, Prof. Shanghang Zhang and Dr. Colorado Reed. Enjoy the weather and front-tier research atmosphere. Go Cal and Roll on your Golden Bears!
Jan 2020 - Nov 2022: Dr. Tong Che, MILA/Nvidia Research

Great appreciation on guiding me to explore many fascinating ML topics.
May 2020 - Dec. 2021: Prof. Han Zhao, UIUC

Learn to write a paper with machine learning taste.
May 2018 - Oct. 2019: DiDi Visual Perception Team, Beijing

First internship and two papers there.

Professional Services

Talk/Technical Sharing:
- LMMs-Lab Projects@TwelveLabs (2024), Hosted by James Le
- LMMs-Lab Projects@Tiktok (2024)
- Otter & MIMICIT@Alibaba, Damo Academy, Hosted by Dr. Lidong Bing, Sep. 2023.
- Otter & MIMICIT@HITSZ, Hosted by Prof. Rui Shao, Jul. 2023.

Slab@NTU: Cluster Adminstrator (70+ users, 400+ GPUs)

The AI Talk: Organizer

Conference Reviewer / Program Committee:
- ICCV (2021,2023), NeurIPS (2022), BMVC (2023), AAAI (2023), CVPR (2022,2023), AISTATS (2023), ICML (2023).
- Workshop: ICLR 2023 (DG)

Journal Reviewer:
- Pattern Recognition (PR)
- Transactions on Multimedia (TMM)
- Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- International Journal of Computer Vision (IJCV)

Acknowledgements: this website builds on al-folio and Jiaming Song.