Ph.D. Student, Computer Science
Nanyang Technological University, Singapore


About Me

I am a third-year PhD student, fortunate to be advised by Prof. Ziwei Liu. My research focuses on multimodal models and building true intelligence.

I am lucky to work with many brilliant researchers at LMMs-Lab, a non-profit, research-oriented organization where we share a sincere passion for developing multimodal intelligence.

Email: drluodian[at]gmail[dot]com


Selected Publications
  1. [17]
    Bo Li, Yuanhan Zhang, Dong Guo, Renrui Zhang, Feng Li, Hao Zhang, Kaichen Zhang, Yanwei Li, Ziwei Liu, Chunyuan Li. LLaVA-OneVision: Easy Visual Task Transfer. arXiv preprint arXiv:2408.03326.
  2. [16]
    Peiyuan Zhang*, Kaichen Zhang*, Bo Li*, Guangtao Zeng, Jingkang Yang, Yuanhan Zhang, Ziyue Wang, Haoran Tan, Chunyuan Li, Ziwei Liu. Long Context Transfer from Language to Vision. arXiv preprint arXiv:2406.16852.
  3. [12]
    Haotian Liu, Chunyuan Li, Yuheng Li, Bo Li, Yuanhan Zhang, Sheng Shen, Yong Jae Lee. LLaVA-NeXT: Improved reasoning, OCR, and world knowledge. Technical Blog. [code]
  4. [11]
    Bo Li*, Yuanhan Zhang*, Liangyu Chen, Jinghao Wang, Fanyi Pu, Jingkang Yang, Chunyuan Li, Ziwei Liu. MIMIC-IT: Multi-modal In-Context Instruction Tuning. arXiv preprint. [code]
  5. [10]
    Bo Li*, Yuanhan Zhang*, Liangyu Chen, Jinghao Wang, Jingkang Yang, Ziwei Liu. Otter: A multi-modal model with in-context instruction tuning. arXiv preprint. [code]
  6. [9]
    Liangyu Chen*, Bo Li*, Sheng Shen, Jingkang Yang, Chunyuan Li, Kurt Keutzer, Trevor Darrell, Ziwei Liu. Coordinating Multiple Vision-Language Models for Visual Reasoning. NeurIPS 2023, In Conference on Neural Information Processing Systems. Short version in ICLR 2023 Workshop on Mathematical and Empirical Understanding of Foundation Models (ME-FoMo).
  7. [8]
    Bo Li*, Yifei Shen*, Jingkang Yang, Yezhen Wang, Jiawei Ren, Tong Che, Jun Zhang, Ziwei Liu. Sparse Mixture-of-Experts are Domain Generalizable Learners. ICLR 2023 (Oral), In International Conference on Learning Representations. [code] Short version in NeurIPS 2022 Workshop on Distribution Shift.
  8. [7]
    Yezhen Wang, Tong Che, Bo Li, Kaitao Song, Hengzhi Pei, Yoshua Bengio, Dongsheng Li. Your Autoregressive Generative Model Can be Better If You Treat It as an Energy-Based One. Under Review.
  9. [6]
    Bo Li, Yifei Shen, Yezhen Wang, Wenzhen Zhu, Dongsheng Li, Kurt Keutzer, Han Zhao. Invariant information bottleneck for domain generalization. AAAI 2022, In Proceedings of the AAAI Conference on Artificial Intelligence. [code]
  10. [5]
    Yezhen Wang, Bo Li, Tong Che, Kaiyang Zhou, Ziwei Liu, Dongsheng Li. Energy-Based Open-World Uncertainty Modeling for Confidence Calibration. ICCV 2021, In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). [code]
  11. [4]
    Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, Han Zhao. Learning invariant representations and risks for semi-supervised domain adaptation. CVPR 2021, In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [code]
  12. [3]
    Sicheng Zhao, Bo Li, Pengfei Xu, Xiangyu Yue, Guiguang Ding, Kurt Keutzer. MADAN: multi-source adversarial domain aggregation network for domain adaptation. IJCV 2021, International Journal of Computer Vision.
  13. [2]
    Bo Li, Yezhen Wang, Tong Che, Shanghang Zhang, Yoshua Bengio, Kurt Keutzer. Rethinking distributional matching based domain adaptation. arXiv preprint arXiv:2006.13352.
  14. [1]
    Sicheng Zhao*, Bo Li*, Xiangyu Yue*, Yang Gu, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer. Multi-source domain adaptation for semantic segmentation. NeurIPS 2019, In Neural Information Processing Systems. [code]
Experiences

I have been fortunate to collaborate and do research at/with


Professional Services
  • Talks / Technical Sharing:
    • LMMs-Lab Projects@TwelveLabs (2024), Hosted by James Le
    • LMMs-Lab Projects@TikTok (2024)
    • Otter & MIMIC-IT@Alibaba DAMO Academy, Hosted by Dr. Lidong Bing, Sep. 2023.
    • Otter & MIMIC-IT@HITSZ, Hosted by Prof. Rui Shao, Jul. 2023.

  • Slab@NTU: Cluster Administrator (70+ users, 400+ GPUs)


  • Conference Reviewer / Program Committee:

    • ICCV (2021, 2023), NeurIPS (2022), BMVC (2023), AAAI (2023), CVPR (2022, 2023), AISTATS (2023), ICML (2023).

    • Workshop: ICLR 2023 (DG)


  • Journal Reviewer:

    • Pattern Recognition (PR)
    • Transactions on Multimedia (TMM)
    • Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
    • International Journal of Computer Vision (IJCV)

Acknowledgements: This website builds on al-folio and Jiaming Song's website.