Haotian Liu

Cited by

	All	Since 2019
Citations	8597	8594
h-index	16	16
i10-index	18	18

Citations per year: 2022: 44; 2023: 968; 2024: 7162

Public access (based on funding mandates): 8 articles available, 0 articles not available

Co-authors

Yong Jae Lee	Associate Professor of Computer Sciences, UW-Madison	Verified email at wisc.edu
Chunyuan Li	xAI	Verified email at x.ai
Jianwei Yang	Principal Researcher, Microsoft Research, Redmond	Verified email at microsoft.com
Yuheng Li	University of Wisconsin-Madison	Verified email at wisc.edu
Jianfeng Gao	Microsoft Research, Redmond	Verified email at microsoft.com
Mu Cai	University of Wisconsin-Madison	Verified email at cs.wisc.edu
Sheng Shen	UC Berkeley	Verified email at berkeley.edu
Fanyi Xiao	Meta AI	Verified email at ucdavis.edu
Xueyan Zou	PostDoc at UC San Diego	Verified email at wisc.edu

Haotian Liu

xAI

Verified email at x.ai

Computer Vision, Machine Learning


Title	Authors	Venue	Cited by	Year
Visual instruction tuning	H Liu, C Li, Q Wu, YJ Lee	Advances in neural information processing systems 36, 2024	4513	2024
Improved Baselines with Visual Instruction Tuning	H Liu, C Li, Y Li, YJ Lee	CVPR 2024, 2023	1523	2023
GLIGEN: Open-Set Grounded Text-to-Image Generation	Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee	CVPR 2023, 2023	600*	2023
Llava-med: Training a large language-and-vision assistant for biomedicine in one day	C Li, C Wong, S Zhang, N Usuyama, H Liu, J Yang, T Naumann, H Poon, ...	NeurIPS 2023, Datasets and Benchmarks Track, 2023	511	2023
Llava-next: Improved reasoning, ocr, and world knowledge	H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee	https://llava-vl.github.io/blog/2024-01-30-llava-next, 2024	469*	2024
Aligning Large Multimodal Models with Factually Augmented RLHF	Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ...	arXiv preprint arXiv:2309.14525, 2023	203	2023
Masked discrimination for self-supervised learning on point clouds	H Liu, M Cai, YJ Lee	Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022	156	2022
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models	C Li, H Liu, LH Li, P Zhang, J Aneja, J Yang, P Jin, H Hu, Z Liu, YJ Lee, ...	NeurIPS 2022, Datasets and Benchmarks Track, 2022	139	2022
YolactEdge: Real-time Instance Segmentation on the Edge	H Liu, RAR Soto, F Xiao, YJ Lee	2021 IEEE International Conference on Robotics and Automation (ICRA), 9579-9585, 2021	91	2021
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents	S Liu, H Cheng, H Liu, H Zhang, F Li, T Ren, X Zou, J Yang, H Su, J Zhu, ...	arXiv preprint arXiv:2311.05437, 2023	84	2023
LLaVA-NeXT: A Strong Zero-shot Video Understanding Model	Y Zhang, B Li, H Liu, YJ Lee, L Gui, D Fu, J Feng, Z Liu, C Li	https://llava-vl.github.io/blog/2024-04-30-llava-next-video/, 2024	68*	2024
Making large multimodal models understand arbitrary visual prompts	M Cai, H Liu, SK Mustikovela, GP Meyer, Y Chai, D Park, YJ Lee	CVPR 2024, 2023	63*	2023
Learning Customized Visual Models with Retrieval-Augmented Knowledge	H Liu, K Son, J Yang, C Liu, J Gao, YJ Lee, C Li	CVPR 2023, 2023	51*	2023
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models	Y Lu, C Li, H Liu, J Yang, J Gao, Y Shen	arXiv preprint arXiv:2309.09958, 2023	30	2023
Identity from here, Pose from there: Self-supervised Disentanglement and Generation of Objects using Unlabeled Videos	F Xiao, H Liu, YJ Lee	Proceedings of the IEEE International Conference on Computer Vision, 7013-7022, 2019	22	2019
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs	Z Wang, M Xia, L He, H Chen, Y Liu, R Zhu, K Liang, X Wu, H Liu, ...	arXiv preprint arXiv:2406.18521, 2024	17	2024
Operation strategy of public building: Implications from trade-off between carbon emission and occupant satisfaction	Y Chen, H Liu, L Shi	Journal of Cleaner Production 205, 629-644, 2018	13	2018
Generate Anything Anywhere in Any Scene	Y Li, H Liu, Y Wen, YJ Lee	arXiv preprint arXiv:2306.17154, 2023	12	2023
Fantastic Copyrighted Beasts and How (Not) to Generate Them	L He, Y Huang, W Shi, T Xie, H Liu, Y Wang, L Zettlemoyer, C Zhang, ...	arXiv preprint arXiv:2406.14526, 2024	8	2024
Computer Vision on the Edge: Individual Cattle Identification in Real-Time With ReadMyCow System	M Smink, H Liu, D Döpfer, YJ Lee	Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024	8	2024

Articles 1–20