MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Thawakar, Omkar; Vayani, Ashmal; Khan, Salman; Cholakal, Hisham; Anwer, Rao M.; Felsberg, Michael; Baldwin, Tim; Xing, Eric P.; Khan, Fahad Shahbaz

Computer Science > Computation and Language

arXiv:2402.16840 (cs)

[Submitted on 26 Feb 2024]

Title:MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Authors:Omkar Thawakar, Ashmal Vayani, Salman Khan, Hisham Cholakal, Rao M. Anwer, Michael Felsberg, Tim Baldwin, Eric P. Xing, Fahad Shahbaz Khan

View PDF HTML (experimental)

Abstract:"Bigger the better" has been the predominant trend in recent Large Language Models (LLMs) development. However, LLMs do not suit well for scenarios that require on-device processing, energy efficiency, low memory footprint, and response efficiency. These requisites are crucial for privacy, security, and sustainable deployment. This paper explores the "less is more" paradigm by addressing the challenge of designing accurate yet efficient Small Language Models (SLMs) for resource constrained devices. Our primary contribution is the introduction of an accurate and fully transparent open-source 0.5 billion (0.5B) parameter SLM, named MobiLlama, catering to the specific needs of resource-constrained computing with an emphasis on enhanced performance with reduced resource demands. MobiLlama is a SLM design that initiates from a larger model and applies a careful parameter sharing scheme to reduce both the pre-training and the deployment cost. Our work strives to not only bridge the gap in open-source SLMs but also ensures full transparency, where complete training data pipeline, training code, model weights, and over 300 checkpoints along with evaluation codes is available at : this https URL.

Comments:	Code available at : this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.16840 [cs.CL]
	(or arXiv:2402.16840v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.16840

Submission history

From: Omkar Thawakar [view email]
[v1] Mon, 26 Feb 2024 18:59:03 UTC (2,210 KB)

Computer Science > Computation and Language

Title:MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators