Releases · foldl/chatllm.cpp · GitHub

22 Jan 09:45

foldl

v0.19 Latest

Latest

As always, more models are supported. Note that most of the new models are special in some way.
- Step3-VL: strong vision capability
- GLM-4.7-Flash: strong coding capability
- TranslateGemma: translation
- WeDLM: diffusion with AR
- QWen3-VL-Embedding/Reranker: multimodal embedding
- HY-MT: translation
- GLM-ASR-Nano: ASR
- Qwen3-VL: strong vision capability

Assets 3

27 Dec 02:02

foldl

v0.18

As always, more models are supported.
Windows: prebuilt binary with Vulkan (1.4.335.0). Use -ngl all to run whole model on default GPU.
New server.exe with built-in llama.cpp WebUI

Assets 3

27 Oct 02:04

foldl

v0.17

As always, more models are supported, notably LLaDA2.0.
Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Assets 3

13 Oct 11:15

foldl

v0.16

As always, more models are supported, notably Janus-Pro.
Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Assets 3

07 Sep 10:47

foldl

v0.15

As always, more models are supported.
Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Assets 3

18 Aug 08:32

foldl

v0.14

Fix main_nim.exe: could not download models that are > 2GB due to this.

Assets 3

15 Aug 13:50

foldl

v0.13

As always, more models are supported.
Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Update 2025-08-16: chatllm_win_x64.7z updated due to outdated main.exe.

Assets 3

12 Jun 10:59

foldl

v0.12

As always, more models are supported.
Multimodal: vision & TTS.
Windows: prebuilt binary with Vulkan (1.4.304.1). Use -ngl all to run whole model on default GPU.

Assets 3

11 May 23:57

foldl

v0.11

As always, more models are supported;
Windows: prebuilt binary with Vulkan (1.4.304.1). Use -ngl all to run whole model on default GPU.

Assets 3

20 Apr 23:29

foldl

v0.10

As always, more models are supported;
Windows: prebuilt binary with Vulkan (1.4.304.1). Use -ngl all to run whole model on default GPU.

Assets 3