Skip to content

Releases: foldl/chatllm.cpp

v0.19

22 Jan 09:45

Choose a tag to compare

  • As always, more models are supported. Note that most of the new models are special in some way.

    • Step3-VL: strong vision capability
    • GLM-4.7-Flash: strong coding capability
    • TranslateGemma: translation
    • WeDLM: diffusion with AR
    • QWen3-VL-Embedding/Reranker: multimodal embedding
    • HY-MT: translation
    • GLM-ASR-Nano: ASR
    • Qwen3-VL: strong vision capability

v0.18

27 Dec 02:02

Choose a tag to compare

  • As always, more models are supported.
  • Windows: prebuilt binary with Vulkan (1.4.335.0). Use -ngl all to run whole model on default GPU.
  • New server.exe with built-in llama.cpp WebUI
image

v0.17

27 Oct 02:04

Choose a tag to compare

  • As always, more models are supported, notably LLaDA2.0.
  • Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

v0.16

13 Oct 11:15

Choose a tag to compare

  • As always, more models are supported, notably Janus-Pro.
  • Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

v0.15

07 Sep 10:47

Choose a tag to compare

  • As always, more models are supported.
  • Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

v0.14

18 Aug 08:32

Choose a tag to compare

  • Fix main_nim.exe: could not download models that are > 2GB due to this.

v0.13

15 Aug 13:50

Choose a tag to compare

  • As always, more models are supported.
  • Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Update 2025-08-16: chatllm_win_x64.7z updated due to outdated main.exe.

v0.12

12 Jun 10:59

Choose a tag to compare

  • As always, more models are supported.
  • Multimodal: vision & TTS.
  • Windows: prebuilt binary with Vulkan (1.4.304.1). Use -ngl all to run whole model on default GPU.

v0.11

11 May 23:57

Choose a tag to compare

  • As always, more models are supported;
  • Windows: prebuilt binary with Vulkan (1.4.304.1). Use -ngl all to run whole model on default GPU.

v0.10

20 Apr 23:29

Choose a tag to compare

  • As always, more models are supported;
  • Windows: prebuilt binary with Vulkan (1.4.304.1). Use -ngl all to run whole model on default GPU.