Tencent just released WeDLM 8B, it's a diffusion language model that runs 3-6× faster than vLLM-optimized Qwen3-8B on math reasoning tasks.
(huggingface.co)
from yogthos@lemmy.ml to technology@lemmy.ml on 29 Dec 17:59
https://lemmy.ml/post/40970102
from yogthos@lemmy.ml to technology@lemmy.ml on 29 Dec 17:59
https://lemmy.ml/post/40970102
instruct model huggingface.co/tencent/WeDLM-8B-Instruct
threaded - newest