Tencent just released WeDLM 8B, it's a diffusion language model that runs 3-6× faster than vLLM-optimized Qwen3-8B on math reasoning tasks. (huggingface.co)
from yogthos@lemmy.ml to technology@lemmy.ml on 29 Dec 17:59
https://lemmy.ml/post/40970102

instruct model huggingface.co/tencent/WeDLM-8B-Instruct

#technology

threaded - newest