Nemotron 3 Nano is a 30B parameter hybrid reasoning MoE model with ~3.6B active parameters - built for fast, accurate coding, math and agentic tasks, and has a 1M context window.
(docs.unsloth.ai)
in technology@lemmy.ml from yogthos@lemmy.ml on 15 Dec 20:35
comments (0)
in technology@lemmy.ml from yogthos@lemmy.ml on 15 Dec 20:35
comments (0)
3x Faster LLM Training with Unsloth Kernels + Packing
(docs.unsloth.ai)
in technology@lemmy.ml from yogthos@lemmy.ml on 10 Dec 20:00
comments (0)
in technology@lemmy.ml from yogthos@lemmy.ml on 10 Dec 20:00
comments (0)