Nunchaku is a high-performance inference engine optimized for 4-bit neural networks (github.com)
from yogthos@lemmy.ml to technology@lemmy.ml on 27 Dec 20:40
https://lemmy.ml/post/40890196

#technology

threaded - newest