MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU (arxiv.org)
from yogthos@lemmy.ml to technology@lemmy.ml on 08 Apr 23:45
https://lemmy.ml/post/45660953

#technology

threaded - newest