News

Notifications You must be signed in to change notification settings Fork 2 ...
【大模型】3小时完全从0训练一个仅有26M的小参数GPT,最低仅需2G显卡即可推理训练!. Contribute to wjl520/minimind_llm_demo development by creating an account on GitHub.