-
minimind_New Public
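Forked from jingyaogong/minimind
[LLM] Train a small 26M-parameter GPT completely from scratch in 3 hours; a GPU with as little as 2 GB of memory is enough for inference and training!
Python Apache License 2.0 Updated Sep 12, 2024 -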
GRUtopia Public
Forked from OpenRobotLab/GRUtopia
GRUtopia: Dream General Robots in a City at Scale
Python MIT License Updated Sep 5, 2024 -
Zero-Chatgpt Public
Forked from AI-Study-Han/Zero-Chatgpt
Starting from scratch, run through the ChatGPT technical pipeline end to end.
Python Updated Sep 5, 2024 -
Book-Mathematical-Foundation-of-Reinforcement-Learning Public
Forked from MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
MATLAB Updated Sep 1, 2024 -
ReKep Public
Forked from huangwl18/ReKep
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
Python Updated Aug 30, 2024 -
build_MiniLLM_from_scratch Public
Forked from Tongjilibo/build_MiniLLM_from_scratch
Build a MiniLLM from 0 to 1 (pretrain + SFT + DPO, in progress)
Python MIT License Updated Aug 29, 2024 -
tiny-llm-zh_LLM Public
Forked from wdndev/tiny-llm-zh
Implement a small-parameter Chinese large language model from scratch.
Python Updated Aug 22, 2024 -
CLIP Public
Forked from openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Jupyter Notebook MIT License Updated Jul 23, 2024 -
mobile-aloha Public
Forked from MarkFzp/mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
Jupyter Notebook MIT License Updated Jun 22, 2024 -
baby-llama2-chinese Public
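Forked from DLLXW/baby-llama2-chinese
A repository for pretraining from scratch and SFT of a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to run it and obtain a chat-llama2 with simple Chinese question-answering ability.
Python MIT License Updated May 21, 2024 -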
act-plus-plus Public
Forked from MarkFzp/act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Python MIT License Updated May 15, 2024 -
ChatLM-mini-Chinese Public
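Forked from charent/ChatLM-mini-Chinese
A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, RLHF optimization, and more. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple information extraction.
Python Apache License 2.0 Updated Apr 20, 2024 -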
gym Public
Forked from openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Python Other Updated Aug 5, 2022