Starred repositories
Learning material for CMU 10-714: Deep Learning Systems
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
A Survey on Large Language Model-Based Game Agents
QuickJS is a small, embeddable JavaScript engine. It supports the ES2020 specification, including modules, asynchronous generators, and proxies.
Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"
An official implementation of the "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
A simple, easy-to-hack GraphRAG implementation
Code for the paper "POMO: Policy Optimization with Multiple Optima for Reinforcement Learning"
A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
[ICML'24 FM-Wild Oral] RouteFinder: Towards Foundation Models for Vehicle Routing Problems
More than 50 collections of Thai Natural Language Processing libraries. Updated daily.
GraphLLM: Boosting Graph Reasoning Ability of Large Language Model
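PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to help you sort out the algorithm theory, straighten out the code logic, and master practical decision AI applications)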
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
[NeurIPS 2023] We use large language models as a commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
This project is an implementation of AlphaStar
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)