This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 6,996 692 Updated Sep 29, 2024

gusye1234 / nano-graphrag

A simple, easy-to-hack GraphRAG implementation

Python 849 89 Updated Oct 1, 2024

gingasan / delta-engine

Python 7 Updated Sep 22, 2024

BrendanGraham14 / mcts-llm

Python 42 7 Updated Jun 18, 2024

yd-kwon / POMO

codes for the paper "POMO: Policy Optimization with Multiple Optima for Reinforcement Learning"

Python 145 40 Updated Oct 2, 2022

ai4co / rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Python 400 72 Updated Sep 18, 2024

poloclub / transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 2,671 233 Updated Sep 30, 2024

sail-sg / scaling-with-vocab

[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623

Python 53 4 Updated Sep 26, 2024

Unity-Technologies / ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 16,981 4,145 Updated Sep 30, 2024

ai4co / routefinder

[ICML'24 FM-Wild Oral] RouteFinder: Towards Foundation Models for Vehicle Routing Problems

Python 41 3 Updated Oct 2, 2024

kobkrit / nlp_thai_resources

More than 50+ collections of Thai Natural Language Processing libraries. Update daily.

378 73 Updated Apr 9, 2023

mistyreed63849 / Graph-LLM

GraphLLM: Boosting Graph Reasoning Ability of Large Language Model

Python 86 8 Updated Dec 14, 2023

opendilab / PPOxFamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

Python 1,911 172 Updated May 15, 2024

opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,002 366 Updated Sep 26, 2024

limccn / cacl2

Lexicon for Chinese lexical analyzing, 中文语言分词词库

Python 115 21 Updated Nov 18, 2021

1989Ryan / llm-mcts

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Python 150 15 Updated May 23, 2024

kimbring2 / AlphaStar_Implementation

This project is implementation code of AlphaStar

Python 186 26 Updated Jan 19, 2024

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,073 110 Updated Sep 30, 2024

lightvector / KataGo

GTP engine and self-play learning in Go

C++ 3,495 564 Updated Aug 29, 2024

google-deepmind / alphastar

Python 397 50 Updated Sep 8, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hkr04

Block or report hkr04

Starred repositories

PKUFlyingPig / CMU10-714

srush / GPU-Puzzles

princeton-nlp / tree-of-thought-llm

git-disl / awesome-LLM-game-agent-papers

quickjs-zh / QuickJS

quickjs-zh / quickjspp

ggerganov / llama.cpp

CraftJarvis / MC-Planner

Cranial-XIX / llm-pddl

VProv / BPE-Dropout

NirDiamant / RAG_Techniques