Skip to content
View hkr04's full-sized avatar

Block or report hkr04

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Learning material for CMU10-714: Deep Learning System

Jupyter Notebook 211 35 Updated May 12, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 9,254 647 Updated Sep 1, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,652 435 Updated Jun 22, 2024

A Survey on Large Language Model-Based Game Agents

246 10 Updated Sep 22, 2024

QuickJS是一个小型并且可嵌入的Javascript引擎,它支持ES2020规范,包括模块,异步生成器和代理器。

C 3,124 298 Updated Feb 7, 2024

QuickJS C++ wrapper

C 18 7 Updated Jul 14, 2019

LLM inference in C/C++

C++ 65,715 9,434 Updated Oct 2, 2024

Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"

Python 251 18 Updated Aug 3, 2023
SAS 361 32 Updated Sep 27, 2023

An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.

Python 48 5 Updated Feb 17, 2021

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 6,996 692 Updated Sep 29, 2024

A simple, easy-to-hack GraphRAG implementation

Python 849 89 Updated Oct 1, 2024
Python 7 Updated Sep 22, 2024
Python 42 7 Updated Jun 18, 2024

codes for the paper "POMO: Policy Optimization with Multiple Optima for Reinforcement Learning"

Python 145 40 Updated Oct 2, 2022

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Python 400 72 Updated Sep 18, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 2,671 233 Updated Sep 30, 2024

[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623

Python 53 4 Updated Sep 26, 2024

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 16,981 4,145 Updated Sep 30, 2024

[ICML'24 FM-Wild Oral] RouteFinder: Towards Foundation Models for Vehicle Routing Problems

Python 41 3 Updated Oct 2, 2024

More than 50+ collections of Thai Natural Language Processing libraries. Update daily.

378 73 Updated Apr 9, 2023

GraphLLM: Boosting Graph Reasoning Ability of Large Language Model

Python 86 8 Updated Dec 14, 2023

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Python 1,911 172 Updated May 15, 2024

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,002 366 Updated Sep 26, 2024

Lexicon for Chinese lexical analyzing, 中文语言分词词库

Python 115 21 Updated Nov 18, 2021

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Python 150 15 Updated May 23, 2024

This project is implementation code of AlphaStar

Python 186 26 Updated Jan 19, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,073 110 Updated Sep 30, 2024

GTP engine and self-play learning in Go

C++ 3,495 564 Updated Aug 29, 2024
Next