Skip to content
View KelleyYin's full-sized avatar
😁
Focusing
😁
Focusing
  • AntGroup
  • Shanghai & Suzhou
  • 21:02 (UTC +08:00)

Block or report KelleyYin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 10 1 Updated Oct 15, 2020

DLRover: An Automatic Distributed Deep Learning System

Python 1,218 153 Updated Sep 29, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 238 6 Updated Jul 15, 2024

[ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!

Python 26 Updated Aug 2, 2024

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Python 107 4 Updated Jun 5, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,988 510 Updated Sep 26, 2024

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 975 101 Updated Jul 29, 2024

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,640 365 Updated Sep 28, 2024

🚌 The IK Analysis plugin integrates Lucene IK analyzer into Elasticsearch and OpenSearch, support customized dictionary.

Java 16,509 3,266 Updated Sep 29, 2024

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 347 14 Updated Jul 9, 2024

How to use wandb?

Python 588 49 Updated Sep 5, 2023

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,316 115 Updated Apr 17, 2024

DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services

Python 517 59 Updated Apr 24, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,928 810 Updated Aug 15, 2024

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

Python 638 32 Updated Aug 17, 2024

Official inference library for Mistral models

Jupyter Notebook 9,568 846 Updated Sep 20, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,225 3,823 Updated Sep 29, 2024
Jupyter Notebook 6 1 Updated Apr 8, 2024

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Python 2,189 433 Updated Sep 25, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 4,022 367 Updated Sep 27, 2024

LlamaIndex is a data framework for your LLM applications

Python 35,769 5,055 Updated Sep 29, 2024

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

Python 764 69 Updated Sep 5, 2024
Rust 295 22 Updated Jul 23, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,753 938 Updated Sep 26, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 673 94 Updated Sep 29, 2024

A tool for extracting plain text from Wikipedia dumps

Python 3,738 965 Updated May 23, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,364 466 Updated Sep 28, 2024

library supporting NLP and CV research on scientific papers

Python 679 53 Updated Sep 27, 2024

State-of-the-art 2D and 3D Face Analysis Project

Python 22,962 5,362 Updated Sep 18, 2024
Next