- New Delhi, India
Block or Report
Block or report saivig
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (32)
Sort Name ascending (A-Z)
awesome
BigData
books
Databases
Datasets
interview
java
ML-Active Learning
ML-Audio
ML-Auto
ML-BIO
ML-Causal
ML-Data
ML-EVAL
ML-GAN
ML-GPT
ML-Health
ML-Infra
ML-Interpretability
ML-Misc
ML-MultiModal
ML-Optimiztion
ML-Privacy
Ml-RecSys
ML-Safety
ML-Text
ML-Tools
ML-TS
ML-Vision
Python
Tools
Web
Language
Sort by: Recently starred
Starred repositories
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
Multilingual Large Language Models Evaluation Benchmark
The official evaluation suite and dynamic data release for MixEval.
Official repository of RankEval: An Evaluation and Analysis Framework for Learning-to-Rank Solutions.
Evaluation suite for large-scale language models.
Python SDK for running evaluations on LLM generated responses
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
Production-Grade Evaluation for LLM-Powered Applications
A curated list of open source repositories for AI Engineers
Evaluation and Tracking for LLM Experiments
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
🗺️ Data Cleaning and Textual Data Visualization 🗺️
Convert Compute And Books Into Instruct-Tuning Datasets (or classifiers)!
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
Manage scalable open LLM inference endpoints in Slurm clusters
Interact, analyze and structure massive text, image, embedding, audio and video datasets
Open source project for data preparation of LLM application builders
High-quality datasets, tools, and concepts for LLM fine-tuning.
Quick exploration into fine tuning florence 2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.
Mass document analytics platform based on LlamaIndex, Pgvector, React and Django.
Repository contains LinkedIn posts about Generative AI knowledge sharing, learning resources and research explanations.
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
A modular graph-based Retrieval-Augmented Generation (RAG) system