A curated list of resources dedicated to retrieval-augmented generation (RAG).
The retrieval-augmented generation (RAG) is to combine the merits of retrieval system and llm to generation high-quality answers for users.
Typically, the rag system consists of a set of modules, where each task are described as follows:
Sub-Tasks | Input | Output |
---|---|---|
query understanding | question | search queries |
document retrieval | question/queries | documents |
evidence extraction | questions+documents | contexts |
answer generation | question+contexts | answer |
result enhancement | question+answer+contexts | answer |
pip3 install -r requirements.txt
python3 healthcheck.py
- The Organization column only record the organization of the first author.
Date | Title | Organization | Code |
---|---|---|---|
2024/02/29 | Retrieval-Augmented Generation for AI-Generated Content: A Survey | Peking University | Code |
2024/01/03 | Retrieval-Augmented Generation for Large Language Models: A Survey | Tongji University | Code |
2024/01/03 | A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models | Islamic University of Technology | No |
2023/12/07 | Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications | Harbin Institute of Technology | No |
2023/09/19 | The Rise and Potential of Large Language Model Based Agents: A Survey | Fudan NLP Group | Code |
2023/08/14 | Large Language Models for Information Retrieval: A Survey | Renmin University | Code |
2022/02/02 | A Survey on Retrieval-Augmented Text Generation | Nara Institute of Science and Techonology | No |
- The Organization column only record the organization of the first author.
Date | Title | Organization | Code |
---|---|---|---|
2023/11/22 | FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation | Code |
|
2023/11/08 | PDFTriage: Question Answering over Long, Structured Documents | Stanford | Code |
2023/10/27 | WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia | Stanford | Code |
2023/10/27 | LeanDojo: Theorem Proving with Retrieval-Augmented Language Models | Caltech | Code |
2023/06/13 | WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences | Tsinghua University | Code |
2023/05/23 | WebCPM: Interactive Web Search for Chinese Long-form Question Answering | Tsinghua University | Code |
2022/06/01 | WebGPT: Browser-assisted question-answering with human feedback | Open AI | No |
- The Organization column only record the organization of the first author.