A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering various contexts and tasks (task-oriented dialogue systems, abstr…

165 12 Updated Apr 1, 2023

XueFuzhao / InstructionWild

451 41 Updated Jun 9, 2024

teknium1 / GPTeacher

A collection of modular datasets generated by GPT-4: General-Instruct, Roleplay-Instruct, Code-Instruct, and Toolformer

Python 1,608 169 Updated Sep 15, 2023

vaguenebula / AlpacaDataReflect

An experiment to see if ChatGPT can improve the output of the Stanford Alpaca dataset

Python 12 2 Updated Mar 29, 2023

databrickslabs / dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,810 1,153 Updated Jun 30, 2023

FreedomIntelligence / InstructionZoo

265 24 Updated Apr 26, 2024

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,583 244 Updated Dec 12, 2023

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,319 2,907 Updated Sep 2, 2024

LuckyMind baleksey

LLAMA Datasets

sahil280114 / codealpaca

gururise / AlpacaDataCleaned

IntoThatGoodNight / Guanaco

LAION-AI / Open-Assistant

orhonovich / unnatural-instructions

google-research-datasets / presto