LLMs work very well on many tasks. It is striking how much plain next-token prediction has achieved, and we should not take this for granted. Here we collect the popular hypotheses for why LLMs work.
Next-token prediction as a massive multi-task learning problem (https://x.com/_jasonwei/status/1729585618311950445, https://www.youtube.com/watch?v=kYWUEV_e2ss)
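A toy sketch (not from the linked sources) of the idea: each training string implicitly encodes a different task, yet all of them are trained with one and the same next-token objective. The example strings and the lookup "model" below are hypothetical, chosen only to make the point concrete.

```python
import math

# Each (prompt, next token) pair implicitly defines a different task.
examples = [
    ("The capital of France is", " Paris"),   # factual recall
    ("2 + 2 =", " 4"),                        # arithmetic
    ("'chat' in English is", " cat"),         # translation
    ("She go to school ->", " goes"),         # grammar correction
]

def next_token_loss(model, prompt, target):
    """One shared objective: cross-entropy on the next token.
    `model(prompt)` is assumed to return a {token: probability} dict."""
    return -math.log(model(prompt).get(target, 1e-9))

# A hypothetical "model" that has memorized the data perfectly.
memorized = {p: {t: 1.0} for p, t in examples}
model = lambda prompt: memorized[prompt]

losses = [next_token_loss(model, p, t) for p, t in examples]
# Every task, from arithmetic to translation, reduces to the same loss.
print(losses)  # all 0.0 for the memorizing model
```

Under this view, minimizing one scalar loss over diverse text is implicitly fitting many tasks at once.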
The Linear Representation Hypothesis and the Geometry of Large Language Models (https://arxiv.org/pdf/2311.03658)
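A minimal sketch of what "linear representation" means, using made-up 2-D embeddings (the numbers are hypothetical, for illustration only): if a concept such as gender corresponds to a single linear direction, then adding that direction to one word's embedding should land near its counterpart.

```python
import numpy as np

# Toy 2-D "embeddings": one axis loosely encodes royalty, the other gender.
emb = {
    "king":  np.array([0.9, 0.8]),
    "queen": np.array([0.9, -0.8]),
    "man":   np.array([0.1, 0.8]),
    "woman": np.array([0.1, -0.8]),
}

gender_direction = emb["woman"] - emb["man"]  # the linear "concept" direction
candidate = emb["king"] + gender_direction    # move king along that direction

# Under the hypothesis, this lands near "queen".
closest = min(emb, key=lambda w: np.linalg.norm(emb[w] - candidate))
print(closest)  # -> queen
```

The paper studies when and in what geometry such directions exist in real LLM representation spaces; this sketch only illustrates the basic intuition.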
The Platonic Representation Hypothesis (https://arxiv.org/abs/2405.07987)
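The claim here is that representations of different models converge toward a shared geometry. One way to quantify this, sketched below with synthetic data, is a mutual nearest-neighbor alignment score, similar in spirit to the metric used in the paper (the function name and data are assumptions of this sketch): two spaces are aligned when the same inputs have the same neighbors in both.

```python
import numpy as np

def mutual_knn_alignment(A, B, k=2):
    """Average fraction of shared k-nearest neighbors between two
    representation spaces A and B (rows = same inputs)."""
    def knn(X):
        d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
        np.fill_diagonal(d, np.inf)  # exclude each point itself
        return np.argsort(d, axis=1)[:, :k]
    na, nb = knn(A), knn(B)
    return float(np.mean([len(set(na[i]) & set(nb[i])) / k
                          for i in range(len(A))]))

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 8))                  # "inputs" in one model's space
R = np.linalg.qr(rng.normal(size=(8, 8)))[0]  # a random rotation
Y = X @ R                                     # rotated copy: same geometry

print(mutual_knn_alignment(X, Y))                         # 1.0: identical neighborhoods
print(mutual_knn_alignment(X, rng.normal(size=(50, 8))))  # near chance level
```

A rotation preserves all pairwise distances, so the two spaces look different coordinate-wise yet score as perfectly aligned; unrelated random features score near chance.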