Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add StableLM support #410

Merged
merged 5 commits into from
Apr 6, 2024
Merged

Add StableLM support #410

merged 5 commits into from
Apr 6, 2024

Conversation

Isotr0py
Copy link
Contributor

Related issue:

Features:

  • Add stabilityai/stable-code-3b support
  • Support partial ROPE in QuantAttentionFused (partial rotary embedding used in stablelm and phi-2)

@Isotr0py Isotr0py changed the title Support Stablelm Add StableLM support Mar 26, 2024
@casper-hansen casper-hansen mentioned this pull request Apr 6, 2024
13 tasks
@casper-hansen casper-hansen merged commit e9f6269 into casper-hansen:main Apr 6, 2024
@ktotam1
Copy link

ktotam1 commented Jun 10, 2024

Just for clarification, this code only works with use_parallel_residual false, right? If use_parallel_residiual was true stablelm has basically the same architecture as phi-2 right? Have you experimented with implementing this?
Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants