Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add clue-ai/ChatYuan #18

Closed
alreadydone opened this issue Mar 30, 2023 · 5 comments
Closed

add clue-ai/ChatYuan #18

alreadydone opened this issue Mar 30, 2023 · 5 comments

Comments

@alreadydone
Copy link

https://github.com/clue-ai/ChatYuan

I opened an issue there but got no response. Maybe someone here will volunteer to add it.

@nichtdax
Copy link
Owner

nichtdax commented Apr 1, 2023

Can you provide the link to some kind of Readme in English?

@alreadydone
Copy link
Author

alreadydone commented Apr 1, 2023

The official readme contains English:

ChatYuan large v2 is an open-source large language model for dialogue, supports both Chinese and English languages, and in ChatGPT style.

Based on the original functions of Chatyuan-large-v1, we optimized the model as follows:

-Added the ability to speak in both Chinese and English.

-Added the ability to refuse to answer. Learn to refuse to answer some dangerous and harmful questions.

-Added code generation functionality. Basic code generation has been optimized to a certain extent.

-Enhanced basic capabilities. The original contextual Q&A and creative writing skills have significantly improved.

-Added a table generation function. Make the generated table content and format more appropriate.

-Enhanced basic mathematical computing capabilities.

-The maximum number of length tokens has been expanded to 4096.

-Enhanced ability to simulate scenarios

But there are some parts in the Chinese readme that isn't translated:

It uses the same technical approach as the v1 version and has been optimized in aspects such as fine-tuning data, RLHF, and chain-of-thoughts.

ChatYuan-large-v2 is a lightweight model in the ChatYuan series that achieves high quality. Inference can be perform on consumer-level graphics cards, PCs, or even mobile phones (only 400M for INT4).

online Demo(Huggingface) | online Demo(ModelScope, in Chinese) | try online in Colab

The base model is PromptCLUE (770M parameters, download) pre-trained on a 100B-token Chinese corpus, which has also undergone Prompt-task-based training on hundreds of tasks. It was then trained on 100M functional dialogue / multi-turn dialogue data to obtain the ChatYuan model. ClueAI's online service is based on proprietary models of dozens of billions of parameters, and they have plans to open-source larger models, according to this article (in Chinese).

@nichtdax
Copy link
Owner

nichtdax commented Apr 1, 2023

But there are some parts in the Chinese readme that isn't translated:

I would prefer a fully English readme with example. But do you think we should just add your comment in Related Links (if this project is added to the list)

@alreadydone
Copy link
Author

Indeed it seems they are too focused on Chinese users (their company is centered around the CLUE benchmark as you can see from its name ClueAI, where C stands for Chinese), so that all examples provided are in Chinese. Maybe ChatYuan is not friendly enough to English users, and maybe it's not a good fit for this repo for this reason, but if you decide to add it, feel free to link to my comment above.

@nichtdax
Copy link
Owner

Resolved by 50ee7e2

This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants