-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add metrec: arabic poetry dataset #893
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That looks awesome good job !
For the sprint that is going to begin soon, we're now asking to add one dataset card per dataset.
You can se more information in those PRs : #896 and #894
Could you create a dataset card for Metrec as well ? The goal of a dataset card is to detail the characteristics of a dataset. The content of the dataset card is going to be displayed in the huggingface.co dataset search tool :)
@lhoestq removed prints and added the dataset card. |
@lhoestq, I want to add other datasets as well. I am not sure if it is possible to do so with the same branch. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Really nice good job :)
To add another dataset, please use a create a new branch from master and create another PR
Hi @zaidalyafeai, really excited to get more Arabic coverage in the lib, thanks for your contribution! Couple of last comments:
|
132e5c0
to
48037c4
Compare
I have no idea how some other files changed. I tried to rebase and push but this created some errors. I had to run the command |
Feel free to create another branch/another PR without all the other changes |
@yjernite can you explain which other files are changed because of the PR ? https://github.com/huggingface/datasets/pull/893/files only shows files related to the dataset. |
Right ! github is nice with us today :) |
Looks like this one is ready to merge, thanks @zaidalyafeai ! |
@lhoestq thanks for the merge. I am not a GitHub geek. I already have another dataset to add. I'm not sure how to add another given my forked repo. Do I follow the same steps with a different checkout name ? |
If you've followed the instructions in here : https://github.com/huggingface/datasets/blob/master/ADD_NEW_DATASET.md#start-by-preparing-your-environment (especially point 2. and the command Then you can try
|
* add metrec arabic poetry dataset * add metrec arabic poetry dataset * remove prints and add dataset card * fix conflicts * fix conflicts * Update datasets/metrec/README.md * Update datasets/metrec/README.md * Update datasets/metrec/README.md Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
No description provided.