Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adapt image datasets #3362

Merged
merged 78 commits into from
Dec 9, 2021
Merged

Adapt image datasets #3362

merged 78 commits into from
Dec 9, 2021

Conversation

mariosasko
Copy link
Collaborator

@mariosasko mariosasko commented Dec 1, 2021

This PR:

  • adapts the ImageClassification template to use the new Image feature
  • adapts the following datasets to use the new Image feature:
    • beans (+ fixes streaming)
    • cast_vs_dogs (+ fixes streaming)
    • cifar10
    • cifar100
    • fashion_mnist
    • mnist
    • head_qa

cc @nateraw

@mariosasko
Copy link
Collaborator Author

mariosasko commented Dec 6, 2021

This PR can be merged after #3163 is merged (this PR is pretty big because I was working on the forked branch).

@lhoestq @albertvillanova Could you please take a look at the changes in src/datasets/utils/streaming_download_manager.py? These changes were required to support streaming of the cats_vs_dogs and the beans datasets.

@mariosasko mariosasko marked this pull request as ready for review December 7, 2021 14:24
@mariosasko
Copy link
Collaborator Author

The CI failures are due to the missing fields in the README files.

@mariosasko mariosasko closed this Dec 7, 2021
@mariosasko mariosasko reopened this Dec 7, 2021
Copy link
Member

@lhoestq lhoestq left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks all good to me ! feel free to merge if it's good on your side

@lhoestq
Copy link
Member

lhoestq commented Dec 9, 2021

and thanks for adding support for Path.name and Path.parent for streaming :)

@mariosasko mariosasko merged commit e855844 into master Dec 9, 2021
@mariosasko mariosasko deleted the adapt-image-datasets branch December 9, 2021 18:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants