-
Notifications
You must be signed in to change notification settings - Fork 68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] FastTextLangId doesnt return a list truely #33
Comments
Thanks for raising this issue! I'm always happy to see people contributing to the project. I think I've discovered the root cause. It appears that when Dask performs type inference on the function, it passes in a string instead of a list. I'll make a PR that explicitly annotates the meta so no type inference is needed. This bug has seemingly revealed another though, as the mentioned fix isn't actually enough. If you only implement that, you may be left with another error Thanks again for raising the issue! I should have a PR up shortly for this. |
i added this line in the start of main function the next error occured
|
i changed
to
so problem solved |
so i close the issue |
Describe the bug
IndexError: string index out of range
Steps/Code to reproduce bug
i used examples files to download common crawl with download_common_crawl.py after that i decided to seprate Persian language so i run identify_languages_and_fix_unicode.py but when i run the python script the above error occurs
here are varibles that i changed in first file
second file changes
Expected behavior
run with no error
**Environment overview **
Environment details
The text was updated successfully, but these errors were encountered: