Skip to content

Issues: explosion/spaCy

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

Filter duplicate vectors when pruning vectors bug Bugs and behaviour differing from documentation feat / vectors Feature: Word vectors and similarity
#5397 opened May 4, 2020 by adrianeboyd
Efficient GPU support for Entity Linking pipe enhancement Feature requests and improvements feat / nel Feature: Named Entity linking gpu Using spaCy on GPU help wanted Contributions welcome!
#4281 opened Sep 12, 2019 by svlandeg
training with custom loss function, or assigning weights to classes enhancement Feature requests and improvements feat / transformer Feature: Transformer
#6873 opened Oct 5, 2019 by fhamborg
Add checks (and converters?) for documents with multiple sentences in debug-data enhancement Feature requests and improvements feat / cli Feature: Command-line interface
#4409 opened Oct 9, 2019 by adrianeboyd
Make it possible to merge Vocab/StringStore instances enhancement Feature requests and improvements feat / serialize Feature: Serialization, saving and loading scaling Scaling, serving and parallelizing spaCy
#4411 opened Oct 9, 2019 by johann-petrak
Doc.tensor into user_hooks enhancement Feature requests and improvements feat / doc Feature: Doc, Span and Token objects feat / vectors Feature: Word vectors and similarity help wanted Contributions welcome!
#4439 opened Oct 14, 2019 by tamuhey
Sentencepiece base Language enhancement Feature requests and improvements feat / tokenizer Feature: Tokenizer feat / transformer Feature: Transformer new language Adding support for new languages to spaCy.
#6874 opened Oct 21, 2019 by tamuhey
German adjectives ending on -e are not lemmatized using the lookup lemmatizer enhancement Feature requests and improvements feat / lemmatizer Feature: Rule-based and lookup lemmatization help wanted (easy) Contributions welcome! (also suited for spaCy beginners) lang / de German language data and models
#4622 opened Nov 11, 2019 by SuzanaK
Memory usage of debug-data with a huge training set enhancement Feature requests and improvements feat / cli Feature: Command-line interface perf / memory Performance: memory use
#4748 opened Dec 3, 2019 by sfragis
Handle sentence boundaries from multiple components enhancement Feature requests and improvements feat / doc Feature: Doc, Span and Token objects feat / parser Feature: Dependency Parser feat / sentencizer Feature: Sentencizer (rule-based sentence segmenter)
#4775 opened Dec 5, 2019 by adrianeboyd
displaCy dependency tree labels backwards (and upside down) in RTL languages in certain browsers bug Bugs and behaviour differing from documentation feat / visualizers Feature: Built-in displaCy and other visualizers help wanted Contributions welcome!
#4854 opened Dec 30, 2019 by erip
Spacy + Dask? bug Bugs and behaviour differing from documentation scaling Scaling, serving and parallelizing spaCy v3.0 Related to v3.0
#5111 opened Mar 5, 2020 by AlJohri
📚 Inaccurate pre-trained model predictions master thread models Issues related to the statistical models perf / accuracy Performance: accuracy
#3052 opened Dec 14, 2018 by ines
Windows .pyd files sneakily depend on msvcp140.dll help wanted Contributions welcome! install Installation issues windows Issues related to Windows
#5332 opened Apr 21, 2020 by gthb
Supporting out-of-band buffers with pickle protocol 5 enhancement Feature requests and improvements feat / serialize Feature: Serialization, saving and loading help wanted Contributions welcome!
#5472 opened May 21, 2020 by jakirkham
Tokenizer special cases do not work around infix punctuation enhancement Feature requests and improvements feat / tokenizer Feature: Tokenizer lang / en English language data and models
#5598 opened Jun 16, 2020 by cassidylaidlaw
Character-based orthographic variants enhancement Feature requests and improvements feat / cli Feature: Command-line interface training Training and updating models
#5609 opened Jun 19, 2020 by adrianeboyd
"Value Error: bytes object is too large" when using to_disk on large model. feat / serialize Feature: Serialization, saving and loading feat / transformer Feature: Transformer v2 spaCy v2.x
#6875 opened Jun 22, 2020 by JaronMink
Issue resuming training on tansformer based NER feat / transformer Feature: Transformer 🌙 nightly Discussion and contributions related to nightly builds perf / memory Performance: memory use training Training and updating models
#6323 opened Oct 29, 2020 by fcggamou
Lookaround operators on Matcher patterns enhancement Feature requests and improvements feat / matcher Feature: Token, phrase and dependency matcher help wanted Contributions welcome!
#6420 opened Nov 20, 2020 by kinghuang
Models are not deterministic / reproducible on GPU bug Bugs and behaviour differing from documentation feat / ner Feature: Named Entity Recognizer gpu Using spaCy on GPU reproducibility Consistency, reproducibility, determinism, and randomness
#6490 opened Dec 3, 2020 by echatzikyriakidis
Suffix doesn't match for sentence ending in uppercase. feat / tokenizer Feature: Tokenizer lang / en English language data and models
#6695 opened Jan 8, 2021 by jdupl123
Displacy Visualizer : Show fine_grain Tags and POS Tags in SpaCy Dependency Visualizer enhancement Feature requests and improvements feat / visualizers Feature: Built-in displaCy and other visualizers
#6773 opened Jan 20, 2021 by Fxlix
Use mmap to share models across processes and speed up loading enhancement Feature requests and improvements scaling Scaling, serving and parallelizing spaCy
#6784 opened Jan 21, 2021 by alexgarel
ProTip! Find all open issues with in progress development work with linked:pr.