Issues: explosion/spaCy
Issues list
Filter duplicate vectors when pruning vectors
bug
Bugs and behaviour differing from documentation
feat / vectors
Feature: Word vectors and similarity
#5397
opened May 4, 2020 by
adrianeboyd
Efficient GPU support for Entity Linking pipe
enhancement
Feature requests and improvements
feat / nel
Feature: Named Entity linking
gpu
Using spaCy on GPU
help wanted
Contributions welcome!
#4281
opened Sep 12, 2019 by
svlandeg
training with custom loss function, or assigning weights to classes
enhancement
Feature requests and improvements
feat / transformer
Feature: Transformer
#6873
opened Oct 5, 2019 by
fhamborg
Add checks (and converters?) for documents with multiple sentences in debug-data
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
#4409
opened Oct 9, 2019 by
adrianeboyd
Make it possible to merge Vocab/StringStore instances
enhancement
Feature requests and improvements
feat / serialize
Feature: Serialization, saving and loading
scaling
Scaling, serving and parallelizing spaCy
#4411
opened Oct 9, 2019 by
johann-petrak
Doc.tensor into user_hooks
enhancement
Feature requests and improvements
feat / doc
Feature: Doc, Span and Token objects
feat / vectors
Feature: Word vectors and similarity
help wanted
Contributions welcome!
#4439
opened Oct 14, 2019 by
tamuhey
Sentencepiece base Language
enhancement
Feature requests and improvements
feat / tokenizer
Feature: Tokenizer
feat / transformer
Feature: Transformer
new language
Adding support for new languages to spaCy.
#6874
opened Oct 21, 2019 by
tamuhey
German adjectives ending on -e are not lemmatized using the lookup lemmatizer
enhancement
Feature requests and improvements
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
help wanted (easy)
Contributions welcome! (also suited for spaCy beginners)
lang / de
German language data and models
#4622
opened Nov 11, 2019 by
SuzanaK
Memory usage of debug-data with a huge training set
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
perf / memory
Performance: memory use
#4748
opened Dec 3, 2019 by
sfragis
Handle sentence boundaries from multiple components
enhancement
Feature requests and improvements
feat / doc
Feature: Doc, Span and Token objects
feat / parser
Feature: Dependency Parser
feat / sentencizer
Feature: Sentencizer (rule-based sentence segmenter)
#4775
opened Dec 5, 2019 by
adrianeboyd
displaCy dependency tree labels backwards (and upside down) in RTL languages in certain browsers
bug
Bugs and behaviour differing from documentation
feat / visualizers
Feature: Built-in displaCy and other visualizers
help wanted
Contributions welcome!
#4854
opened Dec 30, 2019 by
erip
📚 Inaccurate pre-trained model predictions master thread
models
Issues related to the statistical models
perf / accuracy
Performance: accuracy
#3052
opened Dec 14, 2018 by
ines
Windows .pyd files sneakily depend on msvcp140.dll
help wanted
Contributions welcome!
install
Installation issues
windows
Issues related to Windows
#5332
opened Apr 21, 2020 by
gthb
Supporting out-of-band buffers with pickle protocol 5
enhancement
Feature requests and improvements
feat / serialize
Feature: Serialization, saving and loading
help wanted
Contributions welcome!
#5472
opened May 21, 2020 by
jakirkham
Tokenizer special cases do not work around infix punctuation
enhancement
Feature requests and improvements
feat / tokenizer
Feature: Tokenizer
lang / en
English language data and models
#5598
opened Jun 16, 2020 by
cassidylaidlaw
Character-based orthographic variants
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
training
Training and updating models
#5609
opened Jun 19, 2020 by
adrianeboyd
"Value Error: bytes object is too large" when using to_disk on large model.
feat / serialize
Feature: Serialization, saving and loading
feat / transformer
Feature: Transformer
v2
spaCy v2.x
#6875
opened Jun 22, 2020 by
JaronMink
Issue resuming training on transformer based NER
feat / transformer
Feature: Transformer
🌙 nightly
Discussion and contributions related to nightly builds
perf / memory
Performance: memory use
training
Training and updating models
#6323
opened Oct 29, 2020 by
fcggamou
Lookaround operators on Matcher patterns
enhancement
Feature requests and improvements
feat / matcher
Feature: Token, phrase and dependency matcher
help wanted
Contributions welcome!
#6420
opened Nov 20, 2020 by
kinghuang
Models are not deterministic / reproducible on GPU
bug
Bugs and behaviour differing from documentation
feat / ner
Feature: Named Entity Recognizer
gpu
Using spaCy on GPU
reproducibility
Consistency, reproducibility, determinism, and randomness
#6490
opened Dec 3, 2020 by
echatzikyriakidis
Suffix doesn't match for sentence ending in uppercase.
feat / tokenizer
Feature: Tokenizer
lang / en
English language data and models
#6695
opened Jan 8, 2021 by
jdupl123
Displacy Visualizer : Show fine_grain Tags and POS Tags in SpaCy Dependency Visualizer
enhancement
Feature requests and improvements
feat / visualizers
Feature: Built-in displaCy and other visualizers
#6773
opened Jan 20, 2021 by
Fxlix
Use mmap to share models across processes and speed up loading
enhancement
Feature requests and improvements
scaling
Scaling, serving and parallelizing spaCy
#6784
opened Jan 21, 2021 by
alexgarel
Feature Request: Pass custom values from Matcher pattern definitions to matched tokens
#13520
opened Jun 5, 2024 by
apodgorny