Issues: explosion/spaCy
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Spacy-LLM code sample produces no output
feat/llm
Feature: LLMs (incl. spacy-llm)
#13132
opened Nov 17, 2023 by
rkatriel
Pre-trained coreference pipeline incompatible with spaCy > 3.4
experimental
Experimental components and features
#13111
opened Nov 6, 2023 by
Fohlen
spancat training not working with span group other than "sc"
feat / spancat
Feature: Span Categorizer
feat / training
Feature: Training utils, Example, Corpus and converters
training
Training and updating models
#13090
opened Oct 29, 2023 by
nrodnova
Feature request: Extension attributes for span groups
enhancement
Feature requests and improvements
feat / doc
Feature: Doc, Span and Token objects
#13067
opened Oct 17, 2023 by
svlandeg
Dependency sentence segmenter handles newlines inconsistently between languages
feat / senter
Feature: Sentence Recognizer
lang / it
Italian language data and models
#13059
opened Oct 11, 2023 by
freddyheppell
Random 'Segmentation fault (core dumped)' error when training for long spancat
bug
Bugs and behaviour differing from documentation
feat / spancat
Feature: Span Categorizer
#13026
opened Sep 28, 2023 by
belalsalih
Lemmatization issues [Italian][Spanish][French]
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
lang / es
Spanish language data and models
lang / fr
French language data and models
lang / it
Italian language data and models
perf / accuracy
Performance: accuracy
#12954
opened Sep 4, 2023 by
databill86
Sentence-terminal periods not tokenized properly in Malayalam text
bug
Bugs and behaviour differing from documentation
lang / ml
Malayalam language data and models
#12898
opened Aug 9, 2023 by
BLKSerene
Unaligned predicted spans are ignored in Feature: Scorer
Scorer.score_spans
feat / scorer
#12811
opened Jul 10, 2023 by
connorbrinton
NER fails on warning "Token indices sequence length is longer than the specified maximum"
perf / accuracy
Performance: accuracy
#12622
opened May 11, 2023 by
schudoku
Installation issue on old macOSes for new Korean tokenizer in v4.0 alpha
lang / ko
Korean language data and models
#12416
opened Mar 14, 2023 by
BLKSerene
Displacy visualiser only sometimes shows labels
feat / visualizers
Feature: Built-in displaCy and other visualizers
#12411
opened Mar 13, 2023 by
goonhoon
Training transformer model goes from score 0.97 to ZERO
bug
Bugs and behaviour differing from documentation
feat / ner
Feature: Named Entity Recognizer
feat / training
Feature: Training utils, Example, Corpus and converters
feat / transformer
Feature: Transformer
perf / memory
Performance: memory use
#12383
opened Mar 8, 2023 by
svlandeg
Not all displaCy templates can be overridden
enhancement
Feature requests and improvements
feat / visualizers
Feature: Built-in displaCy and other visualizers
#12267
opened Feb 9, 2023 by
drnextgis
Access NEL prediction scores across KB candidates
enhancement
Feature requests and improvements
feat / nel
Feature: Named Entity linking
#12048
opened Jan 3, 2023 by
Luis-R-Flores
Mismatched IDs error when using nlp.rehearse with listeners
bug
Bugs and behaviour differing from documentation
feat / textcat
Feature: Text Classifier
training
Training and updating models
#12044
opened Jan 2, 2023 by
thomashacker
Doc span group spans aren't adjusted for retokenization
bug
Bugs and behaviour differing from documentation
feat / doc
Feature: Doc, Span and Token objects
feat / tokenizer
Feature: Tokenizer
#12024
opened Dec 24, 2022 by
kinghuang
spacy package CLI command accepts list of code_paths, but the others do not
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
feat / ux
Feature: User experience, error messages etc.
#12000
opened Dec 19, 2022 by
kinghuang
Inconsistent NER predictions from identical inputs while using ThreadPoolExecutor
reproducibility
Consistency, reproducibility, determinism, and randomness
scaling
Scaling, serving and parallelizing spaCy
third-party
Third-party packages and services
#11868
opened Nov 25, 2022 by
pege345
Italian tagger and lemmatizer performance dropped with the new v3.4 version
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
feat / tagger
Feature: Part-of-speech tagger
lang / it
Italian language data and models
perf / accuracy
Performance: accuracy
#11298
opened Aug 12, 2022 by
databill86
Tokenizer uses a significant amount of memory compared to the input
feat / doc
Feature: Doc, Span and Token objects
feat / tokenizer
Feature: Tokenizer
perf / memory
Performance: memory use
🔜 v4.0
Related to upcoming v4.0
#11295
opened Aug 11, 2022 by
itamarst
Problems and errors in new German lemmatizer (since 3.3.0)
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
lang / de
German language data and models
#10953
opened Jun 13, 2022 by
lutz-100worte
Executing a none python script using "Spacy Projects" generates an error
projects
spaCy projects and project templates
windows
Issues related to Windows
#10845
opened May 25, 2022 by
dhirajsuvarna
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
