-
-
Notifications
You must be signed in to change notification settings - Fork 4.3k
explosion spaCy Language-support Discussions
Sort by:
Latest activity
Label
Categories, most helpful, and community links
Categories
Community links
🌍 Language Support Discussions
Discuss the language data and training models for new languages
Pinned to Language Support
-
🌍 Adding models for new languages master thread
enhancementFeature requests and improvements lang / allGlobal language data new languageAdding support for new languages to spaCy.
Discussions
-
You must be logged in to vote 🌍 Hindi Language support
lang / hiHindi language data and models v2spaCy v2.x -
You must be logged in to vote 🌍 Spanish lemmatizer doesn't work for future tense verbs
lang / esSpanish language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote 🌍 Custom NER for other languages.
trainingTraining and updating models feat / nerFeature: Named Entity Recognizer -
You must be logged in to vote 🌍 Add a custom language to spacy
enhancementFeature requests and improvements -
You must be logged in to vote 🌍 Ukrainian model proposal
enhancementFeature requests and improvements lang / ukUkrainian language data and models new languageAdding support for new languages to spaCy. -
You must be logged in to vote 🌍 Lemmatization is not working for Chinese language
lang / zhChinese language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote 🌍 Addition of "entity_ruler" in spacy 3.2 - Portuguese
lang / ptPortuguese language data and models feat / matcherFeature: Token, phrase and dependency matcher -
You must be logged in to vote 🌍 Does spacy_hunspell support multiple languages?
third-partyThird-party packages and services -
You must be logged in to vote 🌍 List of definition token.lemma, token.dep abbrev used in doc/token
docsDocumentation and website feat / docFeature: Doc, Span and Token objects -
You must be logged in to vote 🌍 French model : tense of a verb is removed in version 3.x.
modelsIssues related to the statistical models lang / frFrench language data and models feat / morphologyFeature: Morphology and MorphAnalysis -
You must be logged in to vote 🌍 Lemmatization for Indonesian Language support
lang / idIndonesian language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote 🌍 How to train lemmatizer? Are lookup tables required?
feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote 🌍 Wrapping independently trained Pytorch model with Thinc
🔮 thincspaCy's machine learning library Thinc -
You must be logged in to vote 🌍 French and Italian noun chunks, contributors are welcomed!
lang / itItalian language data and models lang / frFrench language data and models -
You must be logged in to vote 🌍 Training data for English language models
lang / enEnglish language data and models -
You must be logged in to vote 🌍 German lemmatizer confused by capitalization
lang / deGerman language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote 🌍 Problem with French parsing when using apostrophe
lang / frFrench language data and models perf / accuracyPerformance: accuracy -
You must be logged in to vote 🌍 Adding Vietnamese language support for Spacy
lang / viVietnamese language data and models new languageAdding support for new languages to spaCy. -
You must be logged in to vote 🌍 Using non-UD Arabic data
feat / cliFeature: Command-line interface -
You must be logged in to vote 🌍 Japanese transformers-based model
enhancementFeature requests and improvements lang / jaJapanese language data and models feat / transformerFeature: Transformer -
You must be logged in to vote 🌍 German lemmatizer based on outdated spelling rules
enhancementFeature requests and improvements lang / deGerman language data and models help wanted (easy)Contributions welcome! (also suited for spaCy beginners) feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote 🌍 French tokenization - iconsistent application of exceptions in FR_BASE_EXCEPTIONS & other unexpected tokenization
lang / frFrench language data and models feat / tokenizerFeature: Tokenizer -
You must be logged in to vote 🌍 NER differences in spaCy v2 and v3.
lang / enEnglish language data and models feat / nerFeature: Named Entity Recognizer -
You must be logged in to vote 🌍 Wrong location detection in Spanish
lang / esSpanish language data and models feat / tokenizerFeature: Tokenizer