#
text-segmentation
Here are 33 public repositories matching this topic...
SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm
spellcheck
fuzzy-search
fuzzy-matching
edit-distance
levenshtein
levenshtein-distance
spelling
spell-check
chinese-text-segmentation
word-segmentation
approximate-string-matching
spelling-correction
damerau-levenshtein
text-segmentation
chinese-word-segmentation
symspell
-
Updated
Jul 9, 2020 - C#
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、句法分析等),无监督或弱监督(种子词)方法
nlp
sentiment-analysis
unsupervised
named-entity-recognition
text-summarization
dependency-parser
text-segmentation
text-cleaning
new-word-discovery
-
Updated
Jul 23, 2020 - Python
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
nlp
tokenizer
text-processing
semeval
nlp-library
word-segmentation
spelling-correction
tokenization
text-segmentation
spell-corrector
word-normalization
-
Updated
Aug 13, 2020 - Python
Python port of SymSpell
python
spellcheck
fuzzy-search
fuzzy-matching
edit-distance
levenshtein
levenshtein-distance
spelling
spell-check
chinese-text-segmentation
word-segmentation
approximate-string-matching
spelling-correction
damerau-levenshtein
text-segmentation
chinese-word-segmentation
symspell
-
Updated
Apr 28, 2020 - Python
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
machine-learning
awesome
ocr
computer-vision
deep-learning
text-recognition
text-detection
text-segmentation
end-to-end-ocr
video-ocr
-
Updated
Sep 15, 2019
A sentence segmenter that actually works!
-
Updated
Aug 18, 2020 - Python
Implementation of the paper: Text Segmentation as a Supervised Learning Task
-
Updated
Oct 2, 2019 - Python
(yet another not really) awesome topic/text segmentation list
-
Updated
Nov 26, 2018
Fast Word Segmentation with Triangular Matrix
spellcheck
spell-check
spelling-checker
spell-checker
word-segmentation
spelling-correction
spelling-corrector
spellchecker
text-segmentation
spell-corrector
symspell
-
Updated
May 6, 2018 - C#
Mandarin Chinese text segmentation and mobile dictionary Android app (中文分词)
-
Updated
May 18, 2018 - Java
Word Segmentation with Dynamic Programming
spellcheck
spell-check
spell-checker
word-segmentation
spelling-correction
spelling-corrector
spellchecker
text-segmentation
spell-corrector
symspell
-
Updated
May 6, 2018 - C#
Spelling correction and string segmentation written in Go
golang
spellcheck
spelling
spell-check
word-segmentation
spelling-correction
text-segmentation
symspell
string-segmentation
-
Updated
Jan 26, 2020 - Go
Text segmentation into separate words using a simple unigram model and the Viterbi algorithm
-
Updated
Oct 21, 2018 - C#
Uses GloVe embeddings and greedy sequence segmentation to semantically segment a text document into any number of k segments.
-
Updated
Feb 17, 2019 - Jupyter Notebook
Data for the ACL 2020 paper - Improving Segmentation for Technical Support Problems
natural-language-processing
nlp-machine-learning
text-segmentation
nlp-datasets
ibm-research-ai
acl2020
-
Updated
Jun 23, 2020
Neural and nonneural text segmentation methods.
-
Updated
Oct 8, 2018 - Jupyter Notebook
Language processing interface: some tools to process different natural languages
natural-language-processing
wordnet
segmentation
preprocessing
tokenization
stemming
text-segmentation
word-tokenizing
-
Updated
Jul 28, 2017 - Java
Perl wrapper for CppJieba (Chinese text segmentation)
-
Updated
May 27, 2017 - Perl
Image operator learning with CNNs
python
tensorflow
image-processing
cnn
classification
convolutional-neural-networks
text-segmentation
image-operators
-
Updated
Aug 31, 2017 - Jupyter Notebook
一些容易被分词工具被分错的句子。
natural-language-processing
text-analysis
segmentation
chinese-nlp
chinese-text-segmentation
text-segmentation
chinese-word-segmentation
-
Updated
Nov 13, 2019 - Jupyter Notebook
Using Otsu's thresholding for text segmentation on images of sticky notes.
computer-vision
image-processing
text-detection
thresholding
text-segmentation
otsu
otsu-thresholding
-
Updated
May 5, 2020 - HTML
How to add user dictionary to MeCab
-
Updated
Jun 22, 2020
Program that allows to detect and classify the segments of medieval royal charters according to their diplomatic formulae.
-
Updated
Apr 21, 2020 - Java
Thai sentence segmentation website
-
Updated
Aug 16, 2020 - PHP
Identification of Knowledge Gaps In Text
nlp
natural-language-processing
readability
text-segmentation
natural-language-understanding
coherence-analysis
-
Updated
Jul 11, 2020
Improve this page
Add a description, image, and links to the text-segmentation topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the text-segmentation topic, visit your repo's landing page and select "manage topics."


If you try to run experiment in system/docker container where missing git (in system) your code will crush at this line:
https://github.com/catalyst-team/catalyst/blob/master/catalyst/utils/pipelines.py#L4
with message like this: