Shell
Updated May 28, 2019
#
text-processing
Repositories 527
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Python
Updated Apr 28, 2019
Intuitive find & replace CLI (sed alternative)
Rust
Updated May 27, 2019
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
natural-language-processing
deep-learning
nlp-library
nlp-parsing
chinese-nlp
text-classification
text-processing
Python
Updated May 28, 2019
Simple SQL-like syntax on top of Perl text processing.
Python
Updated May 16, 2019
Text Classification Algorithms: A Survey
text-classification
nlp-machine-learning
document-classification
text-processing
dimensionality-reduction
rocchio-algorithm
boosting-algorithms
logistic-regression
naive-bayes-classifier
k-nearest-neighbours
support-vector-machines
decision-trees
random-forest
conditional-random-fields
deep-learning
deep-neural-network
recurrent-neural-networks
convolutional-neural-networks
deep-belief-network
hierarchical-attention-networks
Python
Updated May 22, 2019
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules u…
nlp
python
computational-linguistics
linguistics
library
folia
machine-learning
language-modelling
search-algorithms
evaluation-metrics
text-processing
nlp-library
natural-language-processing
Python
Updated Mar 13, 2019
Open Korean Text Processor - An Open-source Korean Text Processor
korean
korean-text-processing
natural-language-processing
text-processing
tokenizer
korean-tokenizer
Scala
Updated Aug 7, 2018
Natural language detection library for Go
Go
Updated Mar 6, 2019
A simple Python module for parsing human names into their individual components
Python
Updated Apr 20, 2019
python
python2
python3
python-2
python-3
parser-combinators
parsing-expression-grammar
parsing
parsing-library
text-processing
Python
Updated May 28, 2019
machine-learning
classification
python
python3
python2
text
text-mining
adversarial-examples
spam
spam-filtering
spam-detection
spam-classification
text-classification
text-analysis
data-science
data-mining
text-processing
black-box-benchmarking
black-box-attacks
metrics
Python
Updated Oct 14, 2018
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis…
nlp
text-processing
nlp-library
spelling-correction
tokenizer
tokenization
word-segmentation
word-normalization
spell-corrector
text-segmentation
semeval
Python
Updated Nov 22, 2018
A fast implementation of Aho-Corasick in Rust.
Rust
Updated May 1, 2019
Util collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
JavaScript
Updated Apr 18, 2019
UNIC: Unicode and Internationalization Crates for Rust
unicode
internationalization
text-processing
crates
rust
cldr
locale-data
unic
unicode-characters
unicode-algorithms
Good first issues
Rust
Updated Mar 5, 2019
Stanford NLP group's shared Python tools.
Python
Updated Mar 14, 2018
Textpipe: clean and extract metadata from text
Good first issues
#17 opened 11 months ago by dungchu
1
Python
Updated May 27, 2019
A low level regular expression library that uses deterministic finite automata.
Rust
Updated Feb 25, 2019
CogComp's light-weight Python NLP annotators
Python
Updated Feb 18, 2019
Text vectorization tool to outperform TFIDF for classification tasks
python
nlp
machine-learning
text-analysis
text-classification
text-processing
tf-idf
natural-language-processing
Python
Updated Sep 10, 2018
pyarabic
Python
Updated Mar 27, 2019
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such …
c-plus-plus
python
nlp
ngrams
skipgram
ngram
corpus
linguistics
library
text-processing
computational-linguistics
pattern-recognition
C++
Updated Jan 15, 2019
Multi-lingual Text Processing
Updated Jan 22, 2019
Extract indicators of compromise from text, including "escaped" ones.
ioc
iocs
extract
extraction
text-mining
text-processing
indicators-of-compromise
command-line-tool
command-line
defang
escaping
regex
regexp
data-mining
Go
Updated May 6, 2019
Vision Framework IOS WWDC 2017
image-analysis
face-detection
rectangle-detection
char-detection
ios-vision
visionframework
wwdc2017
ios
machine
learning
text-detection
text-processing
Swift
Updated Jun 26, 2017
A flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Java
Updated Oct 19, 2016
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku and Zenkaku
Python
Updated Feb 3, 2019
Unix Text Processing Command Reference
Updated Sep 12, 2016
vims - use vim commands for pipeline filtering in terminal
Shell
Updated Aug 12, 2018

