RARE Technologies
Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upRepositories
-
gensim
Topic Modelling for Humans
-
smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
-
sqlitedict
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
-
gensim-wheels
Repository to build and test Gensim wheels
-
bounter
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
-
movie-plots-by-genre
Movie plots by genre tutorial at PyData Berlin 2016
-
talks
Presentations & notebooks from our talks /workshops/meetups/etc
-
gensim-data
Data repository for pretrained NLP models and NLP corpora.
-
benchmark_GPU_platforms
Code for the GPU mega-benchmark article
-
w2v_server_googlenews
Code for the word2vec HTTP server running at https://rare-technologies.com/word2vec-tutorial/#bonus_app
-
topic_eval
Tools and services for evaluating topic models
-
rarebot
Scripts and utilities for the RaReBot competition
-
sparsesvd
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
-
gensim-simserver
[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]

