word-embeddings
- Sign up for GitHub or sign in to edit this page
Here are 427 public repositories matching this topic...
Train a simple NER tagger for Swedish trained for instance over this dataset.
For this task, we need to adapt the NLPTaskDataFetcher for the appropriate Swedish dataset and train a simple model using Swedish word embeddings. How to train a model is [illustrated here](https://github.com/zalandoresearch/flair/blob/master/resources/docs/TUTORIAL_TRAI
100+ Chinese Word Vectors 上百种预训练中文词向量
-
Updated
Nov 20, 2019 - Python
A curated list of pretrained sentence and word embedding models
-
Updated
Nov 19, 2019 - Python
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
-
Updated
Nov 20, 2019 - Jupyter Notebook
A fast, efficient universal vector embedding utility package.
-
Updated
Nov 17, 2019 - Python
Beautiful visualizations of how language differs among document types.
-
Updated
Nov 20, 2019 - Python
Overview of Modern Deep Learning Techniques Applied to Natural Language Processing
-
Updated
Nov 20, 2019 - CSS
I would like to know what all the abbreviations mean? Some I can guess, like "PUNCT", but no idea what "X" might be. I want to retain contractions, but hard to choose options without documentation.
Thanks. Great performance code!
We should revisit our alpha and beta default values. 1.0 is way to large.
🔡 Token level embeddings from BERT model on mxnet and gluonnlp
-
Updated
Nov 20, 2019 - Python
Simple downloader for pre-trained word vectors
-
Updated
Nov 14, 2019 - Python
A collection of notebooks for Natural Language Processing from NLP Town
-
Updated
Nov 19, 2019 - Jupyter Notebook
Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs
-
Updated
Nov 15, 2019 - Python
A collection of supervised learning models based on shallow neural network approaches (e.g., word2vec and fastText) with some additional exclusive features. Written in Python and fully compatible with Scikit-learn.
-
Updated
Nov 1, 2019 - Python
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
-
Updated
Nov 11, 2019 - Jupyter Notebook
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
-
Updated
Nov 12, 2019 - Python
Master Deep Learning Algorithms with Extensive Math by Implementing them using TensorFlow
-
Updated
Nov 20, 2019 - Jupyter Notebook
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
-
Updated
Nov 19, 2019 - Jupyter Notebook
Spanish word embeddings computed with different methods and from different corpora
-
Updated
Nov 4, 2019
Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)
-
Updated
Nov 15, 2019 - Python
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
-
Updated
Nov 20, 2019 - Java
Word Embeddings for Information Retrieval
-
Updated
Nov 19, 2019 - Python
Remove problematic gender bias from word embeddings.
-
Updated
Nov 18, 2019 - Jupyter Notebook
A short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)
-
Updated
Nov 15, 2019 - Jupyter Notebook
ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
-
Updated
Nov 20, 2019 - Python
Kadot, the unsupervised natural language processing library.
-
Updated
Jul 9, 2019 - Python


Client code:
model = LogEntropyModel(corpus=data_corpus, normalize=True)Referenced code:
https://github.com/RaRe-Technologies/gensim/blob/44ea7931c916349821aa1c717fbf7e90fb138297/gensim/models/logentropy_model.py#L115
Exception thrown: