word2vec
Here are 1,210 public repositories matching this topic...
This is an awesome library, thanks @ddbourgin!!
Users might not know the best way to install this package and try it out. (I didn't, so I eventually just copied the source files.)
Neither the readme nor readthedocs have install instructions.
I couldn't find it on PyPi or Anaconda, and there doesn't appear to be a pyproject.toml, setup.cfg, setup.py, or conda recipe.
Moreover, the t
-
Updated
Dec 1, 2019
-
Updated
Jun 5, 2020 - Python
-
Updated
Mar 23, 2019 - Python
-
Updated
Apr 7, 2019 - Jupyter Notebook
Hi there,
I think there might be a mistake in the documentation. The Understanding Scaled F-Score section says
The F-Score of these two values is defined as:
$$ \mathcal{F}_\beta(\mbox{prec}, \mbox{freq}) = (1 + \beta^2) \frac{\mbox{prec} \cdot \mbox{freq}}{\beta^2 \cdot \mbox{prec} + \mbox{freq}}. $$
$\beta \in \mathcal{R}^+$ is a scaling factor where frequency is favored if $\beta
-
Updated
Jul 7, 2020 - Python
-
Updated
Jul 8, 2020 - Python
-
Updated
May 29, 2020 - Python
-
Updated
Apr 22, 2019 - Python
I would like to know what all the abbreviations mean? Some I can guess, like "PUNCT", but no idea what "X" might be. I want to retain contractions, but hard to choose options without documentation.
Thanks. Great performance code!
-
Updated
Jan 10, 2020 - Python
大佬您好,我参考了您得assignment1中得word2vec.py得实现。但是在运行过程中梯度检测报错了。
==== Gradient check for skip-gram ====
Gradient check failed.
First gradient error found at index (0, 0)
Your gradient: -0.087147 Numerical gradient: 1254.567123
我是用py3实现的,之前的所有代码几乎一致,也都正确通过了,唯独这里通过不了。之后我将您的代码直接全部拷贝下来运行,同样报以上错误,请问您知道怎么回事嘛,您当时运行通过了吗?
-
Updated
Apr 22, 2020 - Jupyter Notebook
def get_all_words(self): """ Return all words tokenized, in lowercase and without punctuation """ return [w.lower() for w in word_tokenize(self.text) if w not in string.punctuation]
I found that in this function, only punctuation of the text was removed. But there are other types of words that have not been removed.
eg:
`from nltk.corpus import stopwords
-
Updated
May 31, 2020 - Python
-
Updated
Jun 9, 2020 - C#
-
Updated
Mar 11, 2020 - Jupyter Notebook
-
Updated
Jan 28, 2020 - Python
-
Updated
Feb 22, 2018 - Python
-
Updated
Jun 6, 2020 - Go
-
Updated
Jun 1, 2020 - OpenEdge ABL
-
Updated
Jul 5, 2018 - Python
Improve this page
Add a description, image, and links to the word2vec topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the word2vec topic, visit your repo's landing page and select "manage topics."


Example (from TfidfTransformer)
This method expects a list of tuples, instead of an iterable. This means that the entire corpus has to be stored as a lis