Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
The Java implementation of "Enhancing Topic Modeling for Short Texts with Auxiliary Word Embeddings" (TOIS 2017) by Chenliang Li, Yu Duan, Haoran Wang, Zhiqian Zhang, Aixin Sun, and Zongyang Ma: https://dl.acm.org/citation.cfm?doid=3133943.3091108
The constructor for GSDMM with signature:
consumes the docs and vocab objects. This means that users will need to make copies of the data if they wish to use either of these vectors in subsequent processing.
It should be simple to refactor the interface to borrow these instead.
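As a rough illustration of the difference, the sketch below contrasts a constructor that takes ownership of its inputs with one that borrows them. `OwningModel`, `BorrowingModel`, `sample_data`, and the two usage functions are hypothetical names invented for this example. They are not the GSDMM type or its real signature, and the collection types (`Vec<Vec<String>>` and `HashSet<String>`) are assumptions.

```rust
use std::collections::HashSet;

/// Hypothetical stand-in for a model whose constructor consumes its inputs.
/// This is NOT the actual GSDMM type or signature.
struct OwningModel {
    docs: Vec<Vec<String>>,
    vocab: HashSet<String>,
}

impl OwningModel {
    /// Takes `docs` and `vocab` by value, so the caller's values are moved.
    fn new(docs: Vec<Vec<String>>, vocab: HashSet<String>) -> Self {
        OwningModel { docs, vocab }
    }
}

/// Hypothetical borrowing variant: it only holds references to the data.
struct BorrowingModel<'a> {
    docs: &'a [Vec<String>],
    vocab: &'a HashSet<String>,
}

impl<'a> BorrowingModel<'a> {
    /// Takes references, so the caller keeps ownership of both collections.
    fn new(docs: &'a [Vec<String>], vocab: &'a HashSet<String>) -> Self {
        BorrowingModel { docs, vocab }
    }
}

/// Builds two tiny tokenized documents and the vocabulary derived from them.
fn sample_data() -> (Vec<Vec<String>>, HashSet<String>) {
    let docs = vec![
        vec!["short".to_string(), "text".to_string()],
        vec!["topic".to_string(), "model".to_string()],
    ];
    let vocab: HashSet<String> = docs.iter().flatten().cloned().collect();
    (docs, vocab)
}

/// With a consuming constructor, the caller must clone to keep using the data.
fn owning_usage() -> (usize, usize) {
    let (docs, vocab) = sample_data();
    let model = OwningModel::new(docs.clone(), vocab.clone());
    assert_eq!(model.docs.len(), docs.len());
    assert_eq!(model.vocab.len(), vocab.len());
    // `docs` and `vocab` are still usable only because copies were passed in.
    (docs.len(), vocab.len())
}

/// With a borrowing constructor, no copies are needed.
fn borrowing_usage() -> (usize, usize) {
    let (docs, vocab) = sample_data();
    let model = BorrowingModel::new(&docs, &vocab);
    assert_eq!(model.docs.len(), docs.len());
    assert_eq!(model.vocab.len(), vocab.len());
    // The originals remain owned by the caller and usable afterwards.
    (docs.len(), vocab.len())
}
```

The borrowing variant avoids the extra copies, at the cost of a lifetime parameter that ties the model to the data it references.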