information-retrieval
Here are 882 public repositories matching this topic...
I'm submitting a ... (check one with "x")
[x] bug report
[ ] new distro request
If I try to change default colors with -c option, bold is removed from labels:
I didn't find any option t
Learning to Rank in TensorFlow
- Updated Oct 26, 2019 - 91 commits
- Python
Extensible search engine - for when you want to build searchable vector spaces
- Updated Oct 25, 2019 - 2 commits
- C#
A curated list of papers dedicated to neural text (semantic) matching.
- Updated Oct 25, 2019 - 45 commits
- Python
My Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
- Updated Oct 22, 2019 - 40 commits
- Python
Python Keyphrase Extraction module
- Updated Oct 25, 2019 - 268 commits
- Python
Tools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering.
- Updated Oct 22, 2019 - 38 commits
- Python
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
- Updated Oct 17, 2019 - 124 commits
Scrape any website, article or RSS/Atom Feed with ease!
- Updated Oct 23, 2019 - 172 commits
- Elixir
RMDL: Random Multimodel Deep Learning for Classification
- Updated Oct 23, 2019 - 453 commits
- Python
Extract subdomains from SSL certificates in HTTPS sites.
- Updated Oct 19, 2019 - 227 commits
- Python
Spark-Crawler: Evolving Apache Nutch to run on Spark.
- Updated Oct 15, 2019 - 417 commits
- Java
A math-aware search engine.
- Updated Oct 21, 2019 - 594 commits
- C
Hi and thanks for the project. I am considering using it for research, and thus just began to read the documentation.
I found 3 small and easy-to-fix bugs in the page https://pisa.readthedocs.io/en/latest/compress_index.html
- first bug: in the "Elias-Fano" item, the LaTeX code is not rendered properly (neither in the HTML doc nor in the .pdf document)
- second bug: the "Simple8b" is app
A Lucene toolkit for replicable information retrieval research
- Updated Oct 25, 2019 - 687 commits
- Java
Train a Word2Vec model or LSA model, and implement conceptual search/semantic search in Solr/Lucene - Simon Hughes, Dice.com, Dice Tech Jobs
- Updated Oct 7, 2019 - 84 commits
- Jupyter Notebook
An End-To-End Closed Domain Question Answering System.
- Updated Oct 25, 2019 - 236 commits
- Python
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction, knowledge graph, text generation, network embedding
- Updated Oct 23, 2019 - 589 commits
- OpenEdge ABL
Burp Extender plugin that generates a sitemap of a website using Wayback Machine
- Updated Oct 21, 2019 - 30 commits
- Java
Trinity IR Infrastructure
- Updated Oct 1, 2019 - 256 commits
- C++
My (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
- Updated Oct 10, 2019 - 26 commits
- Python
Search Formula-1: a distributed, high-performance massive data engine for enterprise/vertical search
- Updated Aug 13, 2019 - 6 commits
- C++
K-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
- Updated Oct 14, 2019 - 53 commits
- Python
HDLTex: Hierarchical Deep Learning for Text Classification
- Updated Oct 22, 2019 - 244 commits
- Python
Resources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"
- Updated Oct 13, 2019 - 37 commits
- TeX
Word Embeddings for Information Retrieval
- Updated Oct 16, 2019 - 367 commits
- Python
The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning
- Updated Aug 13, 2019 - 372 commits
- Java



We currently support reading FastText models from Facebook's format. The gensim.models._fasttext_bin module does this.
This enables people to use gensim with a model that was trained using Facebook's binaries.
Sometimes, people want things to work the other way: they start with gensim, train a model, and then