#
information-retrieval
Here are 1,150 public repositories matching this topic...
Apache Lucene and Solr open-source search software
-
Updated
Oct 4, 2020 - Java
Fetches system/theme information in terminal for Linux desktop screenshots.
-
Updated
Jul 21, 2020 - Shell
Open
Naming inconsistency
16
ogvalt
commented
Apr 25, 2020
Describe the bug
I found that some names agruments in framework aren't consistent.
So for example:
class SupervisedRunner(Runner):
"""Runner for experiments with supervised model."""
_experiment_fn: Callable = SupervisedExperiment
def __init__(
self,
model: Model = None,
device: Device = None,
input_key: Any = "features",
Learning to Rank in TensorFlow
-
Updated
Aug 29, 2020 - Python
Deep neural network to extract intelligent information from invoice documents.
information-retrieval
deep-neural-networks
deep-learning
invoices
keras
information-extraction
classification
invoice
billing
deeplearning
keras-neural-networks
invoice-pdf
invoice-management
keras-tensorflow
invoice-software
invoice-insight
invoice-parser
-
Updated
Aug 27, 2020 - Python
Python Keyphrase Extraction module
python
natural-language-processing
information-retrieval
keyword
computational-linguistics
keyword-extraction
keyphrase-extraction
keyphrase
-
Updated
Sep 24, 2020 - Python
A collection of research on knowledge graphs
natural-language-processing
information-retrieval
paper
survey
knowledge-graph
question-answering
representation-learning
cross-modal
knowledge-graph-completion
ner
dialogue-systems
reasoning
relation-extraction
commonsense
temporal-knowledge-graph
recommendation-systems
meta-relational-learning
-
Updated
Aug 6, 2020 - JavaScript
A curated list of papers dedicated to neural text (semantic) matching.
-
Updated
Sep 14, 2020 - HTML
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
machine-learning
natural-language-processing
information-retrieval
clustering
record-linkage
fuzzy-matching
deduplication
-
Updated
Sep 2, 2020 - JavaScript
Document-oriented search engine with HTTP API.
-
Updated
Jun 9, 2020 - C#
A Lucene toolkit for replicable information retrieval research
-
Updated
Sep 26, 2020 - Java
My Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
-
Updated
Jun 5, 2017 - Python
nlp
natural-language-processing
information-retrieval
deep-learning
transformers
pytorch
artificial-intelligence
question-answering
reading-comprehension
bert
-
Updated
Apr 30, 2020 - Python
PISA: Performant Indexes and Search for Academia
-
Updated
Oct 4, 2020 - C++
Tools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
-
Updated
Dec 24, 2018 - Python
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
natural-language-processing
information-retrieval
corpus
language-detection
embeddings
named-entity-recognition
normalizer
spell-check
persian-language
stemmer
dependency-parser
persian-nlp
part-of-speech-tagger
morphological-analysis
persian-stemmer
shallow-parser
-
Updated
Sep 29, 2020
RMDL: Random Multimodel Deep Learning for Classification
machine-learning
information-retrieval
text-mining
data-mining
deep-neural-networks
deep-learning
text-classification
tensorflow
keras
cnn
dnn
recurrent-neural-networks
classification
rnn
image-classification
ensemble-learning
convolutional-neural-networks
multimodel
-
Updated
Jul 1, 2020 - Python
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
search
search-engine
distributed-systems
information-retrieval
big-data
spark
solr
web-crawler
nutch
tika
sparkles
-
Updated
Oct 4, 2020 - Java
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
information-retrieval
text-classification
word2vec
text-generation
information-extraction
knowledge-graph
network-embedding
sequence-labeling
dialogue-systems
sentence2vec
machine-reading-comprehension
pretrained-language-model
-
Updated
Jun 1, 2020 - OpenEdge ABL
telegram group scraper tool. fetch all information about group members
linux
information-retrieval
telegram
python3
promotion
termux
information-gathering
smsbomber
termux-tool
telegram-scraper-bot
telegram-scraper
-
Updated
Aug 17, 2020 - Python
Scrape any website, article or RSS/Atom Feed with ease!
-
Updated
Jul 25, 2020 - Elixir
Extract subdomains from SSL certificates in HTTPS sites.
dns
ssl
information-retrieval
tool
https
certificates
discovery
subdomain
ssl-certificate
infosec
pentesting
pentest
pentest-scripts
pentest-tool
-
Updated
Aug 5, 2020 - Python
A math-aware search engine.
-
Updated
Jun 29, 2020 - C
BitMagic Library
c
c-plus-plus
information-retrieval
cmake
algorithm
avx
bit-manipulation
simd
integer-compression
sparse-vectors
sparse-matrix
bit-array
indexing-engine
bit-vector
adjacency-matrix
associative-array
sparse-vector
-
Updated
Oct 4, 2020 - C++
Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
python
search-engine
information-retrieval
solr
synonyms
word2vec-model
semantic-search
conceptual-search
synonym-files
-
Updated
Apr 26, 2019 - Jupyter Notebook
Track any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
linux
information-retrieval
ip-location
ip-geolocation
termux
hacking-tool
linux-tools
information-gathering
hacking-tools
termux-tool
termux-hacking
ip-tracer
gnuroot-debian
-
Updated
Aug 9, 2020 - PHP
Trinity IR Infrastructure
-
Updated
Nov 8, 2019 - C++
2
freakeinstein
commented
Jul 13, 2019
Current docker image size is insane. It is 2.55 GB. Reduce that to below 1GB or less. Apply changes from this reference: https://hackernoon.com/tips-to-reduce-docker-image-sizes-876095da3b34
Burp Extender plugin that generates a sitemap of a website using Wayback Machine
-
Updated
May 8, 2018 - Java
Improve this page
Add a description, image, and links to the information-retrieval topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the information-retrieval topic, visit your repo's landing page and select "manage topics."


Not a high-priority at all, but it'd be more sensible for such a tutorial/testing utility corpus to be implemented elsewhere - maybe under
/test/or some other data- or doc- related module – rather than ingensim.models.word2vec.Originally posted by @gojomo in RaRe-Technologies/gensim#2939 (comment)