-
Updated
May 3, 2022 - Python
#
information-retrieval
Here are 1,658 public repositories matching this topic...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
python
machine-learning
information-retrieval
data-mining
ocr
deep-learning
image-processing
cnn
pytorch
lstm
optical-character-recognition
crnn
scene-text
scene-text-recognition
easyocr
bug
Issue described a bug
difficulty easy
Easy issue: required small fix
good first issue
Issue for new contributors (not required gensim understanding + very simple)
fasttext
Issues related to the FastText model
ZanSara
commented
Mar 16, 2022
Problem
Currently FARMReader will ask users to raise max_seq_length every time some samples are longer than the value set to it. However, this can be confusing if max_seq_length is already set to the maximum value allowed by the model, because raising it further will cause hard-to-read CUDA errors.
See #2177.
Solution
We should find a way to query the model for the maximum va
type:feature
New feature or request
good first issue
Good for newcomers
topic:models
journey:intermediate
e.g. model training, api, evaluation...
Apache Lucene and Solr open-source search software
-
Updated
Apr 8, 2022
Fetches system/theme information in terminal for Linux desktop screenshots.
-
Updated
Feb 23, 2022 - Shell
Accelerated deep learning R&D
python
infrastructure
machine-learning
natural-language-processing
information-retrieval
research
reinforcement-learning
computer-vision
deep-learning
text-classification
distributed-computing
image-processing
pytorch
image-classification
metric-learning
recommender-system
object-detection
image-segmentation
reproducibility
text-segmentation
-
Updated
Apr 29, 2022 - Python
Learning to Rank in TensorFlow
-
Updated
Apr 26, 2022 - Python
Deep neural network to extract intelligent information from invoice documents.
information-retrieval
deep-neural-networks
deep-learning
invoices
keras
information-extraction
classification
invoice
billing
deeplearning
keras-neural-networks
invoice-pdf
invoice-management
keras-tensorflow
invoice-software
invoice-insight
invoice-parser
-
Updated
Jul 8, 2021 - Python
A collection of research on knowledge graphs
natural-language-processing
information-retrieval
paper
survey
knowledge-graph
question-answering
representation-learning
cross-modal
knowledge-graph-completion
ner
dialogue-systems
reasoning
relation-extraction
commonsense
temporal-knowledge-graph
recommendation-systems
meta-relational-learning
-
Updated
Mar 24, 2022 - JavaScript
Python Keyphrase Extraction module
python
natural-language-processing
information-retrieval
keyword
computational-linguistics
keyword-extraction
keyphrase-extraction
keyphrase
-
Updated
Apr 20, 2022 - Python
Track any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
linux
information-retrieval
ip-location
ip-geolocation
termux
hacking-tool
linux-tools
information-gathering
hacking-tools
termux-tool
termux-hacking
ip-tracer
gnuroot-debian
-
Updated
Feb 12, 2022 - PHP
Apache Lucene open-source search software
-
Updated
May 5, 2022 - Java
Resources to learn more about Machine Learning and Artificial Intelligence
machine-learning
natural-language-processing
information-retrieval
reinforcement-learning
deep-learning
artificial-intelligence
knowledge-graph
question-answering
probabilistic-programming
bayesian-inference
recommender-systems
causal-inference
knowledge-representation
reasoning
-
Updated
Feb 8, 2022
telegram group scraper tool. fetch all information about group members
linux
information-retrieval
telegram
python3
promotion
termux
information-gathering
smsbomber
termux-tool
telegram-scraper-bot
telegram-scraper
-
Updated
Jun 20, 2021 - Python
Anserini is a Lucene toolkit for reproducible information retrieval research
-
Updated
May 2, 2022 - Java
A curated list of papers dedicated to neural text (semantic) matching.
-
Updated
Dec 6, 2020 - HTML
Information Gathering Instagram.
python
linux
instagram
information-retrieval
scraper
osint
python3
instagram-scraper
termux
information-gathering
termux-tool
-
Updated
May 4, 2022 - Python
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
machine-learning
natural-language-processing
information-retrieval
clustering
record-linkage
fuzzy-matching
deduplication
-
Updated
May 5, 2021 - JavaScript
PISA: Performant Indexes and Search for Academia
-
Updated
Apr 13, 2022 - C++
nlp
natural-language-processing
information-retrieval
deep-learning
transformers
pytorch
artificial-intelligence
question-answering
reading-comprehension
bert
-
Updated
Apr 30, 2020 - Python
Hardware-accelerated vector database and search engine. Available as a HTTP service or as an embedded library.
search
search-engine
machine-learning
information-retrieval
nlu
vector-space-model
vector-space
search-algorithms
resin
nlu-engine
-
Updated
Apr 12, 2022 - C#
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
natural-language-processing
information-retrieval
corpus
language-detection
embeddings
named-entity-recognition
normalizer
spell-check
persian-language
stemmer
dependency-parser
persian-nlp
part-of-speech-tagger
morphological-analysis
persian-stemmer
shallow-parser
-
Updated
Apr 29, 2022
My Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
-
Updated
Jun 5, 2017 - Python
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
nlp
elasticsearch
benchmark
information-retrieval
deep-learning
retrieval
pytorch
dataset
bert
dpr
passage-retrieval
question-generation
sentence-transformers
sbert
zero-shot-retrieval
colbert
retrieval-models
ance
use-qa
-
Updated
Apr 16, 2022 - Python
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
-
Updated
May 5, 2022 - Python
allRank is a framework for training learning-to-rank neural models based on PyTorch.
python
machine-learning
information-retrieval
deep-learning
pytorch
transformer
ranking
learning-to-rank
ndcg
click-model
-
Updated
Aug 16, 2021 - Python
ACL'2021: Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too
-
Updated
Apr 27, 2022 - Python
Tools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
-
Updated
Dec 24, 2018 - Python
Open
Tor support
2
edoardottt
commented
Nov 13, 2020
enhancement
New feature or request
help wanted
Extra attention is needed
good first issue
Good for newcomers
GopherZone
Apache Solr open-source search software
-
Updated
May 4, 2022 - Java
Improve this page
Add a description, image, and links to the information-retrieval topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the information-retrieval topic, visit your repo's landing page and select "manage topics."


In gensim/models/fasttext.py: