Build software better, together

nltk / nltk

Open

In CI, refresh `nltk_data` cache if the hash of `index.xml` differs from the cached hash

4

tomaarsen commented Dec 16, 2021

Rather than simply caching nltk_data until the cache expires and it's forced to re-download the entire nltk_data, we should perform a check on the index.xml which refreshes the cache if it differs from some previous cache.

I would advise doing this in the same way that it's done for requirements.txt:
https://github.com/nltk/nltk/blob/59aa3fb88c04d6151f2409b31dcfe0f332b0c9ca/.github/wor

Manually wrap output lines in HOWTO files

Open

Expand ~ in paths

1

Find more good first issues

sloria / TextBlob

Star

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

python nlp natural-language-processing pattern nltk python-3 python-2

Updated Dec 9, 2021
Python

dipanjanS / practical-machine-learning-with-python

Star

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

python machine-learning natural-language-processing computer-vision deep-learning jupyter notebook clustering tensorflow scikit-learn keras jupyter-notebook pandas spacy nltk classification convolutional-neural-networks prophet statsmodels time-series-analysis

Updated Oct 1, 2020
Jupyter Notebook

alfredfrancis / ai-chatbot-framework

Star

A python chatbot framework with Natural Language Understanding and Artificial Intelligence.

python ai natural-language sklearn chatbot conversation nltk chatbots ai-chatbot-framework

Updated Mar 22, 2022
Python

shirosaidev / stocksight

Star

Stock market analyzer and predictor using Elasticsearch, Twitter, News headlines and Python natural language processing and sentiment analysis

python elasticsearch natural-language-processing twitter sentiment-analysis sentiment twitter-streaming-api stock-market nltk stock-price-prediction tweepy twitter-sentiment-analysis vader-sentiment-analysis stock-prediction textblob stock-analysis stock-analyzer

Updated Feb 26, 2021
Python

dipanjanS / text-analytics-with-python

Star

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

python semantic natural-language-processing sentiment-analysis text-classification clustering pattern natural-language scikit-learn sentiment spacy nltk text-summarization gensim stanford-nlp text-analytics

Updated Dec 24, 2020
Jupyter Notebook

csurfer / rake-nltk

Star

Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.

python text-mining algorithm nltk keyword-extraction

Updated Sep 18, 2021
Python

nltk / nltk_data

Star

NLTK Data

nlp natural-language-processing linguistics nltk corpora

Updated Mar 17, 2022
Python

alexgreene / WikiQuiz

Star

Generates a quiz for a Wikipedia page using parts of speech and text chunking.

wikipedia nltk

Updated Jul 15, 2020
JavaScript

cltk / cltk

Star

The Classical Language Toolkit

python nlp ai latin greek spacy nltk stanza ling historical-linguistics

Updated Feb 11, 2022
Python

csurfer / gitsuggest

Star

A tool to suggest github repositories based on the repositories you have shown interest in.

github repository nltk lda-model suggestion-engine

Updated Jun 1, 2021
Python

thalesgroup-cert / Watcher

Star

Watcher - Open Source Cybersecurity Threat Hunting Platform. Developed with Django & React JS.

Updated Feb 17, 2022
Python

hb20007 / hands-on-nltk-tutorial

Star

The hands-on NLTK tutorial for NLP in Python

nlp tutorial didactic binder jupyter notebook jupyter-notebook tutorials nltk notebooks jupyter-notebooks nlp-resources nlp-machine-learning nltk-library nltk3 notebook-jupyter

Updated Jan 17, 2019
Jupyter Notebook

ggeop / Python-ai-assistant

Star

Python AI assistant 🧠

python nlp ai mongodb sklearn pymongo voice-commands voice-recognition nltk voice-chat voice-control python35 nlp-machine-learning wolfram-language voice-assistant google-speech-recognition voice-activity-detection voice-recognition-experiment google-speech-to-text linux-assistant

Updated Mar 17, 2022
Python

vardanagarwal / Proctoring-AI

Star

Creating a software for automatic monitoring in online proctoring

opencv automation proctoring nltk eye-tracking ssd face-detection speech-to-text dlib hacktoberfest mobilenet vision-and-language tflite yolov3 face-spoofing proctoring-ai phone-detection

Updated Jun 21, 2021
Python

javedsha / text-classification

Star

Machine Learning and NLP: Text Classification using python, scikit-learn and NLTK

python nlp text-classification scikit-learn nltk machinelearning

Updated Jul 10, 2019
Jupyter Notebook

chen0040 / keras-english-resume-parser-and-analyzer

Star

keras project that parses and analyze english resumes

deep-learning keras recurrent-neural-networks nltk convolutional-neural-networks resume-parser

Updated Apr 22, 2020
Python

NLP-kr / tensorflow-ml-nlp

Star

텐서플로우와 머신러닝으로 시작하는 자연어처리(로지스틱회귀부터 트랜스포머 챗봇까지)

Updated Sep 14, 2020
Jupyter Notebook

gionanide / Speech_Signal_Processing_and_Classification

Star

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].

nlp classifier natural-language-processing feature-extraction nltk gaussian-mixture-models support-vector-machines mfcc principal-component-analysis speech-processing linear-discriminant-analysis isomap spectral-clustering long-short-term-memory kernel-pca spectral-embedding locally-linear-embedding linear-prediction-coefficients speech-utterance

Updated Dec 6, 2021
Python

fendouai / Awesome-Text-Classification

Star

Awesome-Text-Classification Projects,Papers,Tutorial .

nlp text-mining awesome text-classification tensorflow text-analysis nltk classification

Updated Nov 24, 2017

dipanjanS / nlp_workshop_odsc_europe20

Star

Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and Topic Models.

python machine-learning natural-language-processing deep-learning tensorflow scikit-learn jupyter-notebook transformers pytorch spacy nltk gensim transfer-learning

Updated Sep 18, 2020
Jupyter Notebook

cristianzsh / youtube-video-maker

Star

📹 A tool for automatic video creation and uploading on YouTube

automation youtube video ffmpeg watson wikipedia video-processing robots nltk google-api ibm-watson video-maker

Updated Dec 20, 2020
Python

anujvyas / Natural-Language-Processing-Projects

Star

This repository consists of all my NLP Projects

python nlp natural-language-processing sentiment-analysis text-classification wordcloud nltk stemming lemmatization

Updated Nov 10, 2020
Jupyter Notebook

biolab / orange3-text

Sponsor

Star

Open

Corpus: save data set as relative, check if in same folder

ajdapretnar commented May 25, 2020

Text version

0.9.1

Orange version

3.26.0.dev

Expected behavior

When sharing workflows with o

gyanesh-m / Sentiment-analysis-of-financial-news-data

Star

Sentiment Analysis of news on stock prices

scraper sentiment-analysis nltk news-data

Updated Mar 2, 2022
Python

TiesdeKok / Python_NLP_Tutorial

Star

This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)

python nlp natural-language-processing text-mining research spacy nltk computational-linguistics textblob textual-analysis

Updated Jun 5, 2020
Jupyter Notebook

lfcipriani / punkt-segmenter

Star

Ruby port of the NLTK Punkt sentence segmentation algorithm

ruby nltk ruby-port nlp-library sentence-tokenizer rubynlp sentence-boundaries tokenized-sentences punkt-segmenter

Updated Jun 10, 2018
Ruby

saidziani / Arabic-News-Article-Classification

Star

Automatic categorization of documents, consists in assigning a category to a text based on the information it contains. We'll follow different approach of Supervised Machine Learning.

nlp machine-learning python3 nltk corpora arabic-nlp arabic-language text-categorization

Updated Jan 1, 2019
Python

prabhakar267 / vertikin

Star

👓 Platform to automatically detect what user might be interested in buying in near future

android python flask natural-language-processing nltk walmart buying python-flask buying-trends

Updated Jun 2, 2021
Python

arop / ner-re-pt

Star

Named entity extraction from Portuguese web text

nlp spacy named-entity-recognition nltk stanford-corenlp opennlp

Updated Aug 16, 2017
Python

Feb	MAR	Apr
	22
2021	2022	2023

nltk

Here are 2,311 public repositories matching this topic...

nltk / nltk

In CI, refresh `nltk_data` cache if the hash of `index.xml` differs from the cached hash

Manually wrap output lines in HOWTO files

Expand ~ in paths

sloria / TextBlob

dipanjanS / practical-machine-learning-with-python

alfredfrancis / ai-chatbot-framework

shirosaidev / stocksight

dipanjanS / text-analytics-with-python

csurfer / rake-nltk

nltk / nltk_data

alexgreene / WikiQuiz

cltk / cltk

csurfer / gitsuggest

thalesgroup-cert / Watcher

hb20007 / hands-on-nltk-tutorial

ggeop / Python-ai-assistant

vardanagarwal / Proctoring-AI

javedsha / text-classification

chen0040 / keras-english-resume-parser-and-analyzer

NLP-kr / tensorflow-ml-nlp

gionanide / Speech_Signal_Processing_and_Classification

fendouai / Awesome-Text-Classification

dipanjanS / nlp_workshop_odsc_europe20

cristianzsh / youtube-video-maker

anujvyas / Natural-Language-Processing-Projects

biolab / orange3-text

Corpus: save data set as relative, check if in same folder

Text version

Orange version

Expected behavior

gyanesh-m / Sentiment-analysis-of-financial-news-data

TiesdeKok / Python_NLP_Tutorial

lfcipriani / punkt-segmenter

saidziani / Arabic-News-Article-Classification

prabhakar267 / vertikin

arop / ner-re-pt

Improve this page

Add this topic to your repo