The Wayback Machine - https://web.archive.org/web/20220813003222/http://github.com/kavgan/

Highlights

  • Pro

Pinned

  1. Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, …

    Jupyter Notebook, 976 stars, 755 forks

  2. Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used with languages other than English.

    Python, 115 stars, 43 forks

  3. ROUGE-2.0 Public

    ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.

    Java, 182 stars, 39 forks

  4. OpinRank Public

    OpinRank Dataset. Contains user reviews for two kinds of entities, cars and hotels: full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).

    31 stars, 10 forks

  5. word_cloud Public

    Python word cloud library for use within Jupyter notebook and Python apps.

    Jupyter Notebook, 36 stars, 11 forks

  6. This repo contains the code and dataset for the Opinosis Summarization Framework.

    49 stars, 18 forks

4 contributions in the last year

Contribution activity

August 2022

kavgan has no activity yet for this period.