data-exploration

Here are 255 public repositories matching this topic...

odd-platform

Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It can also run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

  • Updated Jul 14, 2022
  • C++
sound_classification_ml_production

In this repository you will find an end-to-end, hands-on tutorial on machine learning in production. The objective is to create and deploy in the cloud a machine learning application that can recognize and classify different audio sounds.

  • Updated Feb 8, 2021
  • Jupyter Notebook

We will analyze a dataset provided by an e-commerce marketplace called [Olist](https://www.olist.com) to answer the CEO's questions: Should Olist remove underperforming sellers from its marketplace? How can it increase customer satisfaction (so as to increase profit margin) while maintaining a healthy order volume?

  • Updated Dec 7, 2021
  • HTML
vc

A Python repo illustrating principles for making your data exploration more efficient and your plots more readable and consistent. It contains 14 scripts, each illustrating a particular point, and a companion presentation that expounds on the points. It makes heavy use of the Streamlit module, which is brilliant for making interactive plots, and the Bokeh module, which lets you do things like labeling data via plot interactions.

  • Updated Jun 27, 2022
  • Python
