-
Updated
Mar 31, 2022 - Jupyter Notebook
#data-exploration
Here are 254 public repositories matching this topic...
Create HTML profiling reports from pandas DataFrame objects
python
data-science
machine-learning
statistics
deep-learning
jupyter
pandas-dataframe
exploratory-data-analysis
jupyter-notebook
eda
pandas
exploration
data-analysis
html-report
data-exploration
hacktoberfest
pandas-profiling
data-quality
data-profiling
big-data-analytics
Visualize and compare datasets, target values and associations, with one line of code.
python
data-science
machine-learning
statistics
pandas-dataframe
exploratory-data-analysis
eda
pandas
data-visualization
exploration
data-analysis
data-exploration
data-profiling
-
Updated
Oct 1, 2021 - Python
DataPrep — The easiest way to prepare data in Python
data-science
connector
exploratory-data-analysis
eda
apis
data-exploration
cleaning
dataprep
datacleaning
dataconnector
apiwrapper
webconnector
datapreparation
-
Updated
Apr 3, 2022 - Python
data-science
machine-learning
spark
bigdata
data-transformation
pyspark
data-extraction
data-analysis
data-wrangling
dask
data-exploration
data-preparation
data-cleaning
data-profiling
data-cleansing
big-data-cleaning
data-cleaner
cudf
dask-cudf
-
Updated
Apr 1, 2022 - Python
Data Explorer by Keen - point-and-click interface for analyzing and visualizing event data.
charts
dataviz
analytics
analysis
web-analytics
data-visualization
data-exploration
keen-io
analytics-api
native-analytics
-
Updated
Feb 14, 2022 - TypeScript
Feature exploration for supervised learning
-
Updated
Feb 4, 2021 - Jupyter Notebook
Automate Data Exploration and Treatment
-
Updated
Aug 12, 2021 - R
A library for managing, validating, summarizing, and visualizing data.
data-science
statistics
spark
plotly
pandas
data-visualization
dataops
data-analysis
matplotlib
dask
data-exploration
pandas-summary
dataframes
data-summary
data-quality-checks
data-quality
data-profiling
mlops
data-quality-monitoring
data-reporting
-
Updated
Feb 17, 2022 - Python
Automated exploratory data analysis and visualization tools. An application for automated data exploration and analysis with intelligent visualization design.
visualization
machine-learning
eda
data-visualization
recommendation
data-exploration
datamining
automated-data-science
automated-insights
augmented-analytic
automated-visualization
automated-data-visualization
automated-data-analysis
-
Updated
Mar 30, 2022 - TypeScript
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
interactive
plotly
jupyter-notebook
eda
pandas
data-visualization
seaborn
data-analysis
matplotlib
data-exploration
pyhon
altair
project-jupyter
qgrid
edaviz
data-sciene
exploratory-data
-
Updated
Nov 20, 2019 - Python
zblz commented Aug 15, 2017
Currently all of the metrics computed are independent of a target variable or column, but if lens.summarise took the name of a column as the target variable, the output of some metrics could be more interpretable even if the target variable is not used in any kind of predictive modelling.
A good example of this could be PCA (see #14), which could plot the different categories of the target va
bioinformatics
gene-expression
data-visualization
awesome-list
data-exploration
atlas
rna-seq-data
gene-expression-profiles
expression-browser
humancellatlas
-
Updated
Oct 5, 2021
R package that makes basic data exploration radically simple (interactive data exploration, reproducible data science)
-
Updated
Jan 29, 2022 - R
Enjoy your transcriptomic data and analysis responsibly - like sipping a cocktail
gui
r
shiny
reproducible-research
gene-expression
data-visualization
bioconductor
transcriptome
user-friendly
data-exploration
transcriptomics
rna-seq-analysis
pathway-analysis
rna-seq-data
bioconductor-package
functional-enrichment-analysis
-
Updated
Mar 30, 2022 - R
Multidimensional data explorer and visualization tool.
-
Updated
May 23, 2017 - HTML
A light and fast implementation of web pivot table / pivot chart components.
visualization
excel
eda
pivot-tables
business-intelligence
olap
pivot-chart
cube
tableau
data-exploration
-
Updated
Mar 8, 2022 - JavaScript
A collection of Jupyter notebooks exploring different datasets.
python
python-tutorials
mapping
jupyter-notebook
tutorials
pandas
data-visualization
data-analysis
matplotlib
data-exploration
-
Updated
Jul 24, 2021 - Jupyter Notebook
Grep through all Grafana entities in the spirit of git-wtf.
search
search-in-text
grafana
toolbox
grafana-client
data-exploration
search-replace
search-and-replace
grafana-utils
grafana-toolbox
-
Updated
Mar 25, 2022 - Python
bamboolib - template for creating your own binder notebook
docker
data-science
data-transformation
data-visualization
data-visualisation
data-viz
data-exploration
binder-jupyter-notebook
-
Updated
Dec 14, 2021 - Jupyter Notebook
Light, personalized, interactive dashboards for urban data exploration.
-
Updated
Feb 11, 2022 - CSS
A User-Friendly Application for Exploratory Factor Analysis
ggplot2
r
shiny
data-visualization
exploratory-factor-analysis
data-management
data-exploration
factor-loadings
factor-extraction
-
Updated
Mar 22, 2019 - R
A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.
data-science
data
machine-learning
ai
data-validation
cross-validation
eda
data-visualization
datascience
classification
data-exploration
concept-drift
data-integrity
model-compression
model-comparison
evaluate-models
ml-models
-
Updated
Jan 4, 2022 - Jupyter Notebook
Explores the relationships between various features and the sale price of a house using exploratory data analysis and statistical analysis. Applies ML algorithms such as Multiple Linear Regression, Ridge Regression, and Lasso Regression in combination with cross-validation. Performs parameter tuning, compares the test scores, and suggests a best model for predicting the final sale price of a house. Seaborn is used to plot graphs and the scikit-learn package is used for statistical analysis.
python
machine-learning
correlation
linear-regression
cross-validation
data-visualization
data-extraction
data-analysis
regularization
standardization
datawrangling
predictive-modeling
ridge-regression
data-exploration
k-fold
lasso-regression
encoding-library
parameter-tuning
root-mean-squared-error-metric
regression-analysis
-
Updated
Jan 19, 2018 - Jupyter Notebook
Applying Machine Learning Algorithms to the Kaggle "Titanic Survival Prediction Problem".
machine-learning
machine-learning-algorithms
artificial-intelligence
titanic-kaggle
data-exploration
titanic-survival-prediction
titanic-problem
-
Updated
Jun 16, 2017 - Python
A simple wrapper for SQL connections using SQLAlchemy and Pandas read_sql to standardize SQL workflow with multiple data sources.
data-science
sqlalchemy
sql
pandas
data-analytics
data-analysis
relational-databases
data-exploration
standardized-api
-
Updated
Jan 7, 2022 - Python
This case study is contained within the Serious SQL course by Danny Ma
-
Updated
Sep 14, 2021 - SQL
Slides, code and more for my class: Data Analytics and Machine Learning on Big Data
-
Updated
Dec 21, 2017 - Jupyter Notebook
Build a data catalog by running a single call with reading privileges
-
Updated
Mar 16, 2022 - Python
CodersMUSE is a prototype implementation to explore multi-modal data of program-comprehension experiments.
-
Updated
Jan 28, 2022 - Python


Describe the bug
Currently, some places in the project return paged responses with hardcoded hasNext and total properties. These need to be fixed so that valid values are returned to the frontend, and the frontend needs to use these values instead of the current solution.
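The fix described in this issue could look roughly like the sketch below, which derives both values from the data instead of hardcoding them. It is only an illustration: the project's real language, types, and storage layer are not shown in the issue, so `Page`, `paginate`, `offset`, and `limit` are hypothetical names, and `has_next` stands in for the `hasNext` property mentioned above.

```python
from dataclasses import dataclass
from typing import Generic, List, Sequence, TypeVar

T = TypeVar("T")


@dataclass
class Page(Generic[T]):
    """Hypothetical paged response; field names are assumptions."""
    items: List[T]
    total: int      # number of items across all pages
    has_next: bool  # True when at least one item lies beyond this page


def paginate(rows: Sequence[T], offset: int, limit: int) -> Page[T]:
    """Slice `rows` and compute `total` and `has_next` from the data."""
    if offset < 0 or limit <= 0:
        raise ValueError("offset must be >= 0 and limit must be > 0")
    total = len(rows)
    items = list(rows[offset:offset + limit])
    # There is a next page only if items remain after this slice.
    return Page(items=items, total=total, has_next=offset + limit < total)
```

With a real database the total would usually come from a separate count query rather than `len(rows)`, but the `has_next` comparison stays the same.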