The Wayback Machine - https://web.archive.org/web/20211014035147/https://github.com/topics/data-science
Skip to content
#

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 22,114 public repositories matching this topic...

mburaksayici
mburaksayici commented Sep 30, 2021

Describe the issue linked to the documentation

Hi.
In https://scikit-learn.org/stable/auto_examples/model_selection/plot_roc.html , there's section on "Plot ROC curves for the multilabel problem". At the end, it plots roc curve as this image, https://scikit-learn.org/stable/_images/sphx_glr_plot_roc_002.png, containing table name = "...characterictics to multi-class" . So either name or the

superset
jinghua-qa
jinghua-qa commented Oct 6, 2021

Owner can not click and edit on the chart name when open a saved chart, can only do click edit when creating a new chart.

How to reproduce the bug

1, Explore saved chart own by you
2, Hover mouse over to chart's title
3, Click on chart title

Expected results

1,Should see tooltip "Click to edit"
2,Title is successfully edit and saved

Actual results

There is no opportunit

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated May 13, 2021
  • Python
pytorch-lightning
piseabhijeet
piseabhijeet commented Oct 6, 2021

🐛 Bug

Tensorboard logging metrics from different GPUs when using DataParallel training.

To Reproduce

Use any arbitrary toy model with DP as an accelerator and in training_step(), include the below line of code:

self.log(mode + '_loss', sum(losses), on_epoch=False, on_step=True, prog_bar=True, logger=True, sync_dist=True)

In the trainer, specify tensorboard logger with

dash
DWesl
DWesl commented Oct 7, 2021

Bug summary

Calling ax.contour(x, y, z) fails if ax was created with projection="3d" and x and y are both 1d and different lengths.

Code for reproduction

import numpy as np
import matplotlib.pyplot as plt

NX = 30
NY = 20
x = np.linspace(-10, 10, NX)
y = np.linspace(-10, 10, NY)
z = np.random.randint(0, 2, [NY, NX])

fig, ax = plt.subplots(subplot_k
danieldeutsch
danieldeutsch commented Jun 2, 2021

Is your feature request related to a problem? Please describe.
I typically used compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict command opens the file and reads lines for the Predictor. This fails when it tries to load data from my compressed files.

nni