The Wayback Machine - https://web.archive.org/web/20220725115443/https://github.com/topics/data-science

Data Science

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 28,225 public repositories matching this topic...

jeremiedbb commented May 25, 2022

PR #22722 introduced a common method for the validation of the parameters of an estimator. We now need to use it in all estimators.

Please open one PR per estimator or family of estimators (if one inherits from another). The title of the PR should mention which estimator it deals with, and the description of the PR should begin with "towards #23462".
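As a rough illustration of what a shared parameter-validation method can look like, here is a minimal, dependency-free sketch. It is not scikit-learn's implementation: `ToyEstimator`, `_constraints`, `_validate_params`, and `InvalidParameterError` as written here are hypothetical stand-ins chosen for this example, and the real constraint format introduced in PR #22722 may differ.

```python
from numbers import Integral, Real


class InvalidParameterError(ValueError):
    """Raised when a hyperparameter has an unsupported type or value."""


class ToyEstimator:
    # Hypothetical constraint table (an assumption for this sketch):
    # parameter name -> (allowed type, value check, human-readable description)
    _constraints = {
        "n_iter": (Integral, lambda v: v >= 1, "an int >= 1"),
        "tol": (Real, lambda v: v > 0, "a float > 0"),
    }

    def __init__(self, n_iter=10, tol=1e-3):
        # Store parameters untouched; validation is deferred to fit.
        self.n_iter = n_iter
        self.tol = tol

    def _validate_params(self):
        """Check every declared parameter against its constraint."""
        for name, (kind, is_valid, description) in self._constraints.items():
            value = getattr(self, name)
            # bool is a subclass of int, so reject it explicitly.
            if (
                isinstance(value, bool)
                or not isinstance(value, kind)
                or not is_valid(value)
            ):
                raise InvalidParameterError(
                    f"The {name!r} parameter must be {description}, got {value!r}."
                )

    def fit(self, X):
        # One shared call at the top of fit replaces ad hoc checks.
        self._validate_params()
        return self
```

With this layout, each estimator only declares its constraints, and the checking logic lives in one place.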

Steps

  • The estimator must define
Easy good first issue Meta-issue Validation
superset
rumbin commented Jan 31, 2022

The Mixed Time-Series chart type allows for configuring the title of the primary and the secondary y-axis.
However, while only the title of the primary axis is shown next to the axis, the title of the secondary one is placed at the upper end of the axis where it gets hidden by bar values and zoom controls.

How to reproduce the bug

  1. Create a mixed time-series chart
  2. Configure axi
good first issue #bug validation:validated preset:cares
MarcoGorelli commented Jul 23, 2022

Noticed in pandas-dev/pandas#47828 (comment)

Currently, it's possible to match multiple warnings by passing them as a tuple. But this isn't documented, nor do the type hints suggest it:

Example:

import warnings
import pytest
import pandas._testing as tm

def me():
    warnings.warn('foo', FutureWarning)
    warnings.warn('bar', UserWarning
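The excerpt above is cut off in the listing. As a separate, standard-library-only sketch of the idea of matching several warning categories through a tuple, the following may help. It does not use or reproduce the pandas testing helper discussed in the issue; `emit_two_warnings` and `collect_matching` are hypothetical names introduced for this illustration.

```python
import warnings


def emit_two_warnings():
    """Emit one FutureWarning and one UserWarning, as in the issue's example."""
    warnings.warn("foo", FutureWarning)
    warnings.warn("bar", UserWarning)


def collect_matching(func, categories):
    """Run func and return the recorded warnings whose category matches.

    `categories` may be a single warning class or a tuple of classes,
    because issubclass accepts either form.
    """
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning, regardless of the global filter state.
        warnings.simplefilter("always")
        func()
    return [w for w in caught if issubclass(w.category, categories)]


matched = collect_matching(emit_two_warnings, (FutureWarning, UserWarning))
print([w.category.__name__ for w in matched])
```

Passing a single class instead of a tuple keeps only the warnings of that category, which is why a signature that accepts both forms benefits from being documented and typed accordingly.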
Testing good first issue Warnings

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Apr 3, 2022
  • Python
karrdy89 commented Jul 21, 2022

Hello.
I am trying to integrate Ray with an existing web server built with FastAPI and uvicorn.
It uses HTTPS, and here is how the server is started.

app = FastAPI()
app.add_middleware(HTTPSRedirectMiddleware)

uvicorn.run("main:app",
            host="0.0.0.0",
            port=8080,
            ssl_keyfile="/home/ky/cert/key.pem",
            ssl_certfile="/home/ky/cert/cert.pem",
 
good first issue enhancement P2 serve
asaini commented Oct 1, 2021

Problem

See #3856. Developers would like to configure whether the developer menu or the viewer menu is displayed while developing in cloud IDEs such as Gitpod or GitHub Codespaces.

Solution

Create a config option

showDeveloperMenu: true | false | auto

where

  • true: always shows the developer menu locally and while deployed
  • false: always sho
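The option list above is truncated in this listing, so the exact semantics of `false` and `auto` are not visible here. Purely as a sketch of how a three-valued option like this could be resolved, the following assumes that `false` always hides the developer menu and that `auto` shows it only when the app runs locally. Both points are assumptions, and `resolve_show_developer_menu` and `is_local` are hypothetical names, not part of any real API.

```python
def resolve_show_developer_menu(setting, is_local):
    """Decide whether to show the developer menu.

    setting:  True, False, or the string "auto".
    is_local: whether the app is running on the developer's machine
              (hypothetical signal; how it is detected is out of scope).
    """
    if isinstance(setting, bool):
        # Explicit true/false overrides any environment detection.
        return setting
    if setting == "auto":
        # Assumption: "auto" means developer menu locally, viewer menu otherwise.
        return is_local
    raise ValueError(
        f"showDeveloperMenu must be true, false, or 'auto', got {setting!r}"
    )
```

Under these assumptions, a cloud IDE user who is reported as non-local could set the option to `true` to force the developer menu.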
enhancement good first issue
lightning
sarpaykent commented Jul 7, 2022

🚀 Feature

Currently, LRSchedulerConfig type is not supported by _configure_schedulers_automatic_opt.

Motivation

I created a custom ReduceLROnPlateau class that adds warmup epochs before ReduceLROnPlateau. However, the ReduceLROnPlateau sched

feature good first issue lr scheduler pl
dash
QuLogic commented Jul 14, 2022

Bug summary

When using Axes.indicate_inset/Axes.indicate_inset_zoom, it may be necessary to increase the size of edges in order to be visible. It is possible to change edge colour with edgecolor=... which affects both the frame and the connectors. However, using linewidth=... only affects the frame.

Code for reproduction

import matplotlib.pyplot as plt

fig, a
Good first issue
Data-Science-For-Beginners
soubhikmandal2000 commented Oct 31, 2021
  • Base README.md
  • Quizzes
  • Introduction base README
    • Defining Data Science README
    • Defining Data Science assignment
    • Ethics README
    • Ethics assignment
    • Defining Data README
    • Defining Data assignment
    • Stats and Probability README
    • Stats and Probability assignment
  • Working with Data base README
    • Rel
good first issue help wanted translations
gensim
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        window=m.ws,
        epochs=m.epoch,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
bug difficulty easy good first issue fasttext
nni