The Wayback Machine - https://web.archive.org/web/20220806214600/https://github.com/topics/data-science

Data Science

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 28,516 public repositories matching this topic...

swight-prc commented Jul 28, 2022

Describe the bug

The acceptable values for max_samples are (0, 1]. One possible option for class_weight is balanced_subsample. However, for values of max_samples near zero and class_weight='balanced_subsample', the model fails with an unhelpful error related to having an empty array as an indexer.
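The failure mode can be anticipated before fitting. The sketch below is not scikit-learn code: it assumes each tree's bootstrap size is roughly `round(n_samples * max_samples)` for a fractional `max_samples`, and it uses two hypothetical helpers, `bootstrap_size` and `usable_max_samples`, to drop grid values that would leave a tree with no rows.

```python
from typing import List


def bootstrap_size(n_samples: int, max_samples: float) -> int:
    """Approximate rows drawn per tree for a fractional max_samples (an assumption)."""
    if not 0.0 < max_samples <= 1.0:
        raise ValueError("max_samples must be in (0, 1]")
    return round(n_samples * max_samples)


def usable_max_samples(n_samples: int, candidates: List[float]) -> List[float]:
    """Keep only the candidates that give every tree at least one row."""
    return [m for m in candidates if bootstrap_size(n_samples, m) >= 1]


# With 150 training rows, 0.001 rounds down to zero rows per tree and is dropped.
grid = usable_max_samples(150, [0.001, 0.01, 0.5, 1.0])
print(grid)
```

Filtering the grid this way only sidesteps the problem in a search; it does not change how the estimator reports the error.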

This is only likely to come up in a grid search, as it seems unlikely that some

Labels: Bug, good first issue (Easy with clear instructions to resolve), help wanted, module:ensemble
superset
ktmud commented Jul 26, 2022

Time-series Bar Chart v2 does not update total values for stacked bar chart when toggling legends.

How to reproduce the bug

  1. Create a "Time-series Bar Chart v2"
  2. Go to "Customize" and select "Show value", "Stack series" and "Only total"
  3. Toggle series in legends
  4. The total value should update but it doesn't

The legacy Time-series Bar Chart does not have this issue.

Labels: good first issue (Good first issues for new contributors), viz:charts:echarts (Related to Echarts), viz:charts:bar (Related to the Bar chart), #bug (Bug report)
msilva-pass commented Jul 29, 2022

Summary: read_pickle from pandas versions > 1.2.5 is unable to load pickles created with pandas version 0.24 and Python 2.7. The documentation claims compatibility with pandas versions starting from 0.20, and it does work with pandas versions up to 1.2.5 and Python 3.9.

The desired behaviour is that it should work the same way it did until version 1.2.5, as is still described in the documentation.
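When chasing a cross-version failure like this, it can help to rule out the pickle layer itself. The sketch below uses only the standard library, not pandas: it writes a payload with protocol 2 (the highest protocol Python 2.7 can write) and reads it back with `encoding="latin1"`, an argument that is often needed for Python 2 pickles containing byte strings. The payload and variable names are illustrative, and this does not reproduce the reported pandas behaviour.

```python
import io
import pickle

# Illustrative payload; a real case would be the object stored in the old pickle.
payload = {"columns": ["a", "b"], "rows": [[1, 2], [3, 4]]}

# Write with protocol 2, which Python 2.7 can also produce and read.
buffer = io.BytesIO()
pickle.dump(payload, buffer, protocol=2)

# Read back the way a Python 3 process would typically read a Python 2 pickle.
buffer.seek(0)
restored = pickle.load(buffer, encoding="latin1")
print(restored == payload)
```

If a plain round trip like this succeeds but the pandas load still fails, the incompatibility is more likely in how the pickled objects are reconstructed than in the pickle format.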

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Apr 3, 2022
  • Python
asaini commented Oct 1, 2021

Problem

See #3856. Developers would like the ability to configure whether the developer menu or the viewer menu is displayed while they are developing in cloud IDEs like Gitpod or GitHub Codespaces.

Solution

Create a config option

showDeveloperMenu: true | false | auto

where

  • true: always shows the developer menu locally and while deployed
  • false: always sho
Labels: enhancement (New feature or request), good first issue (Good for newcomers)
lightning
otaj commented Jun 29, 2022

🚀 Typing coverage

Let's improve typing coverage of PyTorch Lightning together!

I'm creating a new issue in order to increase visibility. There are three older issues (#7037, #5023, #4698) which became stale over time.

Plan

Currently, there are 55 files which are excluded from mypy checks so that our CI does not fail. These files vastly differ in difficulty in order to make the t

Labels: help wanted (Open to be worked on), good first issue (Good for newcomers), let's do it! (approved to implement), code quality
dash
tacaswell commented Aug 3, 2022

I spent some time yesterday and today trying to get tests to run on old versions of Matplotlib, which... was harder than I wished it would be. We may want to have one very pinned requirements file (possibly managed by one of the auto-bump services 😞 or tools) that we rename to fully-pinned-v3.6.x.txt on branching. We can then use that (exclusively) for tests on the backport branch which should

Labels: Difficulty: Medium, Good first issue (Open a pull request against these issues if there are no active ones!)
Data-Science-For-Beginners
soubhikmandal2000 commented Oct 31, 2021
  • Base README.md
  • Quizzes
  • Introduction base README
    • Defining Data Science README
    • Defining Data Science assignment
    • Ethics README
    • Ethics assignment
    • Defining Data README
    • Defining Data assignment
    • Stats and Probability README
    • Stats and Probability assignment
  • Working with Data base README
    • Rel
Labels: good first issue (Good for newcomers), help wanted (Extra attention is needed), translations
gensim
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        window=m.ws,
        epochs=m.epoch,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
Labels: bug (Issue described a bug), difficulty easy (Easy issue: required small fix), good first issue (Issue for new contributors (not required gensim understanding + very simple)), fasttext (Issues related to the FastText model)
nni