The Wayback Machine - https://web.archive.org/web/20220515084528/https://github.com/topics/data-mining
Skip to content
#

data-mining

Here are 3,880 public repositories matching this topic...

LightGBM
jameslamb
jameslamb commented Jan 27, 2021

Summary

mypy shows some issues in LightGBM's Python package.

mypy \
    --exclude='python-package/compile/|python-package/build' \
    --ignore-missing-imports \
    python-package/
18 errors in 4 files (click me)
python-package/lightgbm/compat.py:12: error: Name 'Series' already defined (possibly by an import)
python-package
gensim
mpenkov
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        vector_size=m.dim,
        window=m.ws,
        window=m.ws,
        epochs=m.epoch,
        epochs=m.epoch,
        negative=m.neg,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
bug difficulty easy good first issue fasttext
fingoldo
fingoldo commented Mar 24, 2022

Problem:

_catboost.pyx in _catboost._set_features_order_data_pd_data_frame()

_catboost.pyx in _catboost.get_cat_factor_bytes_representation()

CatBoostError: Invalid type for cat_feature[non-default value idx=1,feature_idx=336]=2.0 : cat_features must be integer or string, real number values and NaN values should be converted to string.

Could you also print a feature name, not o

sktime
fkiraly
fkiraly commented May 8, 2022

Since recently, the following deprecation warning started appearing in many places:

FutureWarning: The default dtype for empty Series will be 'object' instead of 'float64' in a future version. Specify a dtype explicitly to silence this warning.

Due to sensitivity regarding dtype, that would mean tests, utilities, or estimators start breaking where construction starts with an empt

good first issue maintenance

人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域

  • Updated Feb 6, 2020

Improve this page

Add a description, image, and links to the data-mining topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-mining topic, visit your repo's landing page and select "manage topics."

Learn more