The Wayback Machine - https://web.archive.org/web/20200913140038/https://github.com/topics/feature-engineering
Skip to content
#

feature-engineering

Here are 726 public repositories matching this topic...

nni
CJStadler
CJStadler commented Jul 23, 2019

For example, if there is a relationship transaction.session_id -> sessions.id and we are calculating a feature transactions: sessions.SUM(transactions.value) any rows for which there is no corresponding session should be given the default value of 0 instead of NaN.

Of course this should not normally occur, but when it does it seems more reasonable to use the default_value.

`DirectF

mljar-supervised
pplonski
pplonski commented Sep 11, 2020

There can be a situation when all features are dropped during feature selection. Need to handle it. Maybe by throwing exception or raising a warning.

Code to reproduce:

import numpy as np
from supervised import AutoML

X = np.random.uniform(size=(1000, 31))
y = np.random.randint(0, 2, size=(1000,))

automl = AutoML(
    algorithms=["CatBoost", "Xgboost", "LightGBM"],
    model_t
RemixAutoML

Improve this page

Add a description, image, and links to the feature-engineering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the feature-engineering topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.