Suggestion for Catboost Tutorial #1117
Comments
|
This is a great idea for contributions! |
|
Is this still relevant? |
|
Yes, tutorials are always welcome! We have tutorials for classification problems, but I think we don't have one for regression. So please, contribute! |
|
How do I upload Jupyter notebooks in this thread? |
|
You could make a tutorial with Boston out of this battle for regression. 'metric_period':200, This is a weird combination. This means metrics are calculated only on every 200th iteration; you probably wanted to print them every 200 iterations instead. To do that, use verbose=200. Actually, you can see that this is not something you should use, because there is a warning: 'loss_function':'RMSE', If loss_function is RMSE, eval_metric is RMSE by default, so you don't need to set it.
I'm not sure why this combination is used. I would suggest first training with default parameters, then checking whether there is overfitting or underfitting and adjusting learning_rate accordingly. For regression it might also be useful to experiment with CTR settings, for example setting TargetBorderCount to 2 or 3 instead of 1. |
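A minimal sketch of the advice above, in case it helps anyone reading along. The helper names (`default_regression_params`, `with_ctr_borders`, `fit_regressor`) are made up for illustration and are not part of CatBoost. The `simple_ctr` string form is my assumption about how TargetBorderCount is passed, so please check it against the CTR settings documentation before relying on it.

```python
def default_regression_params(verbose_every=200):
    """Near-default parameters for a first regression run."""
    return {
        # With RMSE as the loss, eval_metric defaults to RMSE, so it is not set here.
        "loss_function": "RMSE",
        # Print progress every N iterations; metric_period is deliberately left out.
        "verbose": verbose_every,
    }


def with_ctr_borders(params, target_border_count=2):
    """Return a copy of params with a CTR setting to experiment with."""
    tuned = dict(params)
    # Assumption: simple_ctr accepts the 'Borders:TargetBorderCount=N' string form.
    tuned["simple_ctr"] = [f"Borders:TargetBorderCount={target_border_count}"]
    return tuned


def fit_regressor(params, X_train, y_train, X_val, y_val, cat_features):
    """Train a CatBoostRegressor and evaluate on a validation set."""
    # Imported here so the parameter helpers work without catboost installed.
    from catboost import CatBoostRegressor

    model = CatBoostRegressor(**params)
    model.fit(
        X_train,
        y_train,
        cat_features=cat_features,  # names or indices of categorical columns
        eval_set=(X_val, y_val),    # used to spot overfitting or underfitting
    )
    return model


baseline = default_regression_params()
experiment = with_ctr_borders(baseline, target_border_count=2)
print(baseline)
print(experiment)
```

The idea would be to fit `baseline` first, compare the train and validation RMSE in the printed log, adjust learning_rate only if the curves suggest it, and then try the `experiment` variant to see whether the CTR change helps.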
|
OK, noted with thanks. That's how the instructor passed me this file. |


Problem: Request for a Catboost Tutorial for Regression problems
catboost version: Any version
Operating System: Windows
CPU: i7
GPU: None
Hi Yandex, I am currently learning how to use Catboost for ML projects. I would love to have a tutorial on regression problems using a real data set that consists of a mixture of categorical and numerical features.
Please do not use generic datasets like Boston Housing and the like. You could select a proper dataset from Kaggle or a similar source as an example.
The tutorial must be geared toward first-time users, hence comments and guidance in the Jupyter Notebook will be helpful.
Thanks much.