Suggestion for Faster way to optimize vs GridSearch #69

mitrymoe · 2020-05-13T19:17:28Z

Hey,

First of all thank you for the hard work put into this, it's a really awesome project that I still learn how to use.
I'm not that proficient in using python so that is why I'll put my suggestion here. I have used python more for machine learning stuff and there the option of searching hyperparameters was also between grid and random search. However, I found that using bayesian optimization, code runs way faster because it add probabilities to the mix by building a surrogate probability model of the objective function. That being said, you can have a look maybe over this to find out more here

My guess is that this can be applied to your maximize_func in some way and computations should, in theory, run faster.
I'll spend the next few days trying and playing around with this and if I come up with something, I'll let you know.

Keep it up!

kernc · 2020-05-14T15:52:27Z

Thanks for the kind words. ❤️

bayesian optimization

Sounds useful. We should certainly look into using that package as a library:

https://github.com/fmfn/BayesianOptimization#1-specifying-the-function-to-be-optimized

In most cases probably even something as simple as random grid search should work much faster, while still giving sufficient ballpark results:

backtesting.py/backtesting/backtesting.py

Lines 816 to 823 in 7010d68

    
           heatmap = pd.Series(np.nan, 
        
                               index=pd.MultiIndex.from_tuples([p.values() for p in param_combos], 
        
                                                               names=next(iter(param_combos)).keys())) 
        
           # TODO: add parameter `max_tries:Union[int, float]=None` which switches 
        
           # exhaustive grid search to random search. This might need to avoid 
        
           # returning NaNs in stats on runs with no trades to differentiate those 
        
           # from non-tested parameter combos in heatmap.

ttfreeman · 2020-09-27T22:55:30Z

I would suggest using scikit-optimize? The random search is not guaranteed to converge faster to the optimum point because it does not learn from previous trial points.

kernc · 2020-10-01T19:27:00Z

We could use skopt as our constraints do match. 👍 The question is, will you implement it? Random search should be roughly a 5-line change.

ttfreeman · 2020-10-01T20:53:02Z

Sure, if nothing comes up I'll work on this in the next few days. I'll also look into skopt. So, we can just add a parameter method to optimize() method, and that way, different techniques of optimization can be chosen based on the input for method param.

kernc · 2020-10-01T22:05:26Z

Exactly. For random search I also envisioned an argument: max_tries: Union[int, float] = 200. Integer for absolute; float for relative to exhaustive search. This should then match n_calls= for skopt.

ttfreeman · 2020-10-04T21:21:04Z

I have implemented skopt's forest_minimize() method with encouraging results. Just need to validate the things and I also think it would be helpful to add the partial dependency plots. I am looking at how you've used bokeh and will add a function to plot this after the optimization. Hopefully will have something for your review next weekend.

kernc · 2020-10-08T00:48:27Z

it would be helpful to add the partial dependency plots

Think you can just return the raw scipy.optimize.OptimizeResult that skopt.plots.plot_objective() (and possibly other utils) already knows how to work with? No need to reinvent the wheel. Instead, we should document the workflow.

We should, however, look into adopting the lower-triangular subplot layout for the backtesting.lib.plot_heatmaps(). That makes it all much clearer!

ttfreeman · 2020-10-09T04:04:16Z

Yes, plotting the optimization results in a jupyter notebook is straightforward with plot_objective (and that's how I plotted the charts above for the Quick Start example). Though I am not sure if it would be as straightforward to integrate that with bokeh. The only thing that might be possible if we can export the plot_objective plots as PNG and bokeh can use them as image tag in their html file? It just won't be interactive like native bokeh charts.

I quickly looked at the heatmap plots from backtesting.lib.plot_heatmaps() and they are not as nice and clear with contour lines and test points etc.

kernc · 2020-10-09T14:03:06Z

Though I am not sure if it would be as straightforward to integrate that with bokeh.

I don't fully understand why we'd want it to integrate it with Bokeh. plot_objective() will output into notebooks just as well, and the returned matplotlib.Axes can be savefig'd for an image. Sure, the plot won't be interactive, but that's a miniature price to pay for not needing to write/maintain any extra code? What I'm suggesting, therefore, is that we just return the OptimizeResult and mark a note in the docs that the user might want to run skopt.plots.plot_objective() upon it. It's perfectly ok to mix best of breed utilities from the ecosystem. 😁

I quickly looked at the heatmap plots from backtesting.lib.plot_heatmaps() and they are not as nice and clear with contour lines and test points etc.

They are not. It's just an unrelated note-to-self proposal that we lay them out sorted in a more immediately clear lower-triangular instead of the simple grid layout.

guzuomuse · 2020-11-02T13:19:24Z

this should be on the top priority! big win if we have this feature!

kernc added enhancement good first issue labels Jul 15, 2020

kernc added the Hacktoberfest label Oct 1, 2020

kernc assigned ttfreeman Oct 5, 2020

Oct	NOV	Dec
	03
2019	2020	2021

kernc / backtesting.py

Suggestion for Faster way to optimize vs GridSearch #69

Suggestion for Faster way to optimize vs GridSearch #69

mitrymoe commented May 13, 2020

kernc commented May 14, 2020

ttfreeman commented Sep 27, 2020

kernc commented Oct 1, 2020 •

edited

ttfreeman commented Oct 1, 2020

kernc commented Oct 1, 2020 •

edited

ttfreeman commented Oct 4, 2020 •

edited

kernc commented Oct 8, 2020 •

edited

ttfreeman commented Oct 9, 2020 •

edited

kernc commented Oct 9, 2020 •

edited

guzuomuse commented Nov 2, 2020

kernc / backtesting.py

Sponsor kernc/backtesting.py

Join GitHub today

Suggestion for Faster way to optimize vs GridSearch #69

Suggestion for Faster way to optimize vs GridSearch #69

Comments

mitrymoe commented May 13, 2020

kernc commented May 14, 2020

ttfreeman commented Sep 27, 2020

kernc commented Oct 1, 2020 • edited

ttfreeman commented Oct 1, 2020

kernc commented Oct 1, 2020 • edited

ttfreeman commented Oct 4, 2020 • edited

kernc commented Oct 8, 2020 • edited

ttfreeman commented Oct 9, 2020 • edited

kernc commented Oct 9, 2020 • edited

guzuomuse commented Nov 2, 2020

Essential cookies

Always active

Analytics cookies

kernc commented Oct 1, 2020 •

edited

kernc commented Oct 1, 2020 •

edited

ttfreeman commented Oct 4, 2020 •

edited

kernc commented Oct 8, 2020 •

edited

ttfreeman commented Oct 9, 2020 •

edited

kernc commented Oct 9, 2020 •

edited