Stochastic Schemata Exploiter-Based Optimization of Hyper-parameters for XGBoost

Abstract

XGBoost is a well-known open-source software library that provides a regularized gradient boosting framework. Although it is widely used in the machine learning field, its performance depends on the choice of hyper-parameters. This study applies the Stochastic Schemata Exploiter (SSE) to the optimization of XGBoost hyper-parameters. SSE, an evolutionary algorithm, has been successfully applied to combinatorial optimization problems; here, the original algorithm is modified for hyper-parameter optimization. Compared with a simple Genetic Algorithm, SSE has two attractive features: quick convergence and a small number of control parameters. To confirm its validity, the proposed algorithm is compared with other hyper-parameter optimization algorithms, namely Gradient Boosted Regression Trees (GBRT), the Tree-structured Parzen Estimator (TPE), the Covariance Matrix Adaptation Evolution Strategy (CMA-ES), and Random Search. The numerical results show that SSE has good convergence properties despite using fewer control parameters than the other methods.
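To make the SSE idea concrete, the following is a minimal, self-contained Python sketch of SSE-style hyper-parameter search for XGBoost, loosely following the schemata-extraction scheme described by Aizawa [11, 12]: the population is sorted by fitness, the best i individuals define the i-th schema (the bit positions on which they all agree), and new individuals are sampled by filling the remaining positions at random. The binary encoding, parameter ranges, schema count, mutation step, and dataset below are illustrative assumptions, not the authors' exact modification of SSE.

import random
from functools import lru_cache

import xgboost as xgb
from sklearn.datasets import fetch_california_housing
from sklearn.model_selection import cross_val_score

BITS = 16                                   # bits per hyper-parameter (assumed)
PARAMS = [("learning_rate", 0.01, 0.3),     # (name, low, high): assumed ranges
          ("max_depth", 2, 10),
          ("subsample", 0.5, 1.0)]
LENGTH = BITS * len(PARAMS)

def decode(bits):
    """Map a binary tuple to an XGBoost parameter dict."""
    params = {}
    for i, (name, lo, hi) in enumerate(PARAMS):
        chunk = bits[i * BITS:(i + 1) * BITS]
        x = int("".join(map(str, chunk)), 2) / (2 ** BITS - 1)
        val = lo + x * (hi - lo)
        params[name] = round(val) if name == "max_depth" else val
    return params

X, y = fetch_california_housing(return_X_y=True)

@lru_cache(maxsize=None)                    # cache: individuals recur across generations
def fitness(bits):
    """3-fold cross-validated R^2 of XGBoost under the decoded parameters."""
    model = xgb.XGBRegressor(n_estimators=50, **decode(bits))
    return cross_val_score(model, X, y, cv=3, scoring="r2").mean()

def common_schema(individuals):
    """Bit positions where all individuals agree; None marks a don't-care."""
    return [c[0] if len(set(c)) == 1 else None for c in zip(*individuals)]

def sample(schema, p_mut=0.1):
    """Instantiate a schema: fill don't-cares randomly, then lightly mutate."""
    child = [random.randint(0, 1) if b is None else b for b in schema]
    if random.random() < p_mut:             # mutation rate is an assumption
        j = random.randrange(len(child))
        child[j] ^= 1
    return tuple(child)

POP, N_SCHEMATA, GENERATIONS = 20, 10, 30   # assumed control parameters
pop = [tuple(random.randint(0, 1) for _ in range(LENGTH)) for _ in range(POP)]

for gen in range(GENERATIONS):
    pop.sort(key=fitness, reverse=True)
    # SSE core step: one offspring per schema, replacing the worst individuals.
    offspring = [sample(common_schema(pop[:i])) for i in range(2, N_SCHEMATA + 2)]
    pop = pop[:POP - len(offspring)] + offspring

pop.sort(key=fitness, reverse=True)
print("best parameters:", decode(pop[0]), "CV R2:", fitness(pop[0]))

The baselines in the comparison are available off the shelf: the GBRT optimizer corresponds to skopt.gbrt_minimize in scikit-optimize [18], while Optuna [19] provides TPE, CMA-ES, and random samplers, so a baseline run reduces to swapping the sampler. The sketch below assumes the same search space as above; the trial budget is likewise an assumption.

import optuna
import xgboost as xgb
from sklearn.datasets import fetch_california_housing
from sklearn.model_selection import cross_val_score

X, y = fetch_california_housing(return_X_y=True)

def objective(trial):
    """Evaluate one sampled XGBoost configuration."""
    model = xgb.XGBRegressor(
        n_estimators=50,
        learning_rate=trial.suggest_float("learning_rate", 0.01, 0.3),
        max_depth=trial.suggest_int("max_depth", 2, 10),
        subsample=trial.suggest_float("subsample", 0.5, 1.0),
    )
    return cross_val_score(model, X, y, cv=3, scoring="r2").mean()

for sampler in (optuna.samplers.TPESampler(),
                optuna.samplers.CmaEsSampler(),
                optuna.samplers.RandomSampler()):
    study = optuna.create_study(direction="maximize", sampler=sampler)
    study.optimize(objective, n_trials=100)
    print(type(sampler).__name__, study.best_value)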

Keywords

evolutionary computation, Stochastic Schemata Exploiter, hyper-parameter optimization, XGBoost

References

1. T. Chen, C. Guestrin, XGBoost: A scalable tree boosting system, [in:] Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794, 2016, doi: 10.1145/2939672.2939785.
2. J. Bergstra, Y. Bengio, Random search for hyper-parameter optimization, Journal of Machine Learning Research, 13(10): 281–305, 2012.
3. R.G. Mantovani, A.L. Rossi, J. Vanschoren, B. Bischl, A.C. de Carvalho, Effectiveness of random search in SVM hyper-parameter tuning, [in:] Proceedings of 2015 International Joint Conference on Neural Networks, Killarney, Ireland, pp. 1–8, 2015, doi: 10.1109/IJCNN.2015.7280664.
4. A.C. Florea, R. Andonie, Weighted random search for hyperparameter optimization, International Journal of Computers, Communications and Control, 14(2): 154–169, 2019.
5. Y. Xia, C. Liu, Y.Y. Li, N. Liu, A boosted decision tree approach using Bayesian hyperparameter optimization for credit scoring, Expert Systems with Applications, 78: 225–241, 2017, doi: 10.1016/j.eswa.2017.02.017.
6. J. Snoek, H. Larochelle, R.P. Adams, Practical Bayesian optimization of machine learning algorithms, Advances in Neural Information Processing Systems, 25: 2960–2968, 2012.
7. M. Feurer, F. Hutter, Hyperparameter optimization, [in:] F. Hutter, L. Kotthoff, J. Vanschoren [Eds.], Automated Machine Learning: Methods, Systems, Challenges, pp. 3–33, Springer, Cham, 2019, doi: 10.1007/978-3-030-05318-5_1.
8. N. Hansen, S.D. Müller, P. Koumoutsakos, Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES), Evolutionary Computation, 11(1): 1–18, 2003, doi: 10.1162/106365603321828970.
9. F. Friedrichs, C. Igel, Evolutionary tuning of multiple SVM parameters, Neurocomputing, 64: 107–117, 2005, doi: 10.1016/j.neucom.2004.11.022.
10. I. Loshchilov, F. Hutter, CMA-ES for hyperparameter optimization of deep neural networks, 2016, arXiv: 1604.07269v1.
11. A.N. Aizawa, Evolving SSE: A stochastic schemata exploiter, [in:] Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence, Orlando, FL, USA, Vol. 1, pp. 525–529, 1994, doi: 10.1109/ICEC.1994.349895.
12. A.N. Aizawa, Evolving SSE: A new population-oriented search scheme based on schemata processing, Systems and Computers in Japan, 27(2): 41–52, 1996, doi: 10.1002/scj.4690270204.
13. T. Maruyama, E. Kita, Extension of stochastic schemata exploiter to real-valued problem, The Special Interest Group MPS Technical Reports of Information Processing Society of Japan, 61: 17–20, 2006.
14. T. Maruyama, E. Kita, Investigation of real-valued stochastic schemata exploiter, Information Processing Society of Japan Transactions on Mathematical Modeling and its Applications, 48: 10–22, 2007.
15. L. Breiman, Arcing the Edge, Technical Report 486, Statistics Department, University of California, Berkeley, CA, 1997.
16. J.H. Friedman, Greedy function approximation: A gradient boosting machine, Annals of Statistics, 29(5): 1189–1232, 2001.
17. J. Bergstra, R. Bardenet, Y. Bengio, B. Kégl, Algorithms for hyper-parameter optimization, [in:] Proceedings of the 24th International Conference on Neural Information Processing Systems, Granada, Spain, pp. 2546–2554, 2011.
18. T. Head, M. Kumar, H. Nahrstaedt, G. Louppe, I. Shcherbatyi, scikit-optimize/scikit-optimize (v0.8.1), 2020, https://zenodo.org/records/4014775.
19. T. Akiba, S. Sano, T. Yanase, T. Ohta, M. Koyama, Optuna: A next-generation hyperparameter optimization framework, [in:] Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2623–2631, 2019, doi: 10.1145/3292500.3330701.
20. M. Feurer et al., OpenML-Python: An extensible Python API for OpenML, Journal of Machine Learning Research, 22(1): 4573–4577, 2021.
21. A.G. Koru, D. Zhang, H. Liu, Modeling the effect of size on defect proneness for open-source software, [in:] 29th International Conference on Software Engineering (ICSE'07 Companion), Minneapolis, MN, USA, pp. 115–124, 2007, doi: 10.1109/ICSECOMPANION.2007.54.
22. J. Vanschoren, J.N. van Rijn, B. Bischl, L. Torgo, OpenML: Networked science in machine learning, SIGKDD Explorations, 15(2): 49–60, 2013, doi: 10.1145/2641190.2641198.
23. W.J. Nash, T.L. Sellers, S.R. Talbot, A.J. Cawthorn, W.B. Ford, The population biology of abalone (Haliotis species) in Tasmania. I. Blacklip abalone (H. rubra) from the North Coast and Islands of Bass Strait, Technical Report, No. 48, Sea Fisheries Division, Department of Primary Industry and Fisheries, Tasmania, 1994.
24. P. Cortez, A. Cerdeira, F. Almeida, T. Matos, J. Reis, Modeling wine preferences by data mining from physicochemical properties, Decision Support Systems, 47(4): 547–553, 2009.
Published
Feb 29, 2024
How to Cite
MAKINO, Hiroya; KITA, Eisuke. Stochastic Schemata Exploiter-Based Optimization of Hyper-parameters for XGBoost. Computer Assisted Methods in Engineering and Science, 31(1): 113–132, 2024. ISSN 2956-5839. doi: 10.24423/cames.2024.1296. Available at: https://cames.ippt.pan.pl/index.php/cames/article/view/1296.