Entropy-based regularization of AdaBoost

Michał Bereta

doi:10.24423/cames.206

Authors

Michał Bereta Institute of Computer Science, Cracow University of Technology, Kraków, Poland

Abstract

In this study, we introduce an entropy-based method to regularize the AdaBoost algorithm, a well-known algorithm for building aggregated classifiers. In many real-world classification problems, attention is paid not only to the classification accuracy of the final classifier but also to tuning the number of so-called weak learners that the final (strong) classifier aggregates. The proposed method is able to improve the AdaBoost algorithm in terms of both criteria. While many approaches to the regularization of boosting algorithms can be complicated, the proposed method is straightforward and easy to implement. We compare the results of the proposed method (EntropyAdaBoost) with the original AdaBoost and with its regularized version, ε-AdaBoost, on several classification problems. It is shown that the proposed method, EntropyAdaBoost, and ε-AdaBoost are strongly complementary when the improvement of AdaBoost is considered.
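For orientation, the two baselines named in the abstract can be sketched as follows. This is a minimal illustration, not the EntropyAdaBoost method itself, whose details the abstract does not give. It assumes binary labels in {-1, +1}, decision stumps as weak learners, and the common reading of ε-AdaBoost as AdaBoost with a small fixed step `eps` in place of the computed coefficient. All function names (`fit_stump`, `stump_predict`, `boost`, `predict`) and the toy data are hypothetical.

```python
import numpy as np


def fit_stump(X, y, w):
    """Exhaustively pick the stump (feature, threshold, polarity) with the lowest weighted error."""
    best = (0, 0.0, 1, np.inf)  # feature index, threshold, polarity, weighted error
    for j in range(X.shape[1]):
        for thr in np.unique(X[:, j]):
            for pol in (1, -1):
                pred = np.where(X[:, j] <= thr, pol, -pol)
                err = w[pred != y].sum()
                if err < best[3]:
                    best = (j, thr, pol, err)
    return best


def stump_predict(X, stump):
    """Predict -1/+1 labels with a stump returned by fit_stump."""
    j, thr, pol, _ = stump
    return np.where(X[:, j] <= thr, pol, -pol)


def boost(X, y, rounds, eps=None):
    """Discrete AdaBoost; if eps is given, every round uses the fixed step eps instead
    (assumed epsilon-AdaBoost-style shrinkage, not taken from the paper)."""
    n = len(y)
    w = np.full(n, 1.0 / n)  # uniform initial sample weights
    ensemble = []
    for _ in range(rounds):
        stump = fit_stump(X, y, w)
        # Clip the error away from 0 and 1 so the logarithm stays finite.
        err = min(max(stump[3], 1e-12), 1 - 1e-12)
        alpha = 0.5 * np.log((1 - err) / err) if eps is None else eps
        pred = stump_predict(X, stump)
        w = w * np.exp(-alpha * y * pred)  # up-weight misclassified samples
        w /= w.sum()
        ensemble.append((alpha, stump))
    return ensemble


def predict(X, ensemble):
    """Sign of the weighted vote of all weak learners (ties go to +1)."""
    score = sum(a * stump_predict(X, s) for a, s in ensemble)
    return np.where(score >= 0, 1, -1)


# Toy one-dimensional problem that no single stump can solve.
X = np.array([[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]])
y = np.array([1, 1, -1, -1, 1, 1])

for name, model in (("AdaBoost", boost(X, y, rounds=10)),
                    ("fixed-step variant", boost(X, y, rounds=10, eps=0.1))):
    acc = (predict(X, model) == y).mean()
    print(f"{name}: {len(model)} weak learners, training accuracy {acc:.2f}")
```

The number of rounds is the quantity the abstract refers to as the number of weak learners. In this sketch it is fixed in advance, whereas a regularized variant would aim to reach comparable accuracy with fewer of them.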

Keywords:

AdaBoost, regularization, entropy, EntropyAdaBoost
