Resampling or Reweighting: A Comparison of Boosting Implementations
Published: 2008
Abstract
Boosting has been shown to improve the performance of classifiers in many situations, including when data is imbalanced. There are, however, two possible implementations of boosting, and it is unclear which should be used. Boosting by reweighting is typically used, but can only be applied to base learners that are designed to handle example weights. Boosting by resampling, on the other hand, can be applied to any base learner. In this work, we empirically evaluate the differences between these two boosting implementations on imbalanced training data. Using 10 boosting algorithms, 4 learners, and 15 datasets, we find that boosting by resampling performs as well as, or significantly better than, boosting by reweighting (which is often the default boosting implementation). We therefore conclude that, in general, boosting by resampling is preferred over boosting by reweighting.
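The two implementations differ only in how the example weights reach the base learner: reweighting passes the weight vector directly to a weight-aware learner, while resampling draws a bootstrap sample of the training set with probabilities proportional to the weights, so any learner can be used. A minimal sketch of boosting by resampling in the AdaBoost style, using a decision stump as the base learner (all function names here are illustrative, not from the paper):

```python
import numpy as np

def fit_stump(X, y):
    """Fit a one-level decision tree on (unweighted) data with labels in {-1, +1}."""
    best = None
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j]):
            for sign in (1, -1):
                pred = np.where(X[:, j] <= t, sign, -sign)
                err = np.mean(pred != y)
                if best is None or err < best[0]:
                    best = (err, j, t, sign)
    return best[1:]  # (feature, threshold, sign)

def stump_predict(stump, X):
    j, t, sign = stump
    return np.where(X[:, j] <= t, sign, -sign)

def adaboost_resample(X, y, rounds=10, rng=None):
    """AdaBoost by resampling: the base learner never sees the weights.

    A reweighting implementation would instead pass w into the learner's
    weighted loss; here w only drives the bootstrap sampling step.
    """
    rng = np.random.default_rng(rng)
    n = len(y)
    w = np.full(n, 1.0 / n)
    models, alphas = [], []
    for _ in range(rounds):
        # Resampling step: draw a bootstrap sample with probability w.
        idx = rng.choice(n, size=n, replace=True, p=w)
        stump = fit_stump(X[idx], y[idx])
        # Weighted error is still measured on the full training set.
        pred = stump_predict(stump, X)
        err = np.clip(np.sum(w * (pred != y)), 1e-10, 1 - 1e-10)
        alpha = 0.5 * np.log((1 - err) / err)
        # Standard AdaBoost weight update.
        w *= np.exp(-alpha * y * pred)
        w /= w.sum()
        models.append(stump)
        alphas.append(alpha)
    return models, alphas

def predict(models, alphas, X):
    agg = sum(a * stump_predict(m, X) for m, a in zip(models, alphas))
    return np.sign(agg)
```

The only change needed to turn this sketch into boosting by reweighting is to skip the `rng.choice` step and fit the stump on the full data with `w` as example weights, which is exactly why reweighting requires a weight-aware base learner.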