Making Logistic Regression a Core Data Mining Tool with TR-IRLS
2006Vol. 758, pp. 685–688
Citations Over TimeTop 10% of 2006 papers
Abstract
Binary classification is a core data mining task. For large datasets or real-time applications, desirable classifiers are accurate, fast, and need no parameter tuning. We present a simple implementation of logistic regression that meets these requirements. A combination of regularization, truncated Newton methods, and iteratively re-weighted least squares make it faster and more accurate than modern SVM implementations, and relatively insensitive to parameters. It is robust to linear dependencies and some scaling problems, making most data preprocessing unnecessary.
Related Papers
- → Road Accident Data Analysis: Data Preprocessing for Better Model Building(2019)17 cited
- → Influence of Data Preprocessing(2016)24 cited
- → Data preprocessing based on missing value and discretisation(2020)2 cited
- Research and Application on Spatial Data Preprocessing Techniques in Logistics Area(2010)
- → Data preprocessing based on missing value and discretisation(2020)