Details

Ensemble Classification Methods with Applications in R



1st ed.

by: Esteban Alfaro, Matías Gámez, Noelia García

101,99 €

Publisher: Wiley
Format: PDF
Published: 15.08.2018
ISBN/EAN: 9781119421573
Language: English
Number of pages: 224

DRM-protected eBook; to read it you need, for example, Adobe Digital Editions and an Adobe ID.

Descriptions

<p><b>An essential guide to two burgeoning topics in machine learning – classification trees and ensemble learning</b></p> <p><i>Ensemble Classification Methods with Applications in R</i> introduces the concepts and principles of ensemble classifier methods and reviews the most commonly used techniques. It shows how ensemble classification has developed as an extension of individual classifiers. The text emphasizes two areas of machine learning: classification trees and ensemble learning. The authors explore the basic characteristics of ensemble classification methods and explain the types of problems that can emerge in their application.</p> <p>Written by a team of noted experts in the field, the text is divided into two main sections. The first outlines the theoretical underpinnings of the topic, and the second presents examples of practical applications. The book contains a wealth of illustrative cases from business failure prediction, zoology, ecology, and other fields. This guide:</p> <ul> <li>Offers a text that has been tested both in the classroom and in tutorials at conferences</li> <li>Contains authoritative information written by leading experts in the field</li> <li>Presents a comprehensive text that can be used in courses on machine learning, data mining, and artificial intelligence</li> <li>Combines in one volume two of the most intriguing topics in machine learning: ensemble learning and classification trees</li> </ul> <p>Written for researchers in fields such as biostatistics, economics, environmental science, and zoology, as well as students of data mining and machine learning, <i>Ensemble Classification Methods with Applications in R</i> focuses on two topics in machine learning: classification trees and ensemble learning.</p>
<p>List of Contributors ix</p> <p>List of Tables xi</p> <p>List of Figures xv</p> <p>Preface xvii</p> <p><b>1 Introduction 1<br /></b><i>Esteban Alfaro, Matías Gámez, and Noelia García</i></p> <p>1.1 Introduction 1</p> <p>1.2 Definition 1</p> <p>1.3 Taxonomy of Supervised Classification Methods 2</p> <p>1.4 Estimation of the Accuracy of a Classification System 3</p> <p>1.4.1 The Apparent Error Rate 4</p> <p>1.4.2 Estimation of the True Error Rate 4</p> <p>1.4.3 Error Rate Estimation Methods 4</p> <p>1.4.4 The Standard Error 6</p> <p>1.5 Classification Trees 7</p> <p>1.5.1 Classification Tree Building 8</p> <p>1.5.2 Splitting Rule 9</p> <p>1.5.3 Splitting Criteria 10</p> <p>1.5.4 Goodness of a Split 10</p> <p>1.5.5 The Impurity of a Tree 11</p> <p>1.5.6 Stopping Criteria 11</p> <p>1.5.7 Overfitting in Classification Trees 12</p> <p>1.5.8 Pruning Rules 14</p> <p><b>2 Limitation of the Individual Classifiers 19<br /></b><i>Esteban Alfaro, Matías Gámez, and Noelia García</i></p> <p>2.1 Introduction 19</p> <p>2.2 Error Decomposition: Bias and Variance 20</p> <p>2.3 Study of Classifier Instability 23</p> <p>2.4 Advantages of Ensemble Classifiers 26</p> <p>2.5 Bayesian Perspective of Ensemble Classifiers 28</p> <p><b>3 Ensemble Classifiers Methods 31<br /></b><i>Esteban Alfaro, Matías Gámez, and Noelia García</i></p> <p>3.1 Introduction 31</p> <p>3.2 Taxonomy of Ensemble Methods 32</p> <p>3.2.1 Non-Generative Methods 33</p> <p>3.2.2 Generative Methods 33</p> <p>3.3 Bagging 34</p> <p>3.4 Boosting 36</p> <p>3.4.1 AdaBoost Training Error 40</p> <p>3.4.2 AdaBoost and the Margin Theory 41</p> <p>3.4.3 Other Boosting Versions 43</p> <p>3.4.4 Comparing Bagging and Boosting 46</p> <p>3.5 Random Forests 46</p> <p><b>4 Classification with Individual and Ensemble Trees in R 51<br /></b><i>Esteban Alfaro, Matías Gámez, and Noelia García</i></p> <p>4.1 Introduction 51</p> <p>4.2 adabag: An R Package for Classification with Boosting and Bagging 52</p> <p>4.2.1 The bagging, predict.bagging, and bagging.cv Functions 56</p> <p>4.2.2 The boosting, predict.boosting, and boosting.cv Functions 65</p> <p>4.2.3 The margins, plot.margins, errorevol and plot.errorevol Functions 71</p> <p>4.2.4 The MarginOrderedPruning.Bagging Function 75</p> <p>4.3 The “German Credit” Example 79</p> <p>4.3.1 Classification Tree 81</p> <p>4.3.2 Combination using Bagging 85</p> <p>4.3.3 Combination using Boosting 88</p> <p>4.3.4 Combination using Random Forest 90</p> <p>4.3.5 Cross-Validation Comparison 95</p> <p><b>5 Bankruptcy Prediction Through Ensemble Trees 97<br /></b><i>Esteban Alfaro, Matías Gámez, and Noelia García</i></p> <p>5.1 Introduction 97</p> <p>5.2 Problem Description 97</p> <p>5.3 Applications 99</p> <p>5.3.1 The Dichotomous Case 99</p> <p>5.3.2 The Three-Class Case 111</p> <p>5.4 Conclusions 117</p> <p><b>6 Experiments with Adabag in Biology Classification Tasks 119<br /></b><i>M. Fernández-Delgado, E. Cernadas, and M. Pérez-Ortiz</i></p> <p>6.1 Classification of Color Texture Feature Patterns Extracted From Cells in Histological Images of Fish Ovary 119</p> <p>6.2 Direct Kernel Perceptron: Ultra-Fast Kernel ELM-Based Classification with Non-Iterative Closed-Form Weight Calculation 122</p> <p>6.3 Do We Need Hundreds of Classifiers to Solve Real-World Classification Problems? 125</p> <p>6.4 On the use of Nominal and Ordinal Classifiers for the Discrimination of Stages of Development in Fish Oocytes 129</p> <p><b>7 Generalization Bounds for Ranking Algorithms 135<br /></b><i>W. Rejchel</i></p> <p>7.1 Introduction 135</p> <p>7.2 Assumptions, Main Theorem, and Application 136</p> <p>7.3 Experiments 138</p> <p>7.4 Conclusions 139</p> <p><b>8 Classification and Regression Trees for Analyzing Irrigation Decisions 141<br /></b><i>S. Andriyas and M. McKee</i></p> <p>8.1 Introduction 141</p> <p>8.2 Theory 143</p> <p>8.3 Case Study and Methods 144</p> <p>8.3.1 Study Site and Data Available 144</p> <p>8.3.2 Model, Specifications, and Performance Evaluation 146</p> <p>8.4 Results and Discussion 147</p> <p>8.5 Conclusions 153</p> <p><b>9 Boosted Rule Learner and its Properties 155<br /></b><i>M. Kubus</i></p> <p>9.1 Introduction 155</p> <p>9.2 Separate-and-Conquer 156</p> <p>9.3 Boosting in Rule Induction 157</p> <p>9.4 Experiments 158</p> <p>9.5 Conclusions 161</p> <p><b>10 Credit Scoring with Individuals and Ensemble Trees 163<br /></b><i>M. Chrzanowska, E. Alfaro, and D. Witkowska</i></p> <p>10.1 Introduction 163</p> <p>10.2 Measures of Accuracy 164</p> <p>10.3 Data Description 165</p> <p>10.4 Classification of Borrowers Applying Ensemble Trees 168</p> <p>10.5 Conclusions 173</p> <p><b>11 An Overview of Multiple Classifier Systems Based on Generalized Additive Models 175<br /></b><i>K.W. De Bock, K. Coussement, and D. Cielen</i></p> <p>11.1 Introduction 175</p> <p>11.2 Multiple Classifier Systems Based on GAMs 176</p> <p>11.2.1 Generalized Additive Models 176</p> <p>11.2.2 GAM-Based Multiple Classifier Systems 177</p> <p>11.2.3 GAMensPlus: Extending GAMens for Advanced Interpretability 179</p> <p>11.3 Experiments and Applications 180</p> <p>11.3.1 A Multi-Domain Benchmark Study of GAM-Based Ensemble Classifiers 180</p> <p>11.3.2 Benchmarking GAM-Based Ensemble Classifiers in Predictive Customer Analytics 181</p> <p>11.3.3 A Case Study of GAMensPlus used for Customer Churn Prediction in Financial Services 183</p> <p>11.4 Software Implementation in R: the GAMens Package 185</p> <p>11.5 Conclusions 185</p> <p>References 187</p> <p>Index 197</p>
<p><b>ESTEBAN ALFARO, MATÍAS GÁMEZ AND NOELIA GARCÍA</b> are Associate Professors at the Applied Economics Department (Statistics), Faculty of Economics and Business of Albacete, and researchers at the Regional Development Institute (IDR), University of Castilla-La Mancha. Together they have published several papers in prestigious journals on topics such as applications of ensemble trees to corporate bankruptcy, credit scoring, and statistical quality control, the most notable of which appeared in <i>Journal of Statistical Software, Vol 54.</i></p>

You might also be interested in these products:

Statistics for Microarrays
by: Ernst Wit, John McClure
PDF ebook
90,99 €