Details

Data Analysis and Applications 1


Data Analysis and Applications 1

Clustering and Regression, Modeling-estimating, Forecasting and Data Mining
1. Aufl.

von: Christos H. Skiadas, James R. Bozeman

139,99 €

Verlag: Wiley
Format: PDF
Veröffentl.: 04.03.2019
ISBN/EAN: 9781119597575
Sprache: englisch
Anzahl Seiten: 288

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

This series of books collects a diverse array of work that provides the reader with theoretical and applied information on data analysis methods, models, and techniques, along with appropriate applications.<br /><br />Volume 1 begins with an introductory chapter by Gilbert Saporta, a leading expert in the field, who summarizes the developments in data analysis over the last 50 years. The book is then divided into three parts: Part 1 presents clustering and regression cases; Part 2 examines grouping and decomposition, GARCH and threshold models, structural equations, and SME modeling; and Part 3 presents symbolic data analysis, time series and multiple choice models, modeling in demography, and data mining.
<p>Preface xi</p> <p>Introduction xv<br /><i>Gilbert SAPORTA</i></p> <p><b>Part 1 Clustering and Regression </b><b>1</b></p> <p><b>Chapter 1 Cluster Validation by Measurement of Clustering Characteristics Relevant to the User </b><b>3<br /></b><i>Christian HENNIG</i></p> <p>1.1 Introduction 3</p> <p>1.2 General notation 5</p> <p>1.3 Aspects of cluster validity 6</p> <p>1.3.1 Small within-cluster dissimilarities 6</p> <p>1.3.2 Between-cluster separation 7</p> <p>1.3.3 Representation of objects by centroids 7</p> <p>1.3.4 Representation of dissimilarity structure by clustering 8</p> <p>1.3.5 Small within-cluster gaps 9</p> <p>1.3.6 Density modes and valleys 9</p> <p>1.3.7 Uniform within-cluster density 12</p> <p>1.3.8 Entropy 12</p> <p>1.3.9 Parsimony 13</p> <p>1.3.10 Similarity to homogeneous distributional shapes 13</p> <p>1.3.11 Stability 13</p> <p>1.3.12 Further Aspects 14</p> <p>1.4 Aggregation of indexes 14</p> <p>1.5 Random clusterings for calibrating indexes 15</p> <p>1.5.1 Stupid K-centroids clustering 16</p> <p>1.5.2 Stupid nearest neighbors clustering 16</p> <p>1.5.3 Calibration 17</p> <p>1.6 Examples 18</p> <p>1.6.1 Artificial data set 18</p> <p>1.6.2 Tetragonula bees data 20</p> <p>1.7 Conclusion 22</p> <p>1.8 Acknowledgment 23</p> <p>1.9 References 23</p> <p><b>Chapter 2 Histogram-Based Clustering of Sensor Network Data </b><b>25<br /></b><i>Antonio BALZANELLA </i>and<i> Rosanna VERDE</i></p> <p>2.1 Introduction 25</p> <p>2.2 Time series data stream clustering 28</p> <p>2.2.1 Local clustering of histogram data 30</p> <p>2.2.2 Online proximity matrix updating 32</p> <p>2.2.3 Off-line partitioning through the dynamic clustering algorithm for dissimilarity tables 33</p> <p>2.3 Results on real data 34</p> <p>2.4 Conclusions 36</p> <p>2.5 References 36</p> <p><b>Chapter 3 The Flexible Beta Regression Model </b><b>39<br /></b><i>Sonia MIGLIORATI, Agnese MDI BRISCO </i>and<i> Andrea ONGARO</i></p> <p>3.1 Introduction 39</p> <p>3.2 The FB distribution 41</p> <p>3.2.1 The beta distribution 41</p> <p>3.2.2 The FB distribution 41</p> <p>3.2.3 Reparameterization of the FB 42</p> <p>3.3 The FB regression model 43</p> <p>3.4 Bayesian inference 44</p> <p>3.5 Illustrative application 47</p> <p>3.6 Conclusion 48</p> <p>3.7 References 50</p> <p><b>Chapter 4 <i>S</i>-weighted Instrumental Variables </b><b>53<br /></b><i>Jan Ámos VÍŠEK</i></p> <p>4.1 Summarizing the previous relevant results 53</p> <p>4.2 The notations, framework, conditions and main tool 55</p> <p>4.3 <i>S</i>-weighted estimator and its consistency 57</p> <p>4.4 <i>S</i>-weighted instrumental variables and their consistency 59</p> <p>4.5 Patterns of results of simulations 64</p> <p>4.5.1 Generating the data 65</p> <p>4.5.2 Reporting the results 66</p> <p>4.6 Acknowledgment 69</p> <p>4.7 References 69</p> <p><b>Part 2 Models and Modeling </b><b>73</b></p> <p><b>Chapter 5 Grouping Property and Decomposition of Explained Variance in Linear Regression </b><b>75<br /></b><i>Henri WALLARD</i></p> <p>5.1 Introduction 75</p> <p>5.2 CAR scores 76</p> <p>5.2.1 Definition and estimators 76</p> <p>5.2.2 Historical criticism of the CAR scores 79</p> <p>5.3 Variance decomposition methods and SVD 79</p> <p>5.4 Grouping property of variance decomposition methods 80</p> <p>5.4.1 Analysis of grouping property for CAR scores 81</p> <p>5.4.2 Demonstration with two predictors 82</p> <p>5.4.3 Analysis of grouping property using SVD 83</p> <p>5.4.4 Application to the diabetes data set 86</p> <p>5.5 Conclusions 87</p> <p>5.6 References 88</p> <p><b>Chapter 6 On GARCH Models with Temporary Structural Changes </b><b>91<br /></b><i>Norio WATANABE </i>and <i>Fumiaki OKIHARA</i></p> <p>6.1 Introduction 91</p> <p>6.2 The model 92</p> <p>6.2.1 Trend model 92</p> <p>6.2.2 Intervention GARCH model 93</p> <p>6.3 Identification 96</p> <p>6.4 Simulation 96</p> <p>6.4.1 Simulation on trend model 96</p> <p>6.4.2 Simulation on intervention trend model 98</p> <p>6.5 Application 98</p> <p>6.6 Concluding remarks 102</p> <p>6.7 References 103</p> <p><b>Chapter 7 A Note on the Linear Approximation of TAR Models </b><b>105<br /></b><i>Francesco GIORDANO, Marcella NIGLIO </i>and<i> Cosimo Damiano VITALE</i></p> <p>7.1 Introduction 105</p> <p>7.2 Linear representations and linear approximations of nonlinear models 107</p> <p>7.3 Linear approximation of the TAR model 109</p> <p>7.4 References 116</p> <p><b>Chapter 8 An Approximation of Social Well-Being Evaluation Using Structural Equation Modeling </b><b>117<br /></b><i>Leonel SANTOS-BARRIOS, Monica RUIZ-TORRES, William GÓMEZ-DEMETRIO, Ernesto SÁNCHEZ-VERA, Ana LORGA DA SILVA </i>and<i> Francisco MARTÍNEZ-CASTAÑEDA</i></p> <p>8.1 Introduction 117</p> <p>8.2 Wellness118</p> <p>8.3 Social welfare 118</p> <p>8.4 Methodology 119</p> <p>8.5 Results 120</p> <p>8.6 Discussion 123</p> <p>8.7 Conclusions 123</p> <p>8.8 References 123</p> <p><b>Chapter 9 An SEM Approach to Modeling Housing Values </b><b>125<br /></b><i>Jim FREEMAN </i>and<i> Xin ZHAO</i></p> <p>9.1 Introduction 125</p> <p>9.2 Data 126</p> <p>9.3 Analysis 127</p> <p>9.4 Conclusions 134</p> <p>9.5 References 135</p> <p><b>Chapter 10 Evaluation of Stopping Criteria for Ranks in Solving Linear Systems </b><b>137<br /></b><i>Benard ABOLA, Pitos BIGANDA, Christopher ENGSTRÖM </i>and<i> Sergei SILVESTROV</i></p> <p>10.1 Introduction 137</p> <p>10.2 Methods 139</p> <p>10.2.1 Preliminaries 139</p> <p>10.2.2 Iterative methods 140</p> <p>10.3 Formulation of linear systems 142</p> <p>10.4 Stopping criteria 143</p> <p>10.5 Numerical experimentation of stopping criteria 146</p> <p>10.5.1 Convergence of stopping criterion 147</p> <p>10.5.2 Quantiles 147</p> <p>10.5.3 Kendall correlation coefficient as stopping criterion 148</p> <p>10.6 Conclusions 150</p> <p>10.7 Acknowledgments 151</p> <p>10.8 References 151</p> <p><b>Chapter 11 Estimation of a Two-Variable Second-Degree Polynomial via Sampling </b><b>153<br /></b><i>Ioanna PAPATSOUMA, Nikolaos FARMAKIS </i>and <i>Eleni KETZAKI</i></p> <p>11.1 Introduction 153</p> <p>11.2 Proposed method 154</p> <p>11.2.1 First restriction 154</p> <p>11.2.2 Second restriction 155</p> <p>11.2.3 Third restriction 156</p> <p>11.2.4 Fourth restriction 156</p> <p>11.2.5 Fifth restriction 157</p> <p>11.2.6 Coefficient estimates 158</p> <p>11.3 Experimental approaches 159</p> <p>11.3.1 Experiment A 159</p> <p>11.3.2 Experiment B 161</p> <p>11.4 Conclusions 163</p> <p>11.5 References 163</p> <p><b>Part 3 Estimators, Forecasting and Data Mining </b><b>165</b></p> <p><b>Chapter 12 Displaying Empirical Distributions of Conditional Quantile Estimates: An Application of Symbolic Data Analysis to the Cost Allocation Problem in Agriculture </b><b>167<br /></b><i>Dominique DESBOIS</i></p> <p>12.1 Conceptual framework and methodological aspects of cost allocation 167</p> <p>12.2 The empirical model of specific production cost estimates 168</p> <p>12.3 The conditional quantile estimation 169</p> <p>12.4 Symbolic analyses of the empirical distributions of specific costs 170</p> <p>12.5 The visualization and the analysis of econometric results 172</p> <p>12.6 Conclusion 178</p> <p>12.7 Acknowledgments 179</p> <p>12.8 References 179</p> <p><b>Chapter 13 Frost Prediction in Apple Orchards Based upon Time Series Models </b><b>181<br /></b><i>Monika ATOMKOWICZ </i>and<i> Armin OSCHMITT</i></p> <p>13.1 Introduction 181</p> <p>13.2 Weather database 182</p> <p>13.3 ARIMA forecast model 183</p> <p>13.3.1 Stationarity and differencing 184</p> <p>13.3.2 Non-seasonal ARIMA models 186</p> <p>13.4 Model building 188</p> <p>13.4.1 ARIMA and LR models 188</p> <p>13.4.2 Binary classification of the frost data 189</p> <p>13.4.3 Training and test set 189</p> <p>13.5 Evaluation 189</p> <p>13.6 ARIMA model selection 190</p> <p>13.7 Conclusions 192</p> <p>13.8 Acknowledgments 193</p> <p>13.9 References 193</p> <p><b>Chapter 14 Efficiency Evaluation of Multiple-Choice Questions and Exams </b><b>195<br /></b><i>Evgeny GERSHIKOV </i>and<i> Samuel KOSOLAPOV</i></p> <p>14.1 Introduction 195</p> <p>14.2 Exam efficiency evaluation 196</p> <p>14.2.1 Efficiency measures and efficiency weighted grades 196</p> <p>14.2.2 Iterative execution 198</p> <p>14.2.3 Postprocessing 199</p> <p>14.3 Real-life experiments and results 200</p> <p>14.4 Conclusions 203</p> <p>14.5 References 204</p> <p><b>Chapter 15 Methods of Modeling and Estimation in Mortality </b><b>205<br /></b><i>Christos HSKIADAS </i>and<i> Konstantinos NZAFEIRIS</i></p> <p>15.1 Introduction 205</p> <p>15.2 The appearance of life tables 206</p> <p>15.3 On the law of mortality 207</p> <p>15.4 Mortality and health 211</p> <p>15.5 An advanced health state function form 217</p> <p>15.6 Epilogue 220</p> <p>15.7 References 221</p> <p><b>Chapter 16 An Application of Data Mining Methods to the Analysis of Bank Customer Profitability and Buying Behavior </b><b>225<br /></b><i>Pedro GODINHO, Joana DIAS </i>and<i> Pedro TORRES</i></p> <p>16.1 Introduction 225</p> <p>16.2 Data set 227</p> <p>16.3 Short-term forecasting of customer profitability 230</p> <p>16.4 Churn prediction 235</p> <p>16.5 Next-product-to-buy 236</p> <p>16.6 Conclusions and future research 238</p> <p>16.7 References 239</p> <p>List of Authors 241</p> <p>Index 245</p>
Christos H. Skiadas is the Founder and former Director of the Data Analysis and Forecasting Laboratory at the Technical University of Crete, Greece. He continues his work at the university at the ManLab in the Department of Production Engineering and Management.<br /><br />James R. Bozeman holds a PhD in Mathematics from Dartmouth College, USA, and is Professor of Mathematics at the American University of Malta.
This series of books collects a diverse array of work that provides the reader with theoretical and applied information on data analysis methods, models, and techniques, along with appropriate applications.<br /><br />Volume 1 begins with an introductory chapter by Gilbert Saporta, a leading expert in the field, who summarizes the developments in data analysis over the last 50 years. The book is then divided into three parts: Part 1 presents clustering and regression cases; Part 2 examines grouping and decomposition, GARCH and threshold models, structural equations, and SME modeling; and Part 3 presents symbolic data analysis, time series and multiple choice models, modeling in demography, and data mining.

Diese Produkte könnten Sie auch interessieren:

Statistics for Microarrays
Statistics for Microarrays
von: Ernst Wit, John McClure
PDF ebook
90,99 €