Details

Introduction to Statistics Through Resampling Methods and R


Introduction to Statistics Through Resampling Methods and R


2. Aufl.

von: Phillip I. Good

59,99 €

Verlag: Wiley
Format: EPUB
Veröffentl.: 18.12.2012
ISBN/EAN: 9781118497579
Sprache: englisch
Anzahl Seiten: 224

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

A highly accessible alternative approach to basic statistics Praise for the First Edition:  "Certainly one of the most impressive little paperback 200-page introductory statistics books that I will ever see . . . it would make a good nightstand book for every statistician."—Technometrics  <br /> <br /> Written in a highly accessible style, Introduction to Statistics through Resampling Methods and R, Second Edition guides students in the understanding of descriptive statistics, estimation, hypothesis testing, and model building. The book emphasizes the discovery method, enabling readers to ascertain solutions on their own rather than simply copy answers or apply a formula by rote.  The Second Edition utilizes the R programming language to simplify tedious computations, illustrate new concepts, and assist readers in completing exercises. The text facilitates quick learning through the use of:  <br /> <br /> More than 250 exercises—with selected "hints"—scattered throughout to stimulate readers' thinking and to actively engage them in applying their newfound skills  <br /> <br /> An increased focus on why a method is introduced  <br /> <br /> Multiple explanations of basic concepts <br /> <br /> Real-life applications in a variety of disciplines  <br /> <br /> Dozens of thought-provoking, problem-solving questions in the final chapter to assist readers in applying statistics to real-life applications  <br /> <br /> Introduction to Statistics through Resampling Methods and R, Second Edition is an excellent resource for students and practitioners in the fields of agriculture, astrophysics, bacteriology, biology, botany, business, climatology, clinical trials, economics, education, epidemiology, genetics, geology, growth processes, hospital administration, law, manufacturing, marketing, medicine, mycology, physics, political science, psychology, social welfare, sports, and toxicology who want to master and learn to apply statistical methods.
<p>Preface xi</p> <p><b>1. Variation 1</b></p> <p>1.1 Variation 1</p> <p>1.2 Collecting Data 2</p> <p>1.2.1 A Worked-Through Example 3</p> <p>1.3 Summarizing Your Data 4</p> <p>1.3.1 Learning to Use R 5</p> <p>1.4 Reporting Your Results 7</p> <p>1.4.1 Picturing Data 8</p> <p>1.4.2 Better Graphics 10</p> <p>1.5 Types of Data 11</p> <p>1.5.1 Depicting Categorical Data 12</p> <p>1.6 Displaying Multiple Variables 12</p> <p>1.6.1 Entering Multiple Variables 13</p> <p>1.6.2 From Observations to Questions 14</p> <p>1.7 Measures of Location 15</p> <p>1.7.1 Which Measure of Location? 17</p> <p>1.7.2 The Geometric Mean 18</p> <p>1.7.3 Estimating Precision 18</p> <p>1.7.4 Estimating with the Bootstrap 19</p> <p>1.8 Samples and Populations 20</p> <p>1.8.1 Drawing a Random Sample 22</p> <p>1.8.2 Using Data That Are Already in Spreadsheet Form 23</p> <p>1.8.3 Ensuring the Sample Is Representative 23</p> <p>1.9 Summary and Review 23</p> <p><b>2. Probability 25</b></p> <p>2.1 Probability 25</p> <p>2.1.1 Events and Outcomes 27</p> <p>2.1.2 Venn Diagrams 27</p> <p>2.2 Binomial Trials 29</p> <p>2.2.1 Permutations and Rearrangements 30</p> <p>2.2.2 Programming Your Own Functions in R 32</p> <p>2.2.3 Back to the Binomial 33</p> <p>2.2.4 The Problem Jury 33</p> <p>2.3 Conditional Probability 34</p> <p>2.3.1 Market Basket Analysis 36</p> <p>2.3.2 Negative Results 36</p> <p>2.4 Independence 38</p> <p>2.5 Applications to Genetics 39</p> <p>2.6 Summary and Review 40</p> <p><b>3. Two Naturally Occurring Probability Distributions 43</b></p> <p>3.1 Distribution of Values 43</p> <p>3.1.1 Cumulative Distribution Function 44</p> <p>3.1.2 Empirical Distribution Function 45</p> <p>3.2 Discrete Distributions 46</p> <p>3.3 The Binomial Distribution 47</p> <p>3.3.1 Expected Number of Successes in <i>n </i>Binomial Trials 47</p> <p>3.3.2 Properties of the Binomial 48</p> <p>3.4 Measuring Population Dispersion and Sample Precision 51</p> <p>3.5 Poisson: Events Rare in Time and Space 53</p> <p>3.5.1 Applying the Poisson 53</p> <p>3.5.2 Comparing Empirical and Theoretical Poisson Distributions 54</p> <p>3.5.3 Comparing Two Poisson Processes 55</p> <p>3.6 Continuous Distributions 55</p> <p>3.6.1 The Exponential Distribution 56</p> <p>3.7 Summary and Review 57</p> <p><b>4. Estimation and the Normal Distribution 59</b></p> <p>4.1 Point Estimates 59</p> <p>4.2 Properties of the Normal Distribution 61</p> <p>4.2.1 Student’s <i>t</i>-Distribution 63</p> <p>4.2.2 Mixtures of Normal Distributions 64</p> <p>4.3 Using Confidence Intervals to Test Hypotheses 65</p> <p>4.3.1 Should We Have Used the Bootstrap? 65</p> <p>4.3.2 The Bias-Corrected and Accelerated Nonparametric Bootstrap 66</p> <p>4.3.3 The Parametric Bootstrap 68</p> <p>4.4 Properties of Independent Observations 69</p> <p>4.5 Summary and Review 70</p> <p><b>5. Testing Hypotheses 71</b></p> <p>5.1 Testing a Hypothesis 71</p> <p>5.1.1 Analyzing the Experiment 72</p> <p>5.1.2 Two Types of Errors 74</p> <p>5.2 Estimating Effect Size 76</p> <p>5.2.1 Effect Size and Correlation 76</p> <p>5.2.2 Using Confidence Intervals to Test Hypotheses 78</p> <p>5.3 Applying the <i>t</i>-Test to Measurements 79</p> <p>5.3.1 Two-Sample Comparison 80</p> <p>5.3.2 Paired <i>t</i>-Test 80</p> <p>5.4 Comparing Two Samples 81</p> <p>5.4.1 What Should We Measure? 81</p> <p>5.4.2 Permutation Monte Carlo 82</p> <p>5.4.3 One- vs. Two-Sided Tests 83</p> <p>5.4.4 Bias-Corrected Nonparametric Bootstrap 83</p> <p>5.5 Which Test Should We Use? 84</p> <p>5.5.1 <i>p</i>-Values and Significance Levels 85</p> <p>5.5.2 Test Assumptions 85</p> <p>5.5.3 Robustness 86</p> <p>5.5.4 Power of a Test Procedure 87</p> <p>5.6 Summary and Review 89</p> <p><b>6. Designing an Experiment or Survey 91</b></p> <p>6.1 The Hawthorne Effect 91</p> <p>6.1.1 Crafting an Experiment 92</p> <p>6.2 Designing an Experiment or Survey 94</p> <p>6.2.1 Objectives 94</p> <p>6.2.2 Sample from the Right Population 95</p> <p>6.2.3 Coping with Variation 97</p> <p>6.2.4 Matched Pairs 98</p> <p>6.2.5 The Experimental Unit 99</p> <p>6.2.6 Formulate Your Hypotheses 99</p> <p>6.2.7 What Are You Going to Measure? 100</p> <p>6.2.8 Random Representative Samples 101</p> <p>6.2.9 Treatment Allocation 102</p> <p>6.2.10 Choosing a Random Sample 103</p> <p>6.2.11 Ensuring Your Observations Are Independent 103</p> <p>6.3 How Large a Sample? 104</p> <p>6.3.1 Samples of Fixed Size 106</p> <p>6.3.1.1 Known Distribution 106</p> <p>6.3.1.2 Almost Normal Data 108</p> <p>6.3.1.3 Bootstrap 110</p> <p>6.3.2 Sequential Sampling 112</p> <p>6.3.2.1 Stein’s Two-Stage Sampling Procedure 112</p> <p>6.3.2.2 Wald Sequential Sampling 112</p> <p>6.3.2.3 Adaptive Sampling 115</p> <p>6.4 Meta-Analysis 116</p> <p>6.5 Summary and Review 116</p> <p><b>7. Guide to Entering, Editing, Saving, and Retrieving Large Quantities of Data Using R 119</b></p> <p>7.1 Creating and Editing a Data File 120</p> <p>7.2 Storing and Retrieving Files from within R 120</p> <p>7.3 Retrieving Data Created by Other Programs 121</p> <p>7.3.1 The Tabular Format 121</p> <p>7.3.2 Comma-Separated Values 121</p> <p>7.3.3 Data from Microsoft Excel 122</p> <p>7.3.4 Data from Minitab, SAS, SPSS, or Stata Data Files 122</p> <p>7.4 Using R to Draw a Random Sample 122</p> <p><b>8. Analyzing Complex Experiments 125</b></p> <p>8.1 Changes Measured in Percentages 125</p> <p>8.2 Comparing More Than Two Samples 126</p> <p>8.2.1 Programming the Multi-Sample Comparison in R 127</p> <p>8.2.2 Reusing Your R Functions 128</p> <p>8.2.3 What Is the Alternative? 129</p> <p>8.2.4 Testing for a Dose Response or Other Ordered Alternative 129</p> <p>8.3 Equalizing Variability 131</p> <p>8.4 Categorical Data 132</p> <p>8.4.1 Making Decisions with R 134</p> <p>8.4.2 One-Sided Fisher’s Exact Test 135</p> <p>8.4.3 The Two-Sided Test 136</p> <p>8.4.4 Testing for Goodness of Fit 137</p> <p>8.4.5 Multinomial Tables 137</p> <p>8.5 Multivariate Analysis 139</p> <p>8.5.1 Manipulating Multivariate Data in R 140</p> <p>8.5.2 Hotelling’s <i>T</i>2 141</p> <p>8.5.3 Pesarin–Fisher Omnibus Statistic 142</p> <p>8.6 R Programming Guidelines 144</p> <p>8.7 Summary and Review 148</p> <p><b>9. Developing Models 149</b></p> <p>9.1 Models 149</p> <p>9.1.1 Why Build Models? 150</p> <p>9.1.2 Caveats 152</p> <p>9.2 Classification and Regression Trees 152</p> <p>9.2.1 Example: Consumer Survey 153</p> <p>9.2.2 How Trees Are Grown 156</p> <p>9.2.3 Incorporating Existing Knowledge 158</p> <p>9.2.4 Prior Probabilities 158</p> <p>9.2.5 Misclassification Costs 159</p> <p>9.3 Regression 160</p> <p>9.3.1 Linear Regression 161</p> <p>9.4 Fitting a Regression Equation 162</p> <p>9.4.1 Ordinary Least Squares 162</p> <p>9.4.2 Types of Data 165</p> <p>9.4.3 Least Absolute Deviation Regression 166</p> <p>9.4.4 Errors-in-Variables Regression 167</p> <p>9.4.5 Assumptions 168</p> <p>9.5 Problems with Regression 169</p> <p>9.5.1 Goodness of Fit versus Prediction 169</p> <p>9.5.2 Which Model? 170</p> <p>9.5.3 Measures of Predictive Success 171</p> <p>9.5.4 Multivariable Regression 171</p> <p>9.6 Quantile Regression 174</p> <p>9.7 Validation 176</p> <p>9.7.1 Independent Verification 176</p> <p>9.7.2 Splitting the Sample 177</p> <p>9.7.3 Cross-Validation with the Bootstrap 178</p> <p>9.8 Summary and Review 178</p> <p><b>10. Reporting Your Findings 181</b></p> <p>10.1 What to Report 181</p> <p>10.1.1 Study Objectives 182</p> <p>10.1.2 Hypotheses 182</p> <p>10.1.3 Power and Sample Size Calculations 182</p> <p>10.1.4 Data Collection Methods 183</p> <p>10.1.5 Clusters 183</p> <p>10.1.6 Validation Methods 184</p> <p>10.2 Text, Table, or Graph? 185</p> <p>10.3 Summarizing Your Results 186</p> <p>10.3.1 Center of the Distribution 189</p> <p>10.3.2 Dispersion 189</p> <p>10.3.3 Categorical Data 190</p> <p>10.4 Reporting Analysis Results 191</p> <p>10.4.1 <i>p</i>-Values? Or Confidence Intervals? 192</p> <p>10.5 Exceptions Are the Real Story 193</p> <p>10.5.1 Nonresponders 193</p> <p>10.5.2 The Missing Holes 194</p> <p>10.5.3 Missing Data 194</p> <p>10.5.4 Recognize and Report Biases 194</p> <p>10.6 Summary and Review 195</p> <p><b>11. Problem Solving 197</b></p> <p>11.1 The Problems 197</p> <p>11.2 Solving Practical Problems 201</p> <p>11.2.1 Provenance of the Data 201</p> <p>11.2.2 Inspect the Data 202</p> <p>11.2.3 Validate the Data Collection Methods 202</p> <p>11.2.4 Formulate Hypotheses 203</p> <p>11.2.5 Choosing a Statistical Methodology 203</p> <p>11.2.6 Be Aware of What You Don’t Know 204</p> <p>11.2.7 Qualify Your Conclusions 204</p> <p>Answers to Selected Exercises 205</p> <p>Index 207</p>
<p><b>PHILLIP I. GOOD, PhD,</b> is Operations Manager of Information Research, a consulting firm specializing in statistical solutions for private and public organizations. He has published over thirty scholarly works, more than 600 articles, and forty-four books, including <i>Common Errors in Statistics (and How to Avoid Them)</i> and <i>A Manager's Guide to the Design and Conduct of Clinical Trials,</i> both published by Wiley.</p>
<p><b>A highly accessible alternative approach to basic statistics</b></p> <p>Praise for the <i>First Edition:</i></p> <p>"Certainly one of the most impressive little paperback 200-page introductory statistics books that I will ever see . . . it would make a good nightstand book for every statistician."<br /> <i>—Technometrics</i></p> <p>Written in a highly accessible style, <i>Introduction to Statistics through Resampling Methods and R, Second Edition</i> guides students in the understanding of descriptive statistics, estimation, hypothesis testing, and model building. The book emphasizes the discovery method, enabling readers to ascertain solutions on their own rather than simply copy answers or apply a formula by rote.</p> <p>The <i>Second Edition</i> utilizes the R programming language to simplify tedious computations, illustrate new concepts, and assist readers in completing exercises. The text facilitates quick learning through the use of:</p> <ul> <li>More than 250 exercises—with selected "hints"—scattered throughout to stimulate readers' thinking and to actively engage them in applying their newfound skills</li> <li>An increased focus on <i>why</i> a method is introduced</li> <li>Multiple explanations of basic concepts</li> <li>Real-life applications in a variety of disciplines</li> <li>Dozens of thought-provoking, problem-solving questions in the final chapter to assist readers in applying statistics to real-life applications</li> </ul> <p><i>Introduction to Statistics through Resampling Methods and R, Second Edition</i> is an excellent resource for students and practitioners in the fields of agriculture, astrophysics, bacteriology, biology, botany, business, climatology, clinical trials, economics, education, epidemiology, genetics, geology, growth processes, hospital administration, law, manufacturing, marketing, medicine, mycology, physics, political science, psychology, social welfare, sports, and toxicology who want to master and learn to apply statistical methods.</p>

Diese Produkte könnten Sie auch interessieren:

Statistics for Microarrays
Statistics for Microarrays
von: Ernst Wit, John McClure
PDF ebook
90,99 €