Details

Principles of Managerial Statistics and Data Science


Principles of Managerial Statistics and Data Science


1. Aufl.

von: Roberto Rivera

111,99 €

Verlag: Wiley
Format: EPUB
Veröffentl.: 31.01.2020
ISBN/EAN: 9781119486497
Sprache: englisch
Anzahl Seiten: 688

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p><b>Introduces readers to the principles of managerial statistics and data science, with an emphasis on statistical literacy of business students  </b> </p> <p>Through a statistical perspective, this book introduces readers to the topic of data science, including Big Data, data analytics, and data wrangling. Chapters include multiple examples showing the application of the theoretical aspects presented. It features practice problems designed to ensure that readers understand the concepts and can apply them using real data. Over 100 open data sets used for examples and problems come from regions throughout the world, allowing the instructor to adapt the application to local data with which students can identify. Applications with these data sets include:</p> <ul> <li>Assessing if searches during a police stop in San Diego are dependent on driver’s race</li> <li>Visualizing the association between fat percentage and moisture percentage in Canadian cheese</li> <li>Modeling taxi fares in Chicago using data from millions of rides</li> <li>Analyzing mean sales per unit of legal marijuana products in Washington state</li> </ul> <p>Topics covered in <i>Principles of Managerial Statistics and Data Science </i>include:data visualization; descriptive measures; probability; probability distributions; mathematical expectation; confidence intervals; and hypothesis testing. Analysis of variance; simple linear regression; and multiple linear regression are also included. In addition, the book offers contingency tables, Chi-square tests, non-parametric methods, and time series methods. The textbook: </p> <ul> <li>Includes academic material usually covered in introductory Statistics courses, but with a data science twist, and less emphasis in the theory</li> <li>Relies on Minitab to present how to perform tasks with a computer</li> <li>Presents and motivates use of data that comes from open portals</li> <li>Focuses on developing an intuition on how the procedures work</li> <li>Exposes readers to the potential in Big Data and current failures of its use</li> <li>Supplementary material includes: a companion website that houses PowerPoint slides; an Instructor's Manual with tips, a syllabus model, and project ideas; R code to reproduce examples and case studies; and information about the open portal data  </li> <li>Features an appendix with solutions to some practice problems</li> </ul> <p><i>Principles of Managerial Statistics and Data Science </i>is a textbook for undergraduate and graduate students taking managerial Statistics courses, and a reference book for working business professionals.</p>
<p>Preface xv</p> <p>Acknowledgments xvii</p> <p>Acronyms xix</p> <p>About the Companion Site xxi</p> <p>Principles of Managerial Statistics and Data Science xxiii</p> <p><b>1 Statistics Suck; So Why Do I Need to Learn About It? </b><b>1</b></p> <p>1.1 Introduction 1</p> <p>Practice Problems 4</p> <p>1.2 Data-Based Decision Making: Some Applications 5</p> <p>1.3 Statistics Defined 9</p> <p>1.4 Use of Technology and the New Buzzwords: Data Science, Data Analytics, and Big Data 11</p> <p>1.4.1 A Quick Look at Data Science: Some Definitions 11</p> <p>Chapter Problems 14</p> <p>Further Reading 14</p> <p><b>2 Concepts in Statistics </b><b>15</b></p> <p>2.1 Introduction 15</p> <p>Practice Problems 17</p> <p>2.2 Type of Data 19</p> <p>Practice Problems 20</p> <p>2.3 Four Important Notions in Statistics 22</p> <p>Practice Problems 24</p> <p>2.4 Sampling Methods 25</p> <p>2.4.1 Probability Sampling 25</p> <p>2.4.2 Nonprobability Sampling 27</p> <p>Practice Problems 30</p> <p>2.5 Data Management 31</p> <p>2.5.1 A Quick Look at Data Science: Data Wrangling Baltimore Housing Variables 34</p> <p>2.6 Proposing a Statistical Study 36</p> <p>Chapter Problems 37</p> <p>Further Reading 39</p> <p><b>3 Data Visualization </b><b>41</b></p> <p>3.1 Introduction 41</p> <p>3.2 Visualization Methods for Categorical Variables 41</p> <p>Practice Problems 46</p> <p>3.3 Visualization Methods for Numerical Variables 50</p> <p>Practice Problems 56</p> <p>3.4 Visualizing Summaries of More than Two Variables Simultaneously 59</p> <p>3.4.1 A Quick Look at Data Science: Does Race Affect the Chances of a Driver Being Searched During a Vehicle Stop in San Diego? 66</p> <p>Practice Problems 69</p> <p>3.5 Novel Data Visualization 75</p> <p>3.5.1 A Quick Look at Data Science: Visualizing Association Between Baltimore Housing Variables Over 14 Years 78</p> <p>Chapter Problems 81</p> <p>Further Reading 96</p> <p><b>4 Descriptive Statistics </b><b>97</b></p> <p>4.1 Introduction 97</p> <p>4.2 Measures of Centrality 99</p> <p>Practice Problems 108</p> <p>4.3 Measures of Dispersion 111</p> <p>Practice Problems 115</p> <p>4.4 Percentiles 116</p> <p>4.4.1 Quartiles 117</p> <p>Practice Problems 122</p> <p>4.5 Measuring the Association Between Two Variables 124</p> <p>Practice Problems 128</p> <p>4.6 Sample Proportion and Other Numerical Statistics 130</p> <p>4.6.1 A Quick Look at Data Science: Murder Rates in Los Angeles 131</p> <p>4.7 How to Use Descriptive Statistics 132</p> <p>Chapter Problems 133</p> <p>Further Reading 139</p> <p><b>5 Introduction to Probability </b><b>141</b></p> <p>5.1 Introduction 141</p> <p>5.2 Preliminaries 142</p> <p>Practice Problems 144</p> <p>5.3 The Probability of an Event 145</p> <p>Practice Problems 148</p> <p>5.4 Rules and Properties of Probabilities 149</p> <p>Practice Problems 152</p> <p>5.5 Conditional Probability and Independent Events 154</p> <p>Practice Problems 159</p> <p>5.6 Empirical Probabilities 161</p> <p>5.6.1 A Quick Look at Data Science: Missing People Reports in Boston by Day of Week 164</p> <p>Practice Problems 165</p> <p>5.7 Counting Outcomes 168</p> <p>Practice Problems 171</p> <p>Chapter Problems 171</p> <p>Further Reading 175</p> <p><b>6 Discrete Random Variables </b><b>177</b></p> <p>6.1 Introduction 177</p> <p>6.2 General Properties 178</p> <p>6.2.1 A Quick Look at Data Science: Number of Stroke Emergency Calls in Manhattan 183</p> <p>Practice Problems 184</p> <p>6.3 Properties of Expected Value and Variance 186</p> <p>Practice Problems 189</p> <p>6.4 Bernoulli and Binomial Random Variables 190</p> <p>Practice Problems 197</p> <p>6.5 Poisson Distribution 198</p> <p>Practice Problems 201</p> <p>6.6 Optional: Other Useful Probability Distributions 203</p> <p>Chapter Problems 205</p> <p>Further Reading 208</p> <p><b>7 Continuous Random Variables </b><b>209</b></p> <p>7.1 Introduction 209</p> <p>Practice Problems 211</p> <p>7.2 The Uniform Probability Distribution 211</p> <p>Practice Problems 215</p> <p>7.3 The Normal Distribution 216</p> <p>Practice Problems 225</p> <p>7.4 Probabilities for Any Normally Distributed Random Variable 227</p> <p>7.4.1 A Quick Look at Data Science: Normal Distribution, A Good Match for University of Puerto Rico SATs? 229</p> <p>Practice Problems 231</p> <p>7.5 Approximating the Binomial Distribution 234</p> <p>Practice Problems 236</p> <p>7.6 Exponential Distribution 236</p> <p>Practice Problems 238</p> <p>Chapter Problems 239</p> <p>Further Reading 242</p> <p><b>8 Properties of Sample Statistics </b><b>243</b></p> <p>8.1 Introduction 243</p> <p>8.2 Expected Value and Standard Deviation of <i>x̄ </i>244</p> <p>Practice Problems 246</p> <p>8.3 Sampling Distribution of <i>x̄ </i>When Sample Comes From a Normal Distribution 247</p> <p>Practice Problems 251</p> <p>8.4 Central Limit Theorem 252</p> <p>8.4.1 A Quick Look at Data Science: Bacteria at New York City Beaches 257</p> <p>Practice Problems 259</p> <p>8.5 Other Properties of Estimators 261</p> <p>Chapter Problems 264</p> <p>Further Reading 267</p> <p><b>9 Interval Estimation for One Population Parameter </b><b>269</b></p> <p>9.1 Introduction 269</p> <p>9.2 Intuition of a Two-Sided Confidence Interval 270</p> <p>9.3 Confidence Interval for the Population Mean: <i>𝜎 </i>Known 271</p> <p>Practice Problems 276</p> <p>9.4 Determining Sample Size for a Confidence Interval for <i>𝜇 </i>278</p> <p>Practice Problems 279</p> <p>9.5 Confidence Interval for the Population Mean: <i>𝜎 </i>Unknown 279</p> <p>Practice Problems 284</p> <p>9.6 Confidence Interval for <i>𝜋 </i>286</p> <p>Practice Problems 287</p> <p>9.7 Determining Sample Size for <i>𝜋 </i>Confidence Interval 288</p> <p>Practice Problems 290</p> <p>9.8 Optional: Confidence Interval for <i>𝜎 </i>290</p> <p>9.8.1 A Quick Look at Data Science: A Confidence Interval for the Standard Deviation of Walking Scores in Baltimore 292</p> <p>Chapter Problems 293</p> <p>Further Reading 296</p> <p><b>10 Hypothesis Testing for One Population </b><b>297</b></p> <p>10.1 Introduction 297</p> <p>10.2 Basics of Hypothesis Testing 299</p> <p>10.3 Steps to Perform a Hypothesis Test 304</p> <p>Practice Problems 305</p> <p>10.4 Inference on the Population Mean: Known Standard Deviation 306</p> <p>Practice Problems 318</p> <p>10.5 Hypothesis Testing for the Mean (<i>𝜎 </i>Unknown) 323</p> <p>Practice Problems 327</p> <p>10.6 Hypothesis Testing for the Population Proportion 329</p> <p>10.6.1 A Quick Look at Data Science: Proportion of New York City High Schools with a Mean SAT Score of 1498 or More 333</p> <p>Practice Problems 334</p> <p>10.7 Hypothesis Testing for the Population Variance 337</p> <p>10.8 More on the <i>p</i>-Value and Final Remarks 338</p> <p>10.8.1 Misunderstanding the <i>p</i>-Value 339</p> <p>Chapter Problems 343</p> <p>Further Reading 347</p> <p><b>11 Statistical Inference to Compare Parameters from Two Populations </b><b>349</b></p> <p>11.1 Introduction 349</p> <p>11.2 Inference on Two Population Means 350</p> <p>11.3 Inference on Two Population Means – Independent Samples, Variances Known 351</p> <p>Practice Problems 357</p> <p>11.4 Inference on Two Population Means When Two Independent Samples are Used – Unknown Variances 360</p> <p>11.4.1 A Quick Look at Data Science: Suicide Rates Among Asian Men and Women in New York City 364</p> <p>Practice Problems 366</p> <p>11.5 Inference on Two Means Using Two Dependent Samples 368</p> <p>Practice Problems 370</p> <p>11.6 Inference on Two Population Proportions 371</p> <p>Practice Problems 374</p> <p>Chapter Problems 375</p> <p>References 378</p> <p>Further Reading 378</p> <p><b>12 Analysis of Variance (ANOVA) </b><b>379</b></p> <p>12.1 Introduction 379</p> <p>Practice Problems 382</p> <p>12.2 ANOVA for One Factor 383</p> <p>Practice Problems 390</p> <p>12.3 Multiple Comparisons 391</p> <p>Practice Problems 395</p> <p>12.4 Diagnostics of ANOVA Assumptions 395</p> <p>12.4.1 A Quick Look at Data Science: Emergency Response Time for Cardiac Arrest in New York City 399</p> <p>Practice Problems 403</p> <p>12.5 ANOVA with Two Factors 404</p> <p>Practice Problems 409</p> <p>12.6 Extensions to ANOVA 413</p> <p>Chapter Problems 416</p> <p>Further Reading 419</p> <p><b>13 Simple Linear Regression </b><b>421</b></p> <p>13.1 Introduction 421</p> <p>13.2 Basics of Simple Linear Regression 423</p> <p>Practice Problems 425</p> <p>13.3 Fitting the Simple Linear Regression Parameters 426</p> <p>Practice Problems 429</p> <p>13.4 Inference for Simple Linear Regression 431</p> <p>Practice Problems 440</p> <p>13.5 Estimating and Predicting the Response Variable 443</p> <p>Practice Problems 446</p> <p>13.6 A Binary <i>X </i>448</p> <p>Practice Problems 449</p> <p>13.7 Model Diagnostics (Residual Analysis) 450</p> <p>Practice Problems 456</p> <p>13.8 What Correlation Doesn’t Mean 458</p> <p>13.8.1 A Quick Look at Data Science: Can Rate of College Educated People Help Predict the Rate of Narcotic Problems in Baltimore? 461</p> <p>Chapter Problems 466</p> <p>Further Reading 472</p> <p><b>14 Multiple Linear Regression </b><b>473</b></p> <p>14.1 Introduction 473</p> <p>14.2 The Multiple Linear Regression Model 474</p> <p>Practice Problems 477</p> <p>14.3 Inference for Multiple Linear Regression 478</p> <p>Practice Problems 483</p> <p>14.4 Multicollinearity and Other Modeling Aspects 486</p> <p>Practice Problems 490</p> <p>14.5 Variability Around the Regression Line: Residuals and Intervals 492</p> <p>Practice Problems 494</p> <p>14.6 Modifying Predictors 494</p> <p>Practice Problems 495</p> <p>14.7 General Linear Model 496</p> <p>Practice Problems 502</p> <p>14.8 Steps to Fit a Multiple Linear Regression Model 505</p> <p>14.9 Other Regression Topics 507</p> <p>14.9.1 A Quick Look at Data Science: Modeling Taxi Fares in Chicago 510</p> <p>Chapter Problems 513</p> <p>Further Reading 517</p> <p><b>15 Inference on Association of Categorical Variables </b><b>519</b></p> <p>15.1 Introduction 519</p> <p>15.2 Association Between Two Categorical Variables 520</p> <p>15.2.1 A Quick Look at Data Science: Affordability and Business Environment in Chattanooga 525</p> <p>Practice Problems 529</p> <p>Chapter Problems 532</p> <p>Further Reading 532</p> <p><b>16 Nonparametric Testing </b><b>533</b></p> <p>16.1 Introduction 533</p> <p>16.2 Sign Tests and Wilcoxon Sign-Rank Tests: One Sample and Matched Pairs Scenarios 533</p> <p>Practice Problems 537</p> <p>16.3 Wilcoxon Rank-Sum Test: Two Independent Samples 539</p> <p>16.3.1 A Quick Look at Data Science: Austin, Texas, as a Place to Live; Do Men Rate It Higher Than Women? 540</p> <p>Practice Problems 543</p> <p>16.4 Kruskal–Wallis Test: More Than Two Samples 544</p> <p>Practice Problems 546</p> <p>16.5 Nonparametric Tests Versus Their Parametric Counterparts 547</p> <p>Chapter Problems 548</p> <p>Further Reading 549</p> <p><b>17 Forecasting </b><b>551</b></p> <p>17.1 Introduction 551</p> <p>17.2 Time Series Components 552</p> <p>Practice Problems 557</p> <p>17.3 Simple Forecasting Models 558</p> <p>Practice Problems 562</p> <p>17.4 Forecasting When Data Has Trend, Seasonality 563</p> <p>Practice Problems 569</p> <p>17.5 Assessing Forecasts 572</p> <p>17.5.1 A Quick Look at Data Science: Forecasting Tourism Jobs in Canada 575</p> <p>17.5.2 A Quick Look at Data Science: Forecasting Retail Gross Sales of Marijuana in Denver 577</p> <p>Chapter Problems 580</p> <p>Further Reading 581</p> <p><b>Appendix A Math Notation and Symbols </b><b>583</b></p> <p>A.1 Summation 583</p> <p>A.2 <i>p</i>th Power 583</p> <p>A.3 Inequalities 584</p> <p>A.4 Factorials 584</p> <p>A.5 Exponential Function 585</p> <p>A.6 Greek and Statistics Symbols 585</p> <p><b>Appendix B Standard Normal Cumulative Distribution Function </b><b>587</b></p> <p><b>Appendix C <i>t </i>Distribution Critical Values </b><b>591</b></p> <p><b>Appendix D Solutions to Odd-Numbered Problems </b><b>593</b></p> <p>Index 643</p>
<p><b>ROBERTO RIVERA, PHD,</b> is a Professor, at the College of Business, University of Puerto Rico, Mayagüez. He received his PhD in Statistics from the University of California, Santa Barbara. He founded the Puerto Rico Chapter of the American Statistical Association. Dr. Rivera is also the co-author of <i>Applications of Regression Models in Epidemiology</i> (2017).
<p><b>INTRODUCES READERS TO THE PRINCIPLES OF MANAGERIAL STATISTICS AND DATA SCIENCE, WITH AN EMPHASIS ON STATISTICAL LITERACY OF BUSINESS STUDENTS</b> <p>Through a statistical perspective, this book introduces readers to the topic of data science, including Big Data, data analytics, and data wrangling. Chapters include multiple examples showing the application of the theoretical aspects presented. It features practice problems designed to ensure that readers understand the concepts and can apply them using real data. Over 100 open data sets used for examples and problems come from regions throughout the world, allowing the instructor to adapt the application to local data with which students can identify. Applications with these data sets include: <ul> <li>Assessing if searches during a police stop in San Diego are dependent on driver's race</li> <li>Visualizing the association between fat percentage and moisture percentage in Canadian cheese</li> <li>Modeling taxi fares in Chicago using data from millions of rides</li> <li>Analyzing mean sales per unit of legal marijuana products in Washington state</li> </ul> <p>Topics covered in <i>Principles of Managerial Statistics and Data Science</i> include: data visualization; descriptive measures; probability; probability distributions; mathematical expectation; confidence intervals; and hypothesis testing. Analysis of variance; simple linear regression; and multiple linear regression are also included. In addition, the book offers contingency tables, Chi-square tests, non-parametric methods, and time series methods. The textbook: <ul> <li>Includes academic material usually covered in introductory Statistics courses, but with a data science twist, and less emphasis in the theory</li> <li>Relies on Minitab to present how to perform tasks with a computer</li> <li>Presents and motivates use of data that comes from open portals</li> <li>Focuses on developing an intuition on how the procedures work</li> <li>Exposes readers to the potential in Big Data and current failures of its use</li> <li>Supplementary material includes: a companion website that houses PowerPoint slides; an Instructor's Manual with tips, a syllabus model, and project ideas; R code to reproduce examples and case studies; and information about the open portal data</li> <li>Features an appendix with solutions to some practice problems</li> </ul> <p><i>Principles of Managerial Statistics and Data Science</i> is a textbook for undergraduate and graduate students taking managerial Statistics courses, and a reference book for working business professionals.

Diese Produkte könnten Sie auch interessieren:

Statistics for Microarrays
Statistics for Microarrays
von: Ernst Wit, John McClure
PDF ebook
90,99 €