Details

Data Science Handbook


Data Science Handbook

A Practical Approach
1. Aufl.

von: Kolla Bhanu Prakash

134,99 €

Verlag: Wiley
Format: PDF
Veröffentl.: 02.08.2022
ISBN/EAN: 9781119858003
Sprache: englisch
Anzahl Seiten: 480

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p><b>DATA SCIENCE HANDBOOK</b></p> <p><b>This desk reference handbook gives a hands-on experience on various algorithms and popular techniques used in real-time in data science to all researchers working in various domains. </b></p> <p>Data Science is one of the leading research-driven areas in the modern era. It is having a critical role in healthcare, engineering, education, mechatronics, and medical robotics. Building models and working with data is not value-neutral. We choose the problems with which we work, make assumptions in these models, and decide on metrics and algorithms for the problems. The data scientist identifies the problem which can be solved with data and expert tools of modeling and coding.</p> <p>The book starts with introductory concepts in data science like data munging, data preparation, and transforming data. Chapter 2 discusses data visualization, drawing various plots and histograms. Chapter 3 covers mathematics and statistics for data science. Chapter 4 mainly focuses on machine learning algorithms in data science. Chapter 5 comprises of outlier analysis and DBSCAN algorithm. Chapter 6 focuses on clustering. Chapter 7 discusses network analysis. Chapter 8 mainly focuses on regression and naive-bayes classifier. Chapter 9 covers web-based data visualizations with Plotly. Chapter 10 discusses web scraping.</p> <p>The book concludes with a section discussing 19 projects on various subjects in data science.</p> <p><b>Audience </b></p> <p>The handbook will be used by graduate students up to research scholars in computer science and electrical engineering as well as industry professionals in a range of industries such as healthcare.</p>
<p>Acknowledgment xi</p> <p>Preface xiii</p> <p><b>1 Data Munging Basics</b></p> <p>1 Introduction 1</p> <p>1.1 Filtering and Selecting Data 6</p> <p>1.2 Treating Missing Values 11</p> <p>1.3 Removing Duplicates 14</p> <p>1.4 Concatenating and Transforming Data 16</p> <p>1.5 Grouping and Data Aggregation 20</p> <p>References 20</p> <p><b>2 Data Visualization 23</b></p> <p>2.1 Creating Standard Plots (Line, Bar, Pie) 26</p> <p>2.2 Defining Elements of a Plot 30</p> <p>2.3 Plot Formatting 33</p> <p>2.4 Creating Labels and Annotations 38</p> <p>2.5 Creating Visualizations from Time Series Data 42</p> <p>2.6 Constructing Histograms, Box Plots, and Scatter Plots 44</p> <p>References 54</p> <p><b>3 Basic Math and Statistics 57</b></p> <p>3.1 Linear Algebra 57</p> <p>3.2 Calculus 58</p> <p>3.2.1 Differential Calculus 58</p> <p>3.2.2 Integral Calculus 58</p> <p>3.3 Inferential Statistics 60</p> <p>3.3.1 Central Limit Theorem 60</p> <p>3.3.2 Hypothesis Testing 60</p> <p>3.3.3 ANOVA 60</p> <p>3.3.4 Qualitative Data Analysis 60</p> <p>3.4 Using NumPy to Perform Arithmetic Operations on Data 61</p> <p>3.5 Generating Summary Statistics Using Pandas and Scipy 64</p> <p>3.6 Summarizing Categorical Data Using Pandas 68</p> <p>3.7 Starting with Parametric Methods in Pandas and Scipy 84</p> <p>3.8 Delving Into Non-Parametric Methods Using Pandas and Scipy 87</p> <p>3.9 Transforming Dataset Distributions 91</p> <p>References 94</p> <p><b>4 Introduction to Machine Learning 97</b></p> <p>4.1 Introduction to Machine Learning 97</p> <p>4.2 Types of Machine Learning Algorithms 101</p> <p>4.3 Explanatory Factor Analysis 114</p> <p>4.4 Principal Component Analysis (PCA) 115</p> <p>References 121</p> <p><b>5 Outlier Analysis 123</b></p> <p>5.1 Extreme Value Analysis Using Univariate Methods 123</p> <p>5.2 Multivariate Analysis for Outlier Detection 125</p> <p>5.3 DBSCan Clustering to Identify Outliers 127</p> <p>References 133</p> <p><b>6 Cluster Analysis 135</b></p> <p>6.1 K-Means Algorithm 135</p> <p>6.2 Hierarchial Methods 141</p> <p>6.3 Instance-Based Learning w/ k-Nearest Neighbor 149</p> <p>References 156</p> <p><b>7 Network Analysis with NetworkX 157</b></p> <p>7.1 Working with Graph Objects 159</p> <p>7.2 Simulating a Social Network (ie; Directed Network Analysis) 163</p> <p>7.3 Analyzing a Social Network 169</p> <p>References 171</p> <p><b>8 Basic Algorithmic Learning 173</b></p> <p>8.1 Linear Regression 173</p> <p>8.2 Logistic Regression 183</p> <p>8.3 Naive Bayes Classifiers 189</p> <p>References 195</p> <p><b>9 Web-Based Data Visualizations with Plotly 197</b></p> <p>9.1 Collaborative Aanalytics 197</p> <p>9.2 Basic Charts 208</p> <p>9.3 Statistical Charts 212</p> <p>9.4 Plotly Maps 216</p> <p>References 219</p> <p><b>10 Web Scraping with Beautiful Soup 221</b></p> <p>10.1 The BeautifulSoup Object 224</p> <p>10.2 Exploring NavigableString Objects 228</p> <p>10.3 Data Parsing 230</p> <p>10.4 Web Scraping 233</p> <p>10.5 Ensemble Models with Random Forests 235</p> <p>References 254</p> <p><b>Data Science Projects 257</b></p> <p><b>11 Covid19 Detection and Prediction 259</b></p> <p>Bibliography 275</p> <p><b>12 Leaf Disease Detection 277</b></p> <p>Bibliography 283</p> <p><b>13 Brain Tumor Detection with Data Science 285</b></p> <p>Bibliography 295</p> <p><b>14 Color Detection with Python 297</b></p> <p>Bibliography 300</p> <p><b>15 Detecting Parkinson’s Disease 301</b></p> <p>Bibliography 302</p> <p><b>16 Sentiment Analysis 303</b></p> <p>Bibliography 306</p> <p><b>17 Road Lane Line Detection 307</b></p> <p>Bibliography 315</p> <p><b>18 Fake News Detection 317</b></p> <p>Bibliography 318</p> <p><b>19 Speech Emotion Recognition 319</b></p> <p>Bibliography 322</p> <p><b>20 Gender and Age Detection with Data Science 323</b></p> <p>Bibliography 339</p> <p><b>21 Diabetic Retinopathy 341</b></p> <p>Bibliography 350</p> <p><b>22 Driver Drowsiness Detection in Python 351</b></p> <p>Bibliography 356</p> <p><b>23 Chatbot Using Python 357</b></p> <p>Bibliography 363</p> <p><b>24 Handwritten Digit Recognition Project 365</b></p> <p>Bibliography 368</p> <p><b>25 Image Caption Generator Project in Python 369</b></p> <p>Bibliography 379</p> <p><b>26 Credit Card Fraud Detection Project 381</b></p> <p>Bibliography 391</p> <p><b>27 Movie Recommendation System 393</b></p> <p>Bibliography 411</p> <p><b>28 Customer Segmentation 413</b></p> <p>Bibliography 431</p> <p><b>29 Breast Cancer Classification 433</b></p> <p>Bibliography 443</p> <p><b>30 Traffic Signs Recognition 445</b></p> <p>Bibliography 453</p>
<p><b>Kolla Bhanu Prakash, PhD,</b> is a Professor and Research Group Head for A.I. & Data Science Research group at K L University, India. He has published more than 80 research papers in international and national journals and conferences, as well as authored/edited 12 books and seven patents. His research interests include deep learning, data science, and quantum computing. </p>
<p><b>This desk reference handbook gives a hands-on experience on various algorithms and popular techniques used in real-time in data science to all researchers working in various domains. </b></p> <p>Data Science is one of the leading research-driven areas in the modern era. It is having a critical role in healthcare, engineering, education, mechatronics, and medical robotics. Building models and working with data is not value-neutral. We choose the problems with which we work, make assumptions in these models, and decide on metrics and algorithms for the problems. The data scientist identifies the problem which can be solved with data and expert tools of modeling and coding. <p> The book starts with introductory concepts in data science like data munging, data preparation, and transforming data. Chapter 2 discusses data visualization, drawing various plots and histograms. Chapter 3 covers mathematics and statistics for data science. Chapter 4 mainly focuses on machine learning algorithms in data science. Chapter 5 comprises of outlier analysis and DBSCAN algorithm. Chapter 6 focuses on clustering. Chapter 7 discusses network analysis. Chapter 8 mainly focuses on regression and naive-bayes classifier. Chapter 9 covers web-based data visualizations with Plotly. Chapter 10 discusses web scraping. <p> The book concludes with a section discussing 19 projects on various subjects in data science. <p><b>Audience </b> <p> The handbook will be used by graduate students up to research scholars in computer science and electrical engineering as well as industry professionals in a range of industries such as healthcare.

Diese Produkte könnten Sie auch interessieren:

Impact of Artificial Intelligence on Organizational Transformation
Impact of Artificial Intelligence on Organizational Transformation
von: S. Balamurugan, Sonal Pathak, Anupriya Jain, Sachin Gupta, Sachin Sharma, Sonia Duggal
EPUB ebook
190,99 €
The CISO Evolution
The CISO Evolution
von: Matthew K. Sharp, Kyriakos Lambros
PDF ebook
33,99 €