Details

Computational Analysis of Communication


Computational Analysis of Communication


1. Aufl.

von: Wouter van Atteveldt, Damian Trilling, Carlos Arcila Calderon

38,99 €

Verlag: Wiley-Blackwell
Format: PDF
Veröffentl.: 10.03.2022
ISBN/EAN: 9781119680277
Sprache: englisch
Anzahl Seiten: 336

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p><b>Provides clear guidance on leveraging computational techniques to answer social science questions</b></p> <p>In disciplines such as political science, sociology, psychology, and media studies, the use of computational analysis is rapidly increasing. Statistical modeling, machine learning, and other computational techniques are revolutionizing the way electoral results are predicted, social sentiment is measured, consumer interest is evaluated, and much more. <i>Computational Analysis of Communication</i> teaches social science students and practitioners how computational methods can be used in a broad range of applications, providing discipline-relevant examples, clear explanations, and practical guidance.</p> <p>Assuming little or no background in data science or computer linguistics, this accessible textbook teaches readers how to use state-of-the art computational methods to perform data-driven analyses of social science issues. A cross-disciplinary team of authors—with expertise in both the social sciences and computer science—explains how to gather and clean data, manage textual, audio-visual, and network data, conduct statistical and quantitative analysis, and interpret, summarize, and visualize the results. Offered in a unique hybrid format that integrates print, ebook, and open-access online viewing, this innovative resource:</p> <ul> <li>Covers the essential skills for social sciences courses on big data, data visualization, text analysis, predictive analytics, and others</li> <li>Integrates theory, methods, and tools to provide unified approach to the subject</li> <li>Includes sample code in Python and links to actual research questions and cases from social science and communication studies</li> <li>Discusses ethical and normative issues relevant to privacy, data ownership, and reproducible social science</li> <li>Developed in partnership with the International Communication Association and by the editors of Computational Communication Research</li> </ul> <p><i>Computational Analysis of Communication</i> is an invaluable textbook and reference for students taking computational methods courses in social sciences, and for professional social scientists looking to incorporate computational methods into their work.</p>
<p>Preface xi</p> <p>Acknowledgments xiii</p> <p><b>1 Introduction </b><b>1</b></p> <p>1.1 The Role of Computational Analysis in the Social Sciences 1</p> <p>1.2 Why Python and/or R? 3</p> <p>1.3 How to Use This Book 4</p> <p>1.4 Installing R and Python 5</p> <p>1.4.1 Installing R and RStudio 7</p> <p>1.4.2 Installing Python and Jupyter Notebook 9</p> <p>1.5 Installing Third-Party Packages 12</p> <p><b>2 Getting Started: Fun with Data and Visualizations </b><b>13</b></p> <p>2.1 Fun With Tweets 14</p> <p>2.2 Fun With Textual Data 15</p> <p>2.3 Fun With Visualizing Geographic Information 17</p> <p>2.4 Fun With Networks 19</p> <p><b>3 Programming Concepts for Data Analysis </b><b>23</b></p> <p>3.1 About Objects and Data Types 24</p> <p>3.1.1 Storing Single Values: Integers, Floating-Point Numbers, Booleans 25</p> <p>3.1.2 Storing Text 26</p> <p>3.1.3 Combining Multiple Values: Lists, Vectors, And Friends 28</p> <p>3.1.4 Dictionaries 32</p> <p>3.1.5 From One to More Dimensions: Matrices and n-Dimensional Arrays 33</p> <p>3.1.6 Making Life Easier: Data Frames 34</p> <p>3.2 Simple Control Structures: Loops and Conditions 35</p> <p>3.2.1 Loops 36</p> <p>3.2.2 Conditional Statements 37</p> <p>3.3 Functions and Methods 39</p> <p><b>4 How to Write Code </b><b>43</b></p> <p>4.1 Re-using Code: How Not to Re-Invent the Wheel 43</p> <p>4.2 Understanding Errors and Getting Help 46</p> <p>4.2.1 Error Messages 46</p> <p>4.2.2 Debugging Strategies 48</p> <p>4.3 Best Practice: Beautiful Code, GitHub, and Notebooks 49</p> <p><b>5 From File to Data Frame and Back </b><b>55</b></p> <p>5.1 Why and When Do We Use Data Frames? 56</p> <p>5.2 Reading and Saving Data 57</p> <p>5.2.1 The Role of Files 57</p> <p>5.2.2 Encodings and Dialects 59</p> <p>5.2.3 File Handling Beyond Data Frames 61</p> <p>5.3 Data from Online Sources 62</p> <p><b>6 Data Wrangling </b><b>65</b></p> <p>6.1 Filtering, Selecting, and Renaming 66</p> <p>6.2 Calculating Values 67</p> <p>6.3 Grouping and Aggregating 69</p> <p>6.3.1 Combining Multiple Operations 70</p> <p>6.3.2 Adding Summary Values 71</p> <p>6.4 Merging Data 72</p> <p>6.4.1 Equal Units of Analysis 72</p> <p>6.4.2 Inner and Outer Joins 75</p> <p>6.4.3 Nested Data 76</p> <p>6.5 Reshaping Data: Wide To Long And Long To Wide 78</p> <p>6.6 Restructuring Messy Data 79</p> <p><b>7 Exploratory Data Analysis </b><b>83</b></p> <p>7.1 Simple Exploratory Data Analysis 84</p> <p>7.2 Visualizing Data 87</p> <p>7.2.1 Plotting Frequencies and Distributions 88</p> <p>7.2.2 Plotting Relationships 92</p> <p>7.2.3 Plotting Geospatial Data 98</p> <p>7.2.4 Other Possibilities 99</p> <p>7.3 Clustering and Dimensionality Reduction 100</p> <p>7.3.1 k-means Clustering 101</p> <p>7.3.2 Hierarchical Clustering 102</p> <p>7.3.3 Principal Component Analysis and Singular Value Decomposition 106</p> <p><b>8 Statistical Modeling and Supervised Machine Learning </b><b>113</b></p> <p>8.1 Statistical Modeling and Prediction 115</p> <p>8.2 Concepts and Principles 117</p> <p>8.3 Classical Machine Learning: From Naïve Bayes to Neural Networks 122</p> <p>8.3.1 Naïve Bayes 122</p> <p>8.3.2 Logistic Regression 124</p> <p>8.3.3 Support Vector Machines 125</p> <p>8.3.4 Decision Trees and Random Forests 127</p> <p>8.3.5 Neural Networks 129</p> <p>8.4 Deep Learning 130</p> <p>8.4.1 Convolutional Neural Networks 131</p> <p>8.5 Validation and Best Practices 133</p> <p>8.5.1 Finding a Balance Between Precision and Recall 133</p> <p>8.5.2 Train, Validate, Test 137</p> <p>8.5.3 Cross-validation and Grid Search 138</p> <p><b>9 Processing Text </b><b>141</b></p> <p>9.1 Text as a String of Characters 142</p> <p>9.1.1 Methods for Dealing With Text 144</p> <p>9.2 Regular Expressions 145</p> <p>9.2.1 Regular Expression Syntax 146</p> <p>9.2.2 Example Patterns 147</p> <p>9.3 Using Regular Expressions in Python and R 150</p> <p>9.3.1 Splitting and Joining Strings, and Extracting Multiple Matches 151</p> <p><b>10 Text as Data </b><b>155</b></p> <p>10.1 The Bag of Words and the Term-Document Matrix 156</p> <p>10.1.1 Tokenization 157</p> <p>10.1.2 The DTM as a Sparse Matrix 159</p> <p>10.1.3 The DTM as a “Bag of Words” 162</p> <p>10.1.4 The (Unavoidable) Word Cloud 163</p> <p>10.2 Weighting and Selecting Documents and Terms 164</p> <p>10.2.1 Removing stop words 165</p> <p>10.2.2 Removing Punctuation and Noise 167</p> <p>10.2.3 Trimming a DTM 170</p> <p>10.2.4 Weighting a DTM 171</p> <p>10.3 Advanced Representation of Text 172</p> <p>10.3.1 n-grams 173</p> <p>10.2.3 Collocations 174</p> <p>10.3.3 Word Embeddings 176</p> <p>10.3.4 Linguistic Preprocessing 177</p> <p>10.4 Which Preprocessing to Use? 182</p> <p><b>11 Automatic Analysis of Text </b><b>184</b></p> <p>11.1 Deciding on the Right Method 185</p> <p>11.2 Obtaining a Review Dataset 187</p> <p>11.3 Dictionary Approaches to Text Analysis 189</p> <p>11.4 Supervised Text Analysis: Automatic Classification and Sentiment Analysis 191</p> <p>11.4.1 Putting Together a Workflow 191</p> <p>11.4.2 Finding the Best Classifier 194</p> <p>11.4.3 Using the Model 198</p> <p>11.4.4 Deep Learning 199</p> <p>11.5 Unsupervised Text Analysis: Topic Modeling 203</p> <p>11.5.1 Latent Dirichlet Allocation (LDA) 203</p> <p>11.5.2 Fitting an LDA Model 206</p> <p>11.5.3 Analyzing Topic Model Results 207</p> <p>11.5.4 Validating and Inspecting Topic Models 208</p> <p>11.5.5 Beyond LDA 209</p> <p><b>12 Scraping Online Data </b><b>212</b></p> <p>12.1 Using Web APIs: From Open Resources to Twitter 213</p> <p>12.2 Retrieving and Parsing Web Pages 219</p> <p>12.2.1 Retrieving and Parsing an HTML Page 219</p> <p>12.2.2 Crawling Websites 223</p> <p>12.2.3 Dynamic Web Pages 225</p> <p>12.3 Authentication, Cookies, and Sessions 228</p> <p>12.3.1 Authentication and APIs 228</p> <p>12.3.2 Authentication and Webpages 229</p> <p>12.4 Ethical, Legal, and Practical Considerations 230</p> <p><b>13 Network Data </b><b>233</b></p> <p>13.1 Representing and Visualizing Networks 234</p> <p>13.2 Social Network Analysis 241</p> <p>13.2.1 Paths and Reachability 242</p> <p>13.2.2 Centrality Measures 246</p> <p>13.2.3 Clustering and Community Detection 248</p> <p><b>14 Multimedia Data </b><b>258</b></p> <p>14.1 Beyond Text Analysis: Images, Audio and Video 259</p> <p>14.2 Using Existing Libraries and APIs 261</p> <p>14.3 Storing, Representing, and Converting Images 263</p> <p>14.4 Image Classification 270</p> <p>14.4.1 Basic Classification with Shallow Algorithms 272</p> <p>14.4.2 Deep Learning for Image Analysis 273</p> <p>14.4.3 Re-using an Open Source CNN 279</p> <p><b>15 Scaling Up and Distributing </b><b>283</b></p> <p>15.1 Storing Data in SQL and noSQL Databases 283</p> <p>15.1.1 When to Use a Database 283</p> <p>15.1.2 Choosing the Right Database 285</p> <p>15.1.3 A Brief Example Using SQLite 286</p> <p>15.2 Using Cloud Computing 286</p> <p>15.3 Publishing Your Source 290</p> <p>15.4 Distributing Your Software as Container 291</p> <p><b>16 Where to Go Next </b><b>293</b></p> <p>16.1 How Far Have We Come? 293</p> <p>16.2 Where To Go Next? 294</p> <p>16.3 Open, Transparent, and Ethical Computational Science 295</p> <p>Bibliography 297</p> <p>Index 303</p> <p> </p>
<p><b>Dr. Wouter van Atteveldt</b> is an Associate Professor of Political Communication at Vrije Universiteit, Amsterdam. He is co-founder of the Computational Methods division of the International Communication Association, and Founding Chief Editor of <i>Computational Communication Research.</i> He has published extensively on innovative methods for analyzing political text and contributed to a number of relevant R and Python packages.</p> <p><b>Dr. Damian Trilling </b>is an Associate Professor, Department of Communication Science, at the University of Amsterdam, and Associate Editor of <i>Computational Communication Research.</i> His research uses computational methods such as the analysis of digital trace data and large-scale text analysis to study the use and effects of news media. He has developed extensive teaching materials to introduce social scientists to the Python programming language. <p><b>Dr. Carlos Arcila Calderón</b> is an Associate Professor, Department of Sociology and Communication, at the University of Salamanca, Chief Editor of the journal Disertaciones, and member of the Editorial Board of <i>Computational Communication Research.</i> He has published extensively on new media and social media studies, and has led the prototype Autocop, a Spark-based environment to run distributed supervised sentiment analysis of Twitter messages.
<p><b>A practical introduction to the analysis of texts, networks, and images with code examples in Python and R</b></p> <p>In disciplines such as political science, sociology, psychology, communication science, and media studies, the use of computational analysis is rapidly increasing. Statistical modeling, machine learning, and other computational techniques are revolutionizing the way electoral results are predicted, social sentiment is measured, consumer interest is evaluated, and much more. <i>Computational Analysis of Communication</i> teaches social science students and practitioners how computational methods can be used in a broad range of applications, providing ­discipline-relevant examples, clear explanations, and practical guidance. <p>Assuming little or no background in data science or computational linguistics, this accessible textbook teaches readers how to use state-of-the-art computational methods to perform data-driven analyses of social science issues. A team of authors with expertise in both the social sciences and computer science explains how to gather and clean data, manage textual, audio-visual, and network data, conduct statistical and quantitative analysis, and interpret, summarize, and visualize the results. Offered in a unique hybrid format that integrates print, ebook, and open-access online viewing, this innovative resource: <ul><li>Covers the essential skills for social sciences courses on big data, data visualization, text analysis, predictive analytics, and others</li> <li>Integrates theory, methods, and tools to provide a unified approach to the subject</li> <li>Includes sample code in Python and R and links to actual research questions and cases from social science and communication studies</li> <li>Discusses ethical and normative issues relevant to privacy, data ownership, and reproducible social science</li></ul> <p><i>Computational Analysis of Communication </i>is an invaluable textbook and reference for students taking computational methods courses in social sciences, and for professional social scientists looking to incorporate computational methods into their work.

Diese Produkte könnten Sie auch interessieren:

Television Truths
Television Truths
von: John Hartley
PDF ebook
25,99 €
Culture-on-Demand
Culture-on-Demand
von: James Lull
PDF ebook
91,99 €
The Small Screen
The Small Screen
von: Brian L. Ott
PDF ebook
34,99 €