Details

AWS Certified Data Analytics Study Guide


AWS Certified Data Analytics Study Guide

Specialty (DAS-C01) Exam
1. Aufl.

von: Asif Abbasi

38,99 €

Verlag: Wiley
Format: EPUB
Veröffentl.: 01.12.2020
ISBN/EAN: 9781119649458
Sprache: englisch
Anzahl Seiten: 416

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p><b>Move your career forward with AWS certification! Prepare for the AWS Certified Data Analytics Specialty Exam with this thorough study guide</b></p> <p>This comprehensive study guide will help assess your technical skills and prepare for the updated AWS Certified Data Analytics exam. Earning this AWS certification will confirm your expertise in designing and implementing AWS services to derive value from data. The <i>AWS Certified Data Analytics Study Guide: Specialty (DAS-C01) Exam</i> is designed for business analysts and IT professionals who perform complex Big Data analyses.</p> <p>This AWS Specialty Exam guide gets you ready for certification testing with expert content, real-world knowledge, key exam concepts, and topic reviews. Gain confidence by studying the subject areas and working through the practice questions. Big data concepts covered in the guide include:</p> <ul> <li>Collection</li> <li>Storage</li> <li>Processing</li> <li>Analysis</li> <li>Visualization</li> <li>Data security</li> </ul> <p>AWS certifications allow professionals to demonstrate skills related to leading Amazon Web Services technology. The AWS Certified Data Analytics Specialty (DAS-C01) Exam specifically evaluates your ability to design and maintain Big Data, leverage tools to automate data analysis, and implement AWS Big Data services according to architectural best practices. An exam study guide can help you feel more prepared about taking an AWS certification test and advancing your professional career. In addition to the guide’s content, you’ll have access to an online learning environment and test bank that offers practice exams, a glossary, and electronic flashcards.</p>
<p>Introduction xxi</p> <p>Assessment Test xxx</p> <p><b>Chapter 1 History of Analytics and Big Data 1</b></p> <p>Evolution of Analytics Architecture Over the Years 3</p> <p>The New World Order 5</p> <p>Analytics Pipeline 6</p> <p>Data Sources 7</p> <p>Collection 8</p> <p>Storage 8</p> <p>Processing and Analysis 9</p> <p>Visualization, Predictive and Prescriptive Analytics 9</p> <p>The Big Data Reference Architecture 10</p> <p>Data Characteristics: Hot, Warm, and Cold 11</p> <p>Collection/Ingest 12</p> <p>Storage 13</p> <p>Process/Analyze 14</p> <p>Consumption 15</p> <p>Data Lakes and Their Relevance in Analytics 16</p> <p>What is a Data Lake? 16</p> <p>Building a Data Lake on AWS 19</p> <p>Step 1: Choosing the Right Storage – Amazon S3</p> <p>Is the Base 19</p> <p>Step 2: Data Ingestion – Moving the Data into</p> <p>the Data Lake 21</p> <p>Step 3: Cleanse, Prep, and Catalog the Data 22</p> <p>Step 4: Secure the Data and Metadata 23</p> <p>Step 5: Make Data Available for Analytics 23</p> <p>Using Lake Formation to Build a Data Lake on AWS 23</p> <p>Exam Objectives 24</p> <p>Objective Map 25</p> <p>Assessment Test 27</p> <p>References 29</p> <p><b>Chapter 2 Data Collection 31</b></p> <p>Exam Objectives 32</p> <p>AWS IoT 33</p> <p>Common Use Cases for AWS IoT 35</p> <p>How AWS IoT Works 36</p> <p>Amazon Kinesis 38</p> <p>Amazon Kinesis Introduction 40</p> <p>Amazon Kinesis Data Streams 40</p> <p>Amazon Kinesis Data Analytics 54</p> <p>Amazon Kinesis Video Streams 61</p> <p>AWS Glue 64</p> <p>Glue Data Catalog 66</p> <p>Glue Crawlers 68</p> <p>Authoring ETL Jobs 69</p> <p>Executing ETL Jobs 71</p> <p>Change Data Capture with Glue Bookmarks 71</p> <p>Use Cases for AWS Glue 72</p> <p>Amazon SQS 72</p> <p>Amazon Data Migration Service 74</p> <p>What is AWS DMS Anyway? 74</p> <p>What Does AWS DMS Support? 75</p> <p>AWS Data Pipeline 77</p> <p>Pipeline Definition 77</p> <p>Pipeline Schedules 78</p> <p>Task Runner 79</p> <p>Large-Scale Data Transfer Solutions 81</p> <p>AWS Snowcone 81</p> <p>AWS Snowball 82</p> <p>AWS Snowmobile 85</p> <p>AWS Direct Connect 86</p> <p>Summary 87</p> <p>Review Questions 88</p> <p>References 90</p> <p>Exercises & Workshops 91</p> <p><b>Chapter 3 Data Storage 93</b></p> <p>Introduction 94</p> <p>Amazon S3 95</p> <p>Amazon S3 Data Consistency Model 96</p> <p>Data Lake and S3 97</p> <p>Data Replication in Amazon S3 100</p> <p>Server Access Logging in Amazon S3 101</p> <p>Partitioning, Compression, and File Formats on S3 101</p> <p>Amazon S3 Glacier 103</p> <p>Vault 103</p> <p>Archive 104</p> <p>Amazon DynamoDB 104</p> <p>Amazon DynamoDB Data Types 105</p> <p>Amazon DynamoDB Core Concepts 108</p> <p>Read/Write Capacity Mode in DynamoDB 108</p> <p>DynamoDB Auto Scaling and Reserved Capacity 111</p> <p>Read Consistency and Global Tables 111</p> <p>Amazon DynamoDB: Indexing and Partitioning 113</p> <p>Amazon DynamoDB Accelerator 114</p> <p>Amazon DynamoDB Streams 115</p> <p>Amazon DynamoDB Streams – Kinesis Adapter 116</p> <p>Amazon DocumentDB 117</p> <p>Why a Document Database? 117</p> <p>Amazon DocumentDB Overview 119</p> <p>Amazon Document DB Architecture 120</p> <p>Amazon DocumentDB Interfaces 120</p> <p>Graph Databases and Amazon Neptune 121</p> <p>Amazon Neptune Overview 122</p> <p>Amazon Neptune Use Cases 123</p> <p>Storage Gateway 123</p> <p>Hybrid Storage Requirements 123</p> <p>AWS Storage Gateway 125</p> <p>Amazon EFS 127</p> <p>Amazon EFS Use Cases 130</p> <p>Interacting with Amazon EFS 132</p> <p>Amazon EFS Security Model 132</p> <p>Backing Up Amazon EFS 132</p> <p>Amazon FSx for Lustre 133</p> <p>Key Benefits of Amazon FSx for Lustre 134</p> <p>Use Cases for Lustre 135</p> <p>AWS Transfer for SFTP 135</p> <p>Summary 136</p> <p>Exercises 137</p> <p>Review Questions 140</p> <p>Further Reading 142</p> <p>References 142</p> <p><b>Chapter 4 Data Processing and Analysis 143</b></p> <p>Introduction 144</p> <p>Types of Analytical Workloads 144</p> <p>Amazon Athena 146</p> <p>Apache Presto 147</p> <p>Apache Hive 148</p> <p>Amazon Athena Use Cases and Workloads 149</p> <p>Amazon Athena DDL, DML, and DCL 150</p> <p>Amazon Athena Workgroups 151</p> <p>Amazon Athena Federated Query 153</p> <p>Amazon Athena Custom UDFs 154</p> <p>Using Machine Learning with Amazon Athena 154</p> <p>Amazon EMR 155</p> <p>Apache Hadoop Overview 156</p> <p>Amazon EMR Overview 157</p> <p>Apache Hadoop on Amazon EMR 158</p> <p>EMRFS 166</p> <p>Bootstrap Actions and Custom AMI 167</p> <p>Security on EMR 167</p> <p>EMR Notebooks 168</p> <p>Apache Hive and Apache Pig on Amazon EMR 169</p> <p>Apache Spark on Amazon EMR 174</p> <p>Apache HBase on Amazon EMR 182</p> <p>Apache Flink, Apache Mahout, and Apache MXNet 184</p> <p>Choosing the Right Analytics Tool 186</p> <p>Amazon Elasticsearch Service 188</p> <p>When to Use Elasticsearch 188</p> <p>Elasticsearch Core Concepts (the ELK Stack) 189</p> <p>Amazon Elasticsearch Service 191</p> <p>Amazon Redshift 192</p> <p>What is Data Warehousing? 192</p> <p>What is Redshift? 193</p> <p>Redshift Architecture 195</p> <p>Redshift AQUA 198</p> <p>Redshift Scalability 199</p> <p>Data Modeling in Redshift 205</p> <p>Data Loading and Unloading 213</p> <p>Query Optimization in Redshift 217</p> <p>Security in Redshift 221</p> <p>Kinesis Data Analytics 225</p> <p>How Does It Work? 226</p> <p>What is Kinesis Data Analytics for Java? 228</p> <p>Comparing Batch Processing Services 229</p> <p>Comparing Orchestration Options on AWS 230</p> <p>AWS Step Functions 230</p> <p>Comparing Different ETL Orchestration Options 230</p> <p>Summary 231</p> <p>Exam Essentials 232</p> <p>Exercises 232</p> <p>Review Questions 235</p> <p>References 237</p> <p>Recommended Workshops 237</p> <p>Amazon Athena Blogs 238</p> <p>Amazon Redshift Blogs 240</p> <p>Amazon EMR Blogs 241</p> <p>Amazon Elasticsearch Blog 241</p> <p>Amazon Redshift References and Further Reading 242</p> <p><b>Chapter 5 Data Visualization 243</b></p> <p>Introduction 244</p> <p>Data Consumers 245</p> <p>Data Visualization Options 246</p> <p>Amazon QuickSight 247</p> <p>Getting Started 248</p> <p>Working with Data 250</p> <p>Data Preparation 255</p> <p>Data Analysis 256</p> <p>Data Visualization 258</p> <p>Machine Learning Insights 261</p> <p>Building Dashboards 262</p> <p>Embedding QuickSight Objects into Other Applications 264</p> <p>Administration 265</p> <p>Security 266</p> <p>Other Visualization Options 267</p> <p>Predictive Analytics 270</p> <p>What is Predictive Analytics? 270</p> <p>The AWS ML Stack 271</p> <p>Summary 273</p> <p>Exam Essentials 273</p> <p>Exercises 274</p> <p>Review Questions 275</p> <p>References 276</p> <p>Additional Reading Material 276</p> <p><b>Chapter 6 Data Security 279</b></p> <p>Introduction 280</p> <p>Shared Responsibility Model 280</p> <p>Security Services on AWS 282</p> <p>AWS IAM Overview 285</p> <p>IAM User 285</p> <p>IAM Groups 286</p> <p>IAM Roles 287</p> <p>Amazon EMR Security 289</p> <p>Public Subnet 290</p> <p>Private Subnet 291</p> <p>Security Configurations 293</p> <p>Block Public Access 298</p> <p>VPC Subnets 298</p> <p>Security Options during Cluster Creation 299</p> <p>EMR Security Summary 300</p> <p>Amazon S3 Security 301</p> <p>Managing Access to Data in Amazon S3 301</p> <p>Data Protection in Amazon S3 305</p> <p>Logging and Monitoring with Amazon S3 306</p> <p>Best Practices for Security on Amazon S3 308</p> <p>Amazon Athena Security 308</p> <p>Managing Access to Amazon Athena 309</p> <p>Data Protection in Amazon Athena 310</p> <p>Data Encryption in Amazon Athena 311</p> <p>Amazon Athena and AWS Lake Formation 312</p> <p>Amazon Redshift Security 312</p> <p>Levels of Security within Amazon Redshift 313</p> <p>Data Protection in Amazon Redshift 315</p> <p>Redshift Auditing 316</p> <p>Redshift Logging 317</p> <p>Amazon Elasticsearch Security 317</p> <p>Elasticsearch Network Configuration 318</p> <p>VPC Access 318</p> <p>Accessing Amazon Elasticsearch and Kibana 319</p> <p>Data Protection in Amazon Elasticsearch 322</p> <p>Amazon Kinesis Security 325</p> <p>Managing Access to Amazon Kinesis 325</p> <p>Data Protection in Amazon Kinesis 326</p> <p>Amazon Kinesis Best Practices 326</p> <p>Amazon QuickSight Security 327</p> <p>Managing Data Access with Amazon QuickSight 327</p> <p>Data Protection 328</p> <p>Logging and Monitoring 329</p> <p>Security Best Practices 329</p> <p>Amazon DynamoDB Security 329</p> <p>Access Management in DynamoDB 329</p> <p>IAM Policy with Fine-Grained Access Control 330</p> <p>Identity Federation 331</p> <p>How to Access Amazon DynamoDB 332</p> <p>Data Protection with DynamoDB 332</p> <p>Monitoring and Logging with DynamoDB 333</p> <p>Summary 334</p> <p>Exam Essentials 334</p> <p>Exercises/Workshops 334</p> <p>Review Questions 336</p> <p>References and Further Reading 337</p> <p><b>Appendix Answers to Review Questions 339</b></p> <p>Chapter 1: History of Analytics and Big Data 340</p> <p>Chapter 2: Data Collection 342</p> <p>Chapter 3: Data Storage 343</p> <p>Chapter 4: Data Processing and Analysis 344</p> <p>Chapter 5: Data Visualization 346</p> <p>Chapter 6: Data Security 346</p> <p>Index 349</p>
<p><b>ASIF ABBASI</b> has over 20 years of experience working in various Data & Analytics engineering, consulting and advisory roles with some of the largest customers across the globe to help them in their quest to become more data driven. Asif is the author of Learning Apache Spark 2.0 and is an AWS Certified Data Analytics & Machine Learning Specialist, AWS Certified Solutions Architect (Professional), Hortonworks Certified Hadoop Professional and Administrator, Certified Spark Developer, SAS Certified Predictive Modeler, and Sun Certified Enterprise Architect. Asif is also a Project Management Professional.</p>
<p><b>Includes one year of FREE access after activation to the interactive online learning environment and study tools:</b> <ul> <li><b>2 custom practice exams</b></li> <li><b>100 electronic flashcards</b></li> <li><b>Searchable key term glossary</b></li> </ul> <p><b>Your complete Guide to Preparing for the AWS Certified Data Analytics exam</b> <p>The AWS Certified Data Analytics Study Guide is your one-stop resource for understanding the necessary skills and responsibilities of working in a data analytics-focused job role. This Sybex Study Guide also provides complete coverage of the Specialty (DAS-C01) Exam objectives. Prepare for the exam smarter and faster with Sybex thanks to efficient and accurate content including assessment tests that validate and measure exam readiness, practical exercises, and challenging chapter review questions. Reinforce and retain what you've learned with the Sybex online learning environment and test bank, accessible across multiple devices. Get prepared for the AWS Certified Data Analytics exam with Sybex. <p><b>Coverage of 100% of all exam objectives in this</b> <b><i>Study Guide</i></b><b> means you'll be ready for:</b> <ul> <li>Collection</li> <li>Storage and Data Management</li> <li>Processing</li> <li>Analysis and Visualization</li> <li>Security</li> </ul> <p><b>Interactive learning environment</b> <p>Take your exam prep to the next level with Sybex's superior interactive online study tools. To access our learning environment, simply visit <b>www.wiley.com/go/sybextestprep</b>, register your book to receive your unique PIN, and instantly gain one year of FREE access after activation to: <ul> <b><li>Interactive test bank</b> with 2 practice exams to help you identify areas where further review is needed. Get more than 90% of the answers correct, and you're ready to take the certification exam.</li> <b><li>100 electronic flashcards</b> to reinforce learning and last-minute prep before the exam.</li> <b><li>Comprehensive glossary</b> in PDF format gives you instant access to the key terms so you are fully prepared.</li> </ul> <p><b>ABOUT THE AWS DATA ANALYTICS PROGRAM</b> <p>The AWS Data Analytics certification is the ideal credential for those with comprehensive understanding of using AWS services to design, build, secure, and maintain analytics solutions that provide insight from data. Visit <b>https://aws.amazon.com/certification/</b> for more information.

Diese Produkte könnten Sie auch interessieren:

Symbian OS Explained
Symbian OS Explained
von: Jo Stichbury
PDF ebook
32,99 €
Symbian OS Internals
Symbian OS Internals
von: Jane Sales
PDF ebook
56,99 €
Parallel Combinatorial Optimization
Parallel Combinatorial Optimization
von: El-Ghazali Talbi
PDF ebook
120,99 €