Details

The Informed Company


How to Build Modern Agile Data Stacks that Drive Winning Insights
1st ed.

by: Dave Fowler, Matthew C. David

17,99 €

Publisher: Wiley
Format: EPUB
Published: 22 October 2021
ISBN/EAN: 9781119748014
Language: English
Pages: 256

DRM-protected eBook; to read it you need, for example, Adobe Digital Editions and an Adobe ID.

Descriptions

<p><b>Learn how to manage a modern data stack and get the most out of data in your organization!</b></p> <p>Thanks to the emergence of new technologies and the explosion of data in recent years, we need new practices for managing and getting value out of data. In the modern, data-driven competitive landscape, the "best guess" approach (reading blog posts here and there and patching together data practices without any real visibility) is no longer going to hack it. <i>The Informed Company</i> provides definitive direction on how best to leverage the modern data stack, including cloud computing, columnar storage, cloud ETL tools, and cloud BI tools. You'll learn how to work with Agile methods and set up processes that are right for your company, so you can use your data as a key weapon for your success. You'll discover best practices for every stage, from querying production databases at a small startup all the way to setting up data marts for different business lines of an enterprise.</p> <p>In their work at Chartio, authors Fowler and David have learned that most businesspeople are almost completely self-taught when it comes to data. If they are using resources, those resources are outdated, so they're missing out on the latest cloud technologies and advances in data analytics.
This book will firm up your understanding of data and bring you into the present with knowledge of what works and what doesn't.</p> <ul> <li>Discover the data stack strategies that are working for today's successful small, medium, and enterprise companies</li> <li>Learn the different Agile stages of data organization, and the right one for your team</li> <li>Learn how to maintain Data Lakes and Data Warehouses for effective, accessible data storage</li> <li>Gain the knowledge you need to architect Data Warehouses and Data Marts</li> <li>Understand your business's level of data sophistication and the steps you can take to "level up" your data</li> </ul> <p><i>The Informed Company</i> is the definitive data book for anyone who wants to work faster and more nimbly, armed with actionable decision-making data.</p>
<p>About This Book xiii</p> <p>Foreword xxi</p> <p>Introduction xxv</p> <p><b>Stage 1 Source (aka Siloed Data) 1</b></p> <p><b>Chapter 1 Starting with Source Data 3</b></p> <p>Common Options for Analyzing Source Data 4</p> <p><b>Chapter 2 The Need to Replicate Source Data 11</b></p> <p>Replicate Sources 12</p> <p>Create Read-Only Access 14</p> <p><b>Chapter 3 Source Data Best Practices 15</b></p> <p>Keep a Complexity Wiki Page 15</p> <p>Snippet Dictionary 16</p> <p>Use a BI Product 17</p> <p>Double Check Results 18</p> <p>Keep Short Dashboards 19</p> <p>Design Before Building 20</p> <p><b>Stage 2 Data Lake (aka Data Combined) 23</b></p> <p><b>Chapter 4 Why Build a Data Lake? 25</b></p> <p>What Is a Data Lake? 26</p> <p>Reasons to Build a Data Lake Summarized 27</p> <p><b>Chapter 5 Choosing an Engine for the Data Lake 33</b></p> <p>Modern Columnar Warehouse Engines 35</p> <p>Modern Warehouse Engine Products 38</p> <p>Database Engines 41</p> <p>Recommendation 42</p> <p><b>Chapter 6 Extract and Load (EL) Data 45</b></p> <p>ETL versus ELT 46</p> <p>EL/ETL Vendors 48</p> <p>Extract Options 49</p> <p>Load Options 51</p> <p>Multiple Schemas 52</p> <p>Other Extract and Load Routes 53</p> <p><b>Chapter 7 Data Lake Security 55</b></p> <p>Access in Central Place 56</p> <p>Permission Tiers 57</p> <p><b>Chapter 8 Data Lake Maintenance 59</b></p> <p>Why SQL? 
60</p> <p>Data Sources 61</p> <p>Performance 64</p> <p>Upgrade Snippets to Views 68</p> <p><b>Stage 3 Data Warehouse (aka the Single Source of Truth) 69</b></p> <p><b>Chapter 9 The Power of Layers and Views 75</b></p> <p>Make Readable Views 77</p> <p>Layer Views on Views 78</p> <p>Start with a Single View 81</p> <p><b>Chapter 10 Staging Schemas 83</b></p> <p>Orient to the Schemas 84</p> <p>Pick a Table and Clean It 85</p> <p>Other Staging Modeling Considerations 98</p> <p>Building on Top of Staging Schemas 106</p> <p><b>Chapter 11 Model Data with dbt 111</b></p> <p>Version Control 111</p> <p>Modularity and Reusability 112</p> <p>Package Management 112</p> <p>Organizing Files 113</p> <p>Macros 113</p> <p>Incremental Tables 114</p> <p>Testing 115</p> <p><b>Chapter 12 Deploy Modeling Code 119</b></p> <p>Branch Using Version Control Software 119</p> <p>Commit Message 120</p> <p>Test Locally 120</p> <p>Code Review 121</p> <p>Schedule Runs 122</p> <p><b>Chapter 13 Implementing the Data Warehouse 123</b></p> <p>Manage Dependencies 124</p> <p>Combine Tables Within Schemas 126</p> <p>Combine Tables Across Schemas 128</p> <p>Keep the Grain Consistent 130</p> <p>Create Business Metrics 131</p> <p>Keeping Accurate History 133</p> <p><b>Chapter 14 Managing Data Access 135</b></p> <p>How to Secure Sensitive Data in the Data Warehouse 137</p> <p>How to Secure Sensitive Data in a BI Tool 140</p> <p><b>Chapter 15 Maintaining the Source of Truth 143</b></p> <p>Track New Metrics 144</p> <p>Deprecate Old Metrics 147</p> <p>Deprecate Old Schemas 149</p> <p>Resolve Conflicting Numbers 150</p> <p>Handling Ongoing Requests and Ongoing Feedback 151</p> <p>Updating Modeling Code 152</p> <p>Manage Access 153</p> <p>Tuning to Optimize 156</p> <p>Code Review All Modeling 157</p> <p>Maintenance Checklist 158</p> <p><b>Stage 4 Data Marts (aka Data Democratized) 161</b></p> <p><b>Chapter 16 Data Mart Implementation 167</b></p> <p>Views on the Data Warehouse 167</p> <p>Segment Tables 168</p> 
<p>Access Update 169</p> <p><b>Chapter 17 Data Mart Maintenance 171</b></p> <p>Educate Team 172</p> <p>Identifies Issues 172</p> <p>Identify New Needs 176</p> <p>Help Track Success 176</p> <p><b>Chapter 18 Modern versus Traditional Data Stacks: What’s Changed? 177</b></p> <p>What’s Changed? 177</p> <p><b>Chapter 19 Row- versus Column-Oriented Database 181</b></p> <p>Row-Oriented Databases 182</p> <p>Column-Oriented Databases 184</p> <p>Summary 190</p> <p><b>Chapter 20 Style Guide Example 191</b></p> <p>Simplify 192</p> <p>Clean 194</p> <p>Naming Conventions 195</p> <p>Share It 197</p> <p><b>Chapter 21 Building an SST Example 199</b></p> <p>First Attempt: Same Tables with Prefixes 199</p> <p>Second Attempt: Operational Schema (Source Agnostic) 205</p> <p>Third Attempt: Application Separate, Other Sources Smashed 207</p> <p>Less Planning, More Implementing 209</p> <p>Acknowledgments and Contributions 211</p> <p>Index 213</p>
<p><b>MATT DAVID</b> is the Product Marketing Manager for Platform Data at Atlassian. He formerly worked at Chartio as the Head of Data and before that at Udacity as Product Lead for the School of Data Science.</p> <p><b>DAVE FOWLER</b> is Head of Analytics and Visualization at Atlassian and Founder of Chartio. He has worked in business intelligence for over ten years. His professional focus is on enabling anyone and everyone to explore and understand their data.</p>
<p>“<i>The Informed Company</i> is a lucid, pragmatic explanation of how to use data in a modern organization.”</p> <p><b>George Fraser,</b> CEO, Fivetran</p> <p><b>Discover how to manage data in the real world with modern data practices</b></p> <p>In <i>The Informed Company,</i> data professionals and authors Matt David and Dave Fowler deliver modern strategies and techniques for managing an up-to-date data stack that includes cloud computing, columnar storage, cloud ETL tools, and cloud BI tools. You’ll discover the processes that help you get the most out of your data and best practices for every part of the data exploration process. The authors eschew legacy practices that are inappropriate for the amount and type of data that modern companies deal with and embrace modern data stack technologies.</p> <p>The book teaches data stack strategies for small, medium, and enterprise companies, how to maintain data lakes and data warehouses, how to architect data warehouses and data marts, and how to understand your business’s current level of data sophistication. You’ll discover how to create faster and more nimble companies that are armed with actionable decision-making data when they need it.</p>
