Smarter Data Science

Succeeding with Enterprise-Grade Data and AI Projects
1st ed.

by: Neal Fishman, Cole Stryker, Grady Booch

32,99 €

Publisher: Wiley
Format: EPUB
Published: 14 April 2020
ISBN/EAN: 9781119693420
Language: English
Number of pages: 304

DRM-protected eBook; to read it you need, for example, Adobe Digital Editions and an Adobe ID.


<p><b>Organizations can make data science a repeatable, predictable tool, which business professionals use to get more value from their data</b></p> <p>Enterprise data and AI projects are often scattershot, underbaked, siloed, and not adaptable to predictable business changes. As a result, the vast majority fail. These expensive quagmires can be avoided, and this book explains precisely how. </p> <p>Data science is emerging as a hands-on tool for not just data scientists, but business professionals as well. Managers, directors, IT leaders, and analysts must expand their use of data science capabilities for the organization to stay competitive. <i>Smarter Data Science</i> helps them achieve their enterprise-grade data projects and AI goals. It serves as a guide to building a robust and comprehensive information architecture program that enables sustainable and scalable AI deployments.</p> <p>When an organization manages its data effectively, its data science program becomes a fully scalable function that’s both prescriptive and repeatable. With an understanding of data science principles, practitioners are also empowered to lead their organizations in establishing and deploying viable AI. They employ the tools of machine learning, deep learning, and AI to extract greater value from data for the benefit of the enterprise.</p> <p>By following a ladder framework that promotes prescriptive capabilities, organizations can make data science accessible to a range of team members, democratizing data science throughout the organization. 
Companies that collect, organize, and analyze data can move forward to additional data science achievements:</p> <ul> <li>Improving time-to-value with infused AI models for common use cases</li> <li>Optimizing knowledge work and business processes</li> <li>Utilizing AI-based business intelligence and data visualization</li> <li>Establishing a data topology to support general or highly specialized needs</li> <li>Successfully completing AI projects in a predictable manner</li> <li>Coordinating the use of AI from any compute node, from inner edges to outer edges: cloud, fog, and mist computing</li> </ul> <p>When they climb the ladder presented in this book, businesspeople and data scientists alike will be able to improve and foster repeatable capabilities. They will have the knowledge to maximize their AI and data assets for the benefit of their organizations.</p>
<p>Foreword for <i>Smarter Data Science </i>xix</p> <p>Epigraph xxi</p> <p>Preamble xxiii</p> <p><b>Chapter 1 Climbing the AI Ladder 1</b></p> <p>Readying Data for AI 2</p> <p>Technology Focus Areas 3</p> <p>Taking the Ladder Rung by Rung 4</p> <p>Constantly Adapt to Retain Organizational Relevance 8</p> <p>Data-Based Reasoning is Part and Parcel in the Modern Business 10</p> <p>Toward the AI-Centric Organization 14</p> <p>Summary 16</p> <p><b>Chapter 2 Framing Part I: Considerations for Organizations Using AI 17</b></p> <p>Data-Driven Decision-Making 18</p> <p>Using Interrogatives to Gain Insight 19</p> <p>The Trust Matrix 20</p> <p>The Importance of Metrics and Human Insight 22</p> <p>Democratizing Data and Data Science 23</p> <p>Aye, a Prerequisite: Organizing Data Must Be a Forethought 26</p> <p>Preventing Design Pitfalls 27</p> <p>Facilitating the Winds of Change: How Organized Data Facilitates Reaction Time 29</p> <p><i>Quae Quaestio </i>(Question Everything) 30</p> <p>Summary 32</p> <p><b>Chapter 3 Framing Part II: Considerations for Working with Data and AI 35</b></p> <p>Personalizing the Data Experience for Every User 36</p> <p>Context Counts: Choosing the Right Way to Display Data 38</p> <p>Ethnography: Improving Understanding Through Specialized Data 42</p> <p>Data Governance and Data Quality 43</p> <p>The Value of Decomposing Data 43</p> <p>Providing Structure Through Data Governance 43</p> <p>Curating Data for Training 45</p> <p>Additional Considerations for Creating Value 45</p> <p>Ontologies: A Means for Encapsulating Knowledge 46</p> <p>Fairness, Trust, and Transparency in AI Outcomes 49</p> <p>Accessible, Accurate, Curated, and Organized 52</p> <p>Summary 54</p> <p><b>Chapter 4 A Look Back on Analytics: More Than One Hammer 57</b></p> <p>Been Here Before: Reviewing the Enterprise Data Warehouse 57</p> <p>Drawbacks of the Traditional Data Warehouse 64</p> <p>Paradigm Shift 68</p> <p>Modern Analytical Environments: The Data Lake 69</p> <p>By Contrast 
71</p> <p>Indigenous Data 72</p> <p>Attributes of Difference 73</p> <p>Elements of the Data Lake 75</p> <p>The New Normal: Big Data is Now Normal Data 77</p> <p>Liberation from the Rigidity of a Single Data Model 78</p> <p>Streaming Data 78</p> <p>Suitable Tools for the Task 78</p> <p>Easier Accessibility 79</p> <p>Reducing Costs 79</p> <p>Scalability 79</p> <p>Data Management and Data Governance for AI 80</p> <p>Schema-on-Read vs. Schema-on-Write 81</p> <p>Summary 84</p> <p><b>Chapter 5 A Look Forward on Analytics: Not Everything Can Be a Nail 87</b></p> <p>A Need for Organization 87</p> <p>The Staging Zone 90</p> <p>The Raw Zone 91</p> <p>The Discovery and Exploration Zone 92</p> <p>The Aligned Zone 93</p> <p>The Harmonized Zone 98</p> <p>The Curated Zone 100</p> <p>Data Topologies 100</p> <p>Zone Map 103</p> <p>Data Pipelines 104</p> <p>Data Topography 105</p> <p>Expanding, Adding, Moving, and Removing Zones 107</p> <p>Enabling the Zones 108</p> <p>Ingestion 108</p> <p>Data Governance 111</p> <p>Data Storage and Retention 112</p> <p>Data Processing 114</p> <p>Data Access 116</p> <p>Management and Monitoring 117</p> <p>Metadata 118</p> <p>Summary 119</p> <p><b>Chapter 6 Addressing Operational Disciplines on the AI Ladder 121</b></p> <p>A Passage of Time 122</p> <p>Create 128</p> <p>Stability 128</p> <p>Barriers 129</p> <p>Complexity 129</p> <p>Execute 130</p> <p>Ingestion 131</p> <p>Visibility 132</p> <p>Compliance 132</p> <p>Operate 133</p> <p>Quality 134</p> <p>Reliance 135</p> <p>Reusability 135</p> <p>The xOps Trifecta: DevOps/MLOps, DataOps, and AIOps 136</p> <p>DevOps/MLOps 137</p> <p>DataOps 139</p> <p>AIOps 142</p> <p>Summary 144</p> <p><b>Chapter 7 Maximizing the Use of Your Data: Being Value Driven 147</b></p> <p>Toward a Value Chain 148</p> <p>Chaining Through Correlation 152</p> <p>Enabling Action 154</p> <p>Expanding the Means to Act 155</p> <p>Curation 156</p> <p>Data Governance 159</p> <p>Integrated Data Management 162</p> <p>Onboarding 163</p> 
<p>Organizing 164</p> <p>Cataloging 166</p> <p>Metadata 167</p> <p>Preparing 168</p> <p>Provisioning 169</p> <p>Multi-Tenancy 170</p> <p>Summary 173</p> <p><b>Chapter 8 Valuing Data with Statistical Analysis and Enabling Meaningful Access 175</b></p> <p>Deriving Value: Managing Data as an Asset 175</p> <p>An Inexact Science 180</p> <p>Accessibility to Data: Not All Users are Equal 183</p> <p>Providing Self-Service to Data 184</p> <p>Access: The Importance of Adding Controls 186</p> <p>Ranking Datasets Using a Bottom-Up Approach for Data Governance 187</p> <p>How Various Industries Use Data and AI 188</p> <p>Benefiting from Statistics 189</p> <p>Summary 198</p> <p><b>Chapter 9 Constructing for the Long-Term 199</b></p> <p>The Need to Change Habits: Avoiding Hard-Coding 200</p> <p>Overloading 201</p> <p>Locked In 202</p> <p>Ownership and Decomposition 204</p> <p>Design to Avoid Change 204</p> <p>Extending the Value of Data Through AI 206</p> <p>Polyglot Persistence 208</p> <p>Benefiting from Data Literacy 213</p> <p>Understanding a Topic 215</p> <p>Skillsets 216</p> <p>It’s All Metadata 218</p> <p>The Right Data, in the Right Context, with the Right Interface 219</p> <p>Summary 221</p> <p><b>Chapter 10 A Journey’s End: An IA for AI 223</b></p> <p>Development Efforts for AI 224</p> <p>Essential Elements: Cloud-Based Computing, Data, and Analytics 228</p> <p>Intersections: Compute Capacity and Storage Capacity 234</p> <p>Analytic Intensity 237</p> <p>Interoperability Across the Elements 238</p> <p>Data Pipeline Flight Paths: Preflight, Inflight, Postflight 242</p> <p>Data Management for the Data Puddle, Data Pond, and Data Lake 243</p> <p>Driving Action: Context, Content, and Decision-Makers 245</p> <p>Keep It Simple 248</p> <p>The Silo is Dead; Long Live the Silo 250</p> <p>Taxonomy: Organizing Data Zones 252</p> <p>Capabilities for an Open Platform 256</p> <p>Summary 260</p> <p>Appendix Glossary of Terms 263</p> <p>Index 269</p>
<p><b>NEAL FISHMAN</b> is a Distinguished Engineer and CTO of Data-Based Pathology at IBM. He is an IBM-certified Senior IT Architect and Open Group Distinguished Chief Architect. <p><b>COLE STRYKER</b> is a journalist based in Los Angeles. He is the author of <i>Epic Win for Anonymous</i> and <i>Hacking the Future</i>.
<p><b>PRAISE FOR SMARTER DATA SCIENCE</b> <p>"This work provides benefit to a variety of roles, including architects, developers, product owners, and business executives. For organizations exploring AI, this book is the cornerstone to becoming successful."<br> <b>—Harry Xuegang Huang Ph.D., External Consultant, A.P. Moller – Maersk</b> <p>"Presents a holistic model that emphasizes how critical data and data management are in implementing successful value-driven data analytics and AI solutions. The book presents an elegant and novel approach to data management."<br> <b>—Ali Farahani, Ph.D., Former Chief Data Officer, County of Los Angeles; Adjunct Associate Professor, USC</b> <p>"The authors seek and speak the truth, and penetrate into the core of the challenge most organizations face in finding value in their data. Our industry needs to move away from trying to connect the winning dots by 'magical' technologies and overly simplified approaches. This book provides the necessary guidance."<br> <b>—Jan Gravesen, M.Sc., IBM Distinguished Engineer, Director and Chief Technology Officer, IBM</b> <p><b>BUILD A ROBUST INFORMATION ARCHITECTURE THAT SCALES AND DELIVERS LONG-TERM VALUE</b> <p>Large organizations are racing to implement advanced data science. All too often, our AI endeavors turn out to be dead-end science projects that never deliver sustainable business value. What are we missing? In <i>Smarter Data Science: Succeeding with Enterprise-Grade Data and AI Projects,</i> you'll discover the pillars of information architecture that you must understand and implement. <p>Data analytics and AI only add value when they can predictably and consistently deliver business insights and scale across the organization. <i>Smarter Data Science</i> outlines an effective and practical way for organizing, managing, and evaluating data, so you can establish an information architecture to better drive AI and data science. 
<p><i>You'll learn how to:</i> <ul> <li>Simplify data management, making data available when and where it is needed</li> <li>Improve time to value for operationalizing AI use cases</li> <li>Make AI and data insights accessible across the enterprise</li> <li>Scale complex AI scenarios dynamically and in real time</li> <li>Develop an information architecture that brings predictable, repeatable value</li> </ul>
