Details

Deep Learning for Physical Scientists


Deep Learning for Physical Scientists

Accelerating Research with Machine Learning
1. Aufl.

von: Edward O. Pyzer-Knapp, Matthew Benatan

63,99 €

Verlag: Wiley
Format: PDF
Veröffentl.: 20.09.2021
ISBN/EAN: 9781119408321
Sprache: englisch
Anzahl Seiten: 208

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p><b>Discover the power of machine learning in the physical sciences with this one-stop resource from a leading voice in the field</b></p> <p><i>Deep Learning for Physical Scientists: Accelerating Research with Machine Learning</i> delivers an insightful analysis of the transformative techniques being used in deep learning within the physical sciences. The book offers readers the ability to understand, select, and apply the best deep learning techniques for their individual research problem and interpret the outcome.</p> <p>Designed to teach researchers to think in useful new ways about how to achieve results in their research, the book provides scientists with new avenues to attack problems and avoid common pitfalls and problems. Practical case studies and problems are presented, giving readers an opportunity to put what they have learned into practice, with exemplar coding approaches provided to assist the reader.</p> <p>From modelling basics to feed-forward networks, the book offers a broad cross-section of machine learning techniques to improve physical science research. Readers will also enjoy:</p> <ul> <li>A thorough introduction to the basic classification and regression with perceptrons</li> <li>An exploration of training algorithms, including back propagation and stochastic gradient descent and the parallelization of training</li> <li>An examination of multi-layer perceptrons for learning from descriptors and de-noising data</li> <li>Discussions of recurrent neural networks for learning from sequences and convolutional neural networks for learning from images</li> <li>A treatment of Bayesian optimization for tuning deep learning architectures</li> </ul> <p>Perfect for academic and industrial research professionals in the physical sciences, <i>Deep Learning for Physical Scientists: Accelerating Research with Machine Learning</i> will also earn a place in the libraries of industrial researchers who have access to large amounts of data but have yet to learn the techniques to fully exploit that access.</p>
<p>About the Authors xi</p> <p>Acknowledgements xii</p> <p><b>1 Prefix – Learning to “Think Deep” </b><b>1</b></p> <p>1.1 So What Do I Mean by Changing the Way You Think? 2</p> <p><b>2 Setting Up a Python Environment for Deep Learning Projects </b><b>5</b></p> <p>2.1 Python Overview 5</p> <p>2.2 Why Use Python for Data Science? 6</p> <p>2.3 Anaconda Python 7</p> <p>2.3.1 Why Use Anaconda? 7</p> <p>2.3.2 Downloading and Installing Anaconda Python 7</p> <p>2.3.2.1 Installing TensorFlow 9</p> <p>2.4 Jupyter Notebooks 10</p> <p>2.4.1 Why Use a Notebook? 10</p> <p>2.4.2 Starting a Jupyter Notebook Server 11</p> <p>2.4.3 Adding Markdown to Notebooks 12</p> <p>2.4.4 A Simple Plotting Example 14</p> <p>2.4.5 Summary 16</p> <p><b>3 Modelling Basics </b><b>17</b></p> <p>3.1 Introduction 17</p> <p>3.2 Start Where You Mean to Go On – Input Definition and Creation 17</p> <p>3.3 Loss Functions 18</p> <p>3.3.1 Classification and Regression 19</p> <p>3.3.2 Regression Loss Functions 19</p> <p>3.3.2.1 Mean Absolute Error 19</p> <p>3.3.2.2 Root Mean Squared Error 19</p> <p>3.3.3 Classification Loss Functions 20</p> <p>3.3.3.1 Precision 21</p> <p>3.3.3.2 Recall 21</p> <p>3.3.3.3 <i>F</i>1 Score 22</p> <p>3.3.3.4 Confusion Matrix 22</p> <p>3.3.3.5 (Area Under) Receiver Operator Curve (AU-ROC) 23</p> <p>3.3.3.6 Cross Entropy 25</p> <p>3.4 Overfitting and Underfitting 28</p> <p>3.4.1 Bias–Variance Trade-Off 29</p> <p>3.5 Regularisation 31</p> <p>3.5.1 Ridge Regression 31</p> <p>3.5.2 LASSO Regularisation 33</p> <p>3.5.3 Elastic Net 34</p> <p>3.5.4 Bagging and Model Averaging 34</p> <p>3.6 Evaluating a Model 35</p> <p>3.6.1 Holdout Testing 35</p> <p>3.6.2 Cross Validation 36</p> <p>3.7 The Curse of Dimensionality 37</p> <p>3.7.1 Normalising Inputs and Targets 37</p> <p>3.8 Summary 39</p> <p>Notes 39</p> <p><b>4 Feedforward Networks and Multilayered Perceptrons </b><b>41</b></p> <p>4.1 Introduction 41</p> <p>4.2 The Single Perceptron 41</p> <p>4.2.1 Training a Perceptron 41</p> <p>4.2.2 Activation Functions 42</p> <p>4.2.3 Back Propagation 43</p> <p>4.2.3.1 Weight Initialisation 45</p> <p>4.2.3.2 Learning Rate 46</p> <p>4.2.4 Key Assumptions 46</p> <p>4.2.5 Putting It All Together in TensorFlow 47</p> <p>4.3 Moving to a Deep Network 49</p> <p>4.4 Vanishing Gradients and Other “Deep” Problems 53</p> <p>4.4.1 Gradient Clipping 54</p> <p>4.4.2 Non-saturating Activation Functions 54</p> <p>4.4.2.1 ReLU 54</p> <p>4.4.2.2 Leaky ReLU 56</p> <p>4.4.2.3 ELU 57</p> <p>4.4.3 More Complex Initialisation Schemes 57</p> <p>4.4.3.1 Xavier 58</p> <p>4.4.3.2 He 58</p> <p>4.4.4 Mini Batching 59</p> <p>4.5 Improving the Optimisation 60</p> <p>4.5.1 Bias 60</p> <p>4.5.2 Momentum 63</p> <p>4.5.3 Nesterov Momentum 63</p> <p>4.5.4 (Adaptive) Learning Rates 63</p> <p>4.5.5 AdaGrad 64</p> <p>4.5.6 RMSProp 65</p> <p>4.5.7 Adam 65</p> <p>4.5.8 Regularisation 66</p> <p>4.5.9 Early Stopping 66</p> <p>4.5.10 Dropout 68</p> <p>4.6 Parallelisation of learning 69</p> <p>4.6.1 Hogwild! 69</p> <p>4.7 High and Low-level Tensorflow APIs 70</p> <p>4.8 Architecture Implementations 72</p> <p>4.9 Summary 73</p> <p>4.10 Papers to Read 73</p> <p><b>5 Recurrent Neural Networks </b><b>77</b></p> <p>5.1 Introduction 77</p> <p>5.2 Basic Recurrent Neural Networks 77</p> <p>5.2.1 Training a Basic RNN 78</p> <p>5.2.2 Putting It All Together in TensorFlow 79</p> <p>5.2.3 The Problem with Vanilla RNNs 81</p> <p>5.3 Long Short-Term Memory (LSTM) Networks 82</p> <p>5.3.1 Forget Gate 82</p> <p>5.3.2 Input Gate 84</p> <p>5.3.3 Output Gate 84</p> <p>5.3.4 Peephole Connections 85</p> <p>5.3.5 Putting It All Together in TensorFlow 86</p> <p>5.4 Gated Recurrent Units 87</p> <p>5.4.1 Putting It All Together in TensorFlow 88</p> <p>5.5 Using Keras for RNNs 88</p> <p>5.6 Real World Implementations 89</p> <p>5.7 Summary 89</p> <p>5.8 Papers to Read 90</p> <p><b>6 Convolutional Neural Networks </b><b>93</b></p> <p>6.1 Introduction 93</p> <p>6.2 Fundamental Principles of Convolutional Neural Networks 94</p> <p>6.2.1 Convolution 94</p> <p>6.2.2 Pooling 95</p> <p>6.2.2.1 Why Use Pooling? 95</p> <p>6.2.2.2 Types of Pooling 96</p> <p>6.2.3 Stride and Padding 99</p> <p>6.2.4 Sparse Connectivity 101</p> <p>6.2.5 Parameter Sharing 101</p> <p>6.2.6 Convolutional Neural Networks with TensorFlow 102</p> <p>6.3 Graph Convolutional Networks 103</p> <p>6.3.1 Graph Convolutional Networks in Practice 104</p> <p>6.4 Real World Implementations 107</p> <p>6.5 Summary 108</p> <p>6.6 Papers to Read 108</p> <p><b>7 Auto-Encoders </b><b>111</b></p> <p>7.1 Introduction 111</p> <p>7.1.1 Auto-Encoders for Dimensionality Reduction 111</p> <p>7.2 Getting a Good Start – Stacked Auto-Encoders, Restricted Boltzmann Machines, and Pretraining 115</p> <p>7.2.1 Restricted Boltzmann Machines 115</p> <p>7.2.2 Stacking Restricted Boltzmann Machines 118</p> <p>7.3 Denoising Auto-Encoders 120</p> <p>7.4 Variational Auto-Encoders 121</p> <p>7.5 Sequence to Sequence Learning 125</p> <p>7.6 The Attention Mechanism 126</p> <p>7.7 Application in Chemistry: Building a Molecular Generator 127</p> <p>7.8 Summary 132</p> <p>7.9 Real World Implementations 132</p> <p>7.10 Papers to Read 132</p> <p><b>8 Optimising Models Using Bayesian Optimisation </b><b>135</b></p> <p>8.1 Introduction 135</p> <p>8.2 Defining Our Function 135</p> <p>8.3 Grid and Random Search 136</p> <p>8.4 Moving Towards an Intelligent Search 137</p> <p>8.5 Exploration and Exploitation 137</p> <p>8.6 Greedy Search 138</p> <p>8.6.1 Key Fact One – Exploitation Heavy Search is Susceptible to Initial Data Bias 139</p> <p>8.7 Diversity Search 141</p> <p>8.8 Bayesian Optimisation 142</p> <p>8.8.1 Domain Knowledge (or Prior) 142</p> <p>8.8.2 Gaussian Processes 145</p> <p>8.8.3 Kernels 146</p> <p>8.8.3.1 Stationary Kernels 146</p> <p>8.8.3.2 Noise Kernel 147</p> <p>8.8.4 Combining Gaussian Process Prediction and Optimisation 149</p> <p>8.8.4.1 Probability of Improvement 149</p> <p>8.8.4.2 Expected Improvement 150</p> <p>8.8.5 Balancing Exploration and Exploitation 151</p> <p>8.8.6 Upper and Lower Confidence Bound Algorithm 151</p> <p>8.8.7 Maximum Entropy Sampling 152</p> <p>8.8.8 Optimising the Acquisition Function 153</p> <p>8.8.9 Cost Sensitive Bayesian Optimisation 155</p> <p>8.8.10 Constrained Bayesian Optimisation 158</p> <p>8.8.11 Parallel Bayesian Optimisation 158</p> <p>8.8.11.1 qEI 158</p> <p>8.8.11.2 Constant Liar and Kriging Believer 160</p> <p>8.8.11.3 Local Penalisation 162</p> <p>8.8.11.4 Parallel Thompson Sampling 162</p> <p>8.8.11.5 <i>K</i>-Means Batch Bayesian Optimisation 162</p> <p>8.9 Summary 163</p> <p>8.10 Papers to Read 163</p> <p><b>Case Study 1 Solubility Prediction Case Study </b><b>167</b></p> <p>CS 1.1 Step 1 – Import Packages 167</p> <p>CS 1.2 Step 2 – Importing the Data 168</p> <p>CS 1.3 Step 3 – Creating the Inputs 168</p> <p>CS 1.4 Step 4 – Splitting into Training and Testing 168</p> <p>CS 1.5 Step 5 – Defining Our Model 169</p> <p>CS 1.6 Step 6 – Running Our Model 169</p> <p>CS 1.7 Step 7 – Automatically Finding an Optimised Architecture Using Bayesian Optimisation 170</p> <p><b>Case Study 2 Time Series Forecasting with LSTMs </b><b>173</b></p> <p>CS 2.1 Simple LSTM 173</p> <p>CS 2.2 Sequence-to-Sequence LSTM 177</p> <p><b>Case Study 3 Deep Embeddings for Auto-Encoder-Based Featurisation </b><b>185</b></p> <p>Index 190</p>
<p><b>Dr Edward O. Pyzer-Knapp</b> is the worldwide lead for AI Enriched Modelling and Simulation at IBM Research. Previously, he obtained his PhD from the University of Cambridge using state of the art computational techniques to accelerate materials design then moving to Harvard where he was in charge of the day-to-day running of the Harvard Clean Energy Project - a collaboration with IBM which combined massive distributed computing, quantum-mechanical simulations, and machine-learning to accelerate discovery of the next generation of organic photovoltaic materials. He is also the Visiting Professor of Industrially Applied AI at the University of Liverpool, and the Editor in Chief for Applied AI Letters, a journal with a focus on real-world application and validation of AI.</p> <p><b>Dr Matt Benatan</b> received his PhD in Audio-Visual Speech Processing from the University of Leeds, after which he went on to pursue a career in AI research within industry. His work to date has involved the research and development of AI techniques for a broad variety of domains, from applications in audio processing through to materials discovery. His research interests include Computer Vision, Signal Processing, Bayesian Optimization, and Scalable Bayesian Inference.
<p><b>Discover the power of machine learning in the physical sciences with this one-stop resource from a leading voice in the field</b></p> <p><i>Deep Learning for Physical Scientists: Accelerating Research with Machine Learning</i> delivers an insightful analysis of the transformative techniques being used in deep learning within the physical sciences. The book offers readers the ability to understand, select, and apply the best deep learning techniques for their individual research problem and interpret the outcome. <p>Designed to teach researchers to think in useful new ways about how to achieve results in their research, the book provides scientists with new avenues to attack problems and avoid common pitfalls and problems. Practical case studies and problems are presented, giving readers an opportunity to put what they have learned into practice, with exemplar coding approaches provided to assist the reader. <p>From modelling basics to feed-forward networks, the book offers a broad cross-section of machine learning techniques to improve physical science research. Readers will also enjoy: <ul><li>A thorough introduction to the basic classification and regression with perceptrons</li> <li>An exploration of training algorithms, including back propagation and stochastic gradient descent and the parallelization of training</li> <li>An examination of multi-layer perceptrons for learning from descriptors and de-noising data</li> <li>Discussions of recurrent neural networks for learning from sequences and convolutional neural networks for learning from images</li> <li>A treatment of Bayesian optimization for tuning deep learning architectures</li></ul> <p>Perfect for academic and industrial research professionals in the physical sciences, <i>Deep Learning for Physical Scientists: Accelerating Research with Machine Learning</i> will also earn a place in the libraries of industrial researchers who have access to large amounts of data but have yet to learn the techniques to fully exploit that access.

Diese Produkte könnten Sie auch interessieren:

Hot-Melt Extrusion
Hot-Melt Extrusion
von: Dennis Douroumis
PDF ebook
136,99 €
Hot-Melt Extrusion
Hot-Melt Extrusion
von: Dennis Douroumis
EPUB ebook
136,99 €
Kunststoffe
Kunststoffe
von: Wilhelm Keim
PDF ebook
99,99 €