Details

Big Data Analytics for Large-Scale Multimedia Search


Big Data Analytics for Large-Scale Multimedia Search


1. Aufl.

von: Stefanos Vrochidis, Benoit Huet, Edward Y. Chang, Ioannis Kompatsiaris

112,99 €

Verlag: Wiley
Format: EPUB
Veröffentl.: 18.03.2019
ISBN/EAN: 9781119377009
Sprache: englisch
Anzahl Seiten: 376

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p><b>A timely overview of cutting edge technologies for multimedia retrieval with a special emphasis on scalability</b></p> <p>The amount of multimedia data available every day is enormous and is growing at an exponential rate, creating a great need for new and more efficient approaches for large scale multimedia search. This book addresses that need, covering the area of multimedia retrieval and placing a special emphasis on scalability. It reports the recent works in large scale multimedia search, including research methods and applications, and is structured so that readers with basic knowledge can grasp the core message while still allowing experts and specialists to drill further down into the analytical sections.</p> <p><i>Big Data Analytics for Large-Scale Multimedia Search</i> covers: representation learning, concept and event-based video search in large collections; big data multimedia mining, large scale video understanding, big multimedia data fusion, large-scale social multimedia analysis, privacy and audiovisual content, data storage and management for big multimedia, large scale multimedia search, multimedia tagging using deep learning, interactive interfaces for big multimedia and medical decision support applications using large multimodal data.</p> <ul> <li>Addresses the area of multimedia retrieval and pays close attention to the issue of scalability</li> <li>Presents problem driven techniques with solutions that are demonstrated through realistic case studies and user scenarios</li> <li>Includes tables, illustrations, and figures</li> <li>Offers a Wiley-hosted BCS that features links to open source algorithms, data sets and tools</li> </ul> <p><i>Big Data Analytics for Large-Scale Multimedia Search</i> is an excellent book for academics, industrial researchers, and developers interested in big multimedia data search retrieval. It will also appeal to consultants in computer science problems and professionals in the multimedia industry.</p>
<p>Introduction xv</p> <p>List of Contributors xix</p> <p>About the Companion Website xxiii</p> <p><b>Part I Feature Extraction from Big Multimedia Data </b><b>1</b></p> <p><b>1 Representation Learning on Large and Small Data </b><b>3<br /> </b><i>Chun-Nan Chou, Chuen-Kai Shie, Fu-Chieh Chang, Jocelyn Chang and Edward Y. Chang</i></p> <p>1.1 Introduction 3</p> <p>1.2 Representative Deep CNNs 5</p> <p>1.2.1 AlexNet 6</p> <p>1.2.1.1 ReLU Nonlinearity 6</p> <p>1.2.1.2 Data Augmentation 7</p> <p>1.2.1.3 Dropout 8</p> <p>1.2.2 Network in Network 8</p> <p>1.2.2.1 MLP Convolutional Layer 9</p> <p>1.2.2.2 Global Average Pooling 9</p> <p>1.2.3 VGG 10</p> <p>1.2.3.1 Very Small Convolutional Filters 10</p> <p>1.2.3.2 Multi-scale Training 11</p> <p>1.2.4 GoogLeNet 11</p> <p>1.2.4.1 Inception Modules 11</p> <p>1.2.4.2 Dimension Reduction 12</p> <p>1.2.5 ResNet 13</p> <p>1.2.5.1 Residual Learning 13</p> <p>1.2.5.2 Identity Mapping by Shortcuts 14</p> <p>1.2.6 Observations and Remarks 15</p> <p>1.3 Transfer Representation Learning 15</p> <p>1.3.1 Method Specifications 17</p> <p>1.3.2 Experimental Results and Discussion 18</p> <p>1.3.2.1 Results of Transfer Representation Learning for OM 19</p> <p>1.3.2.2 Results of Transfer Representation Learning for Melanoma 20</p> <p>1.3.2.3 Qualitative Evaluation: Visualization 21</p> <p>1.3.3 Observations and Remarks 23</p> <p>1.4 Conclusions 24</p> <p>References 25</p> <p><b>2 Concept-Based and Event-Based Video Search in Large Video Collections </b><b>31<br /> </b><i>Foteini Markatopoulou, Damianos Galanopoulos, Christos Tzelepis, Vasileios Mezaris and Ioannis Patras</i></p> <p>2.1 Introduction 32</p> <p>2.2 Video preprocessing and Machine Learning Essentials 33</p> <p>2.2.1 Video Representation 33</p> <p>2.2.2 Dimensionality Reduction 34</p> <p>2.3 Methodology for Concept Detection and Concept-Based Video Search 35</p> <p>2.3.1 Related Work 35</p> <p>2.3.2 Cascades for Combining Different Video Representations 37</p> <p>2.3.2.1 Problem Definition and Search Space 37</p> <p>2.3.2.2 Problem Solution 38</p> <p>2.3.3 Multi-Task Learning for Concept Detection and Concept-Based Video Search 40</p> <p>2.3.4 Exploiting Label Relations 41</p> <p>2.3.5 Experimental Study 42</p> <p>2.3.5.1 Dataset and Experimental Setup 42</p> <p>2.3.5.2 Experimental Results 43</p> <p>2.3.5.3 Computational Complexity 47</p> <p>2.4 Methods for Event Detection and Event-Based Video Search 48</p> <p>2.4.1 Related Work 48</p> <p>2.4.2 Learning from Positive Examples 49</p> <p>2.4.3 Learning Solely from Textual Descriptors: Zero-Example Learning 50</p> <p>2.4.4 Experimental Study 52</p> <p>2.4.4.1 Dataset and Experimental Setup 52</p> <p>2.4.4.2 Experimental Results: Learning from Positive Examples 53</p> <p>2.4.4.3 Experimental Results: Zero-Example Learning 53</p> <p>2.5 Conclusions 54</p> <p>2.6 Acknowledgments 55</p> <p>References 55</p> <p><b>3 Big Data Multimedia Mining: Feature Extraction Facing Volume, Velocity, and Variety </b><b>61<br /> </b><i>Vedhas Pandit, Shahin Amiriparian, Maximilian Schmitt, Amr Mousa and Björn Schuller</i></p> <p>3.1 Introduction 61</p> <p>3.2 Scalability through Parallelization 64</p> <p>3.2.1 Process Parallelization 64</p> <p>3.2.2 Data Parallelization 64</p> <p>3.3 Scalability through Feature Engineering 65</p> <p>3.3.1 Feature Reduction through Spatial Transformations 66</p> <p>3.3.2 Laplacian Matrix Representation 66</p> <p>3.3.3 Parallel latent Dirichlet allocation and bag of words 68</p> <p>3.4 Deep Learning-Based Feature Learning 68</p> <p>3.4.1 Adaptability that Conquers both Volume and Velocity 70</p> <p>3.4.2 Convolutional Neural Networks 72</p> <p>3.4.3 Recurrent Neural Networks 73</p> <p>3.4.4 Modular Approach to Scalability 74</p> <p>3.5 Benchmark Studies 76</p> <p>3.5.1 Dataset 76</p> <p>3.5.2 Spectrogram Creation 77</p> <p>3.5.3 CNN-Based Feature Extraction 77</p> <p>3.5.4 Structure of the CNNs 78</p> <p>3.5.5 Process Parallelization 79</p> <p>3.5.6 Results 80</p> <p>3.6 Closing Remarks 81</p> <p>3.7 Acknowledgements 82</p> <p>References 82</p> <p><b>Part II Learning Algorithms for Large-Scale Multimedia </b><b>89</b></p> <p><b>4 Large-Scale Video Understanding with Limited Training Labels </b><b>91<br /> </b><i>Jingkuan Song, Xu Zhao, Lianli Gao and Liangliang Cao</i></p> <p>4.1 Introduction 91</p> <p>4.2 Video Retrieval with Hashing 91</p> <p>4.2.1 Overview 91</p> <p>4.2.2 Unsupervised Multiple Feature Hashing 93</p> <p>4.2.2.1 Framework 93</p> <p>4.2.2.2 The Objective Function of MFH 93</p> <p>4.2.2.3 Solution of MFH 95</p> <p>4.2.2.3.1 Complexity Analysis 96</p> <p>4.2.3 Submodular Video Hashing 97</p> <p>4.2.3.1 Framework 97</p> <p>4.2.3.2 Video Pooling 97</p> <p>4.2.3.3 Submodular Video Hashing 98</p> <p>4.2.4 Experiments 99</p> <p>4.2.4.1 Experiment Settings 99</p> <p>4.2.4.1.1 Video Datasets 99</p> <p>4.2.4.1.2 Visual Features 99</p> <p>4.2.4.1.3 Algorithms for Comparison 100</p> <p>4.2.4.2 Results 100</p> <p>4.2.4.2.1 CC_WEB_VIDEO 100</p> <p>4.2.4.2.2 Combined Dataset 100</p> <p>4.2.4.3 Evaluation of SVH 101</p> <p>4.2.4.3.1 Results 102</p> <p>4.3 Graph-Based Model for Video Understanding 103</p> <p>4.3.1 Overview 103</p> <p>4.3.2 Optimized Graph Learning for Video Annotation 104</p> <p>4.3.2.1 Framework 104</p> <p>4.3.2.2 OGL 104</p> <p>4.3.2.2.1 Terms and Notations 104</p> <p>4.3.2.2.2 Optimal Graph-Based SSL 105</p> <p>4.3.2.2.3 Iterative Optimization 106</p> <p>4.3.3 Context Association Model for Action Recognition 107</p> <p>4.3.3.1 Context Memory 108</p> <p>4.3.4 Graph-based Event Video Summarization 109</p> <p>4.3.4.1 Framework 109</p> <p>4.3.4.2 Temporal Alignment 110</p> <p>4.3.5 TGIF: A New Dataset and Benchmark on Animated GIF Description 111</p> <p>4.3.5.1 Data Collection 111</p> <p>4.3.5.2 Data Annotation 112</p> <p>4.3.6 Experiments 114</p> <p>4.3.6.1 Experimental Settings 114</p> <p>4.3.6.1.1 Datasets 114</p> <p>4.3.6.1.2 Features 114</p> <p>4.3.6.1.3 Baseline Methods and Evaluation Metrics 114</p> <p>4.3.6.2 Results 115</p> <p>4.4 Conclusions and Future Work 116</p> <p>References 116</p> <p><b>5 Multimodal Fusion of Big Multimedia Data </b><b>121<br /> </b><i>Ilias Gialampoukidis, Elisavet Chatzilari, Spiros Nikolopoulos, Stefanos Vrochidis and Ioannis Kompatsiaris</i></p> <p>5.1 Multimodal Fusion in Multimedia Retrieval 122</p> <p>5.1.1 Unsupervised Fusion in Multimedia Retrieval 123</p> <p>5.1.1.1 Linear and Non-linear Similarity Fusion 123</p> <p>5.1.1.2 Cross-modal Fusion of Similarities 124</p> <p>5.1.1.3 Random Walks and Graph-based Fusion 124</p> <p>5.1.1.4 A Unifying Graph-based Model 126</p> <p>5.1.2 Partial Least Squares Regression 127</p> <p>5.1.3 Experimental Comparison 128</p> <p>5.1.3.1 Dataset Description 128</p> <p>5.1.3.2 Settings 129</p> <p>5.1.3.3 Results 129</p> <p>5.1.4 Late Fusion of Multiple Multimedia Rankings 130</p> <p>5.1.4.1 Score Fusion 131</p> <p>5.1.4.2 Rank Fusion 132</p> <p>5.1.4.2.1 Borda Count Fusion 132</p> <p>5.1.4.2.2 Reciprocal Rank Fusion 132</p> <p>5.1.4.2.3 Condorcet Fusion 132</p> <p>5.2 Multimodal Fusion in Multimedia Classification 132</p> <p>5.2.1 Related Literature 134</p> <p>5.2.2 Problem Formulation 136</p> <p>5.2.3 Probabilistic Fusion in Active Learning 137</p> <p>5.2.3.1 If P(S=0|V,T)≠0: 138</p> <p>5.2.3.2 If P(S=0|V,T)≠0: 138</p> <p>5.2.3.3 Incorporating Informativeness in the Selection (<i>P</i>(<i>S</i>|<i>V</i>)) 139</p> <p>5.2.3.4 Measuring Oracle’s Confidence (<i>P</i>(<i>S</i>|<i>T</i>)) 139</p> <p>5.2.3.5 Re-training 140</p> <p>5.2.4 Experimental Comparison 141</p> <p>5.2.4.1 Datasets 141</p> <p>5.2.4.2 Settings 142</p> <p>5.2.4.3 Results 143</p> <p>5.2.4.3.1 Expanding with Positive, Negative or Both 143</p> <p>5.2.4.3.2 Comparing with Sample Selection Approaches 145</p> <p>5.2.4.3.3 Comparing with Fusion Approaches 147</p> <p>5.2.4.3.4 Parameter Sensitivity Investigation 147</p> <p>5.2.4.3.5 Comparing with Existing Methods 148</p> <p>5.3 Conclusions 151</p> <p>References 152</p> <p><b>6 Large-Scale Social Multimedia Analysis </b><b>157<br /> </b><i>Benjamin Bischke, Damian Borth and Andreas Dengel</i></p> <p>6.1 Social Multimedia in Social Media Streams 157</p> <p>6.1.1 Social Multimedia 157</p> <p>6.1.2 Social Multimedia Streams 158</p> <p>6.1.3 Analysis of the Twitter Firehose 160</p> <p>6.1.3.1 Dataset: Overview 160</p> <p>6.1.3.2 Linked Resource Analysis 160</p> <p>6.1.3.3 Image Content Analysis 162</p> <p>6.1.3.4 Geographic Analysis 164</p> <p>6.1.3.5 Textual Analysis 166</p> <p>6.2 Large-Scale Analysis of Social Multimedia 167</p> <p>6.2.1 Large-Scale Processing of Social Multimedia Analysis 167</p> <p>6.2.1.1 Batch-Processing Frameworks 167</p> <p>6.2.1.2 Stream-Processing Frameworks 168</p> <p>6.2.1.3 Distributed Processing Frameworks 168</p> <p>6.2.2 Analysis of Social Multimedia 169</p> <p>6.2.2.1 Analysis of Visual Content 169</p> <p>6.2.2.2 Analysis of Textual Content 169</p> <p>6.2.2.3 Analysis of Geographical Content 170</p> <p>6.2.2.4 Analysis of User Content 170</p> <p>6.3 Large-Scale Multimedia Opinion Mining System 170</p> <p>6.3.1 System Overview 171</p> <p>6.3.2 Implementation Details 171</p> <p>6.3.2.1 Social Media Data Crawler 171</p> <p>6.3.2.2 Social Multimedia Analysis 173</p> <p>6.3.2.3 Analysis of Visual Content 174</p> <p>6.3.3 Evaluations: Analysis of Visual Content 175</p> <p>6.3.3.1 Filtering of Synthetic Images 175</p> <p>6.3.3.2 Near-Duplicate Detection 177</p> <p>6.4 Conclusion 178</p> <p>References 179</p> <p><b>7 Privacy and Audiovisual Content: Protecting Users as Big Multimedia Data Grows Bigger </b><b>183<br /> </b><i>Martha Larson, Jaeyoung Choi, Manel Slokom, Zekeriya Erkin, Gerald Friedland and Arjen P. de Vries</i></p> <p>7.1 Introduction 183</p> <p>7.1.1 The Dark Side of Big Multimedia Data 184</p> <p>7.1.2 Defining Multimedia Privacy 184</p> <p>7.2 Protecting User Privacy 188</p> <p>7.2.1 What to Protect 188</p> <p>7.2.2 How to Protect 189</p> <p>7.2.3 Threat Models 191</p> <p>7.3 Multimedia Privacy 192</p> <p>7.3.1 Privacy and Multimedia Big Data 192</p> <p>7.3.2 Privacy Threats of Multimedia Data 194</p> <p>7.3.2.1 Audio Data 194</p> <p>7.3.2.2 Visual Data 195</p> <p>7.3.2.3 Multimodal Threats 195</p> <p>7.4 Privacy-Related Multimedia Analysis Research 196</p> <p>7.4.1 Multimedia Analysis Filters 196</p> <p>7.4.2 Multimedia Content Masking 198</p> <p>7.5 The Larger Research Picture 199</p> <p>7.5.1 Multimedia Security and Trust 199</p> <p>7.5.2 Data Privacy 200</p> <p>7.6 Outlook on Multimedia Privacy Challenges 202</p> <p>7.6.1 Research Challenges 202</p> <p>7.6.1.1 Multimedia Analysis 202</p> <p>7.6.1.2 Data 202</p> <p>7.6.1.3 Users 203</p> <p>7.6.2 Research Reorientation 204</p> <p>7.6.2.1 Professional Paranoia 204</p> <p>7.6.2.2 Privacy as a Priority 204</p> <p>7.6.2.3 Privacy in Parallel 205</p> <p>References 205</p> <p><b>Part III Scalability in Multimedia Access </b><b>209</b></p> <p><b>8 Data Storage and Management for Big Multimedia </b><b>211<br /> </b><i>Björn Þór Jónsson, Gylfi Þór Gudmundsson, Laurent Amsaleg and Philippe Bonnet</i></p> <p>8.1 Introduction 211</p> <p>8.1.1 Multimedia Applications and Scale 212</p> <p>8.1.2 Big Data Management 213</p> <p>8.1.3 System Architecture Outline 213</p> <p>8.1.4 Metadata Storage Architecture 214</p> <p>8.1.4.1 Lambda Architecture 214</p> <p>8.1.4.2 Storage Layer 215</p> <p>8.1.4.3 Processing Layer 216</p> <p>8.1.4.4 Serving Layer 216</p> <p>8.1.4.5 Dynamic Data 216</p> <p>8.1.5 Summary and Chapter Outline 217</p> <p>8.2 Media Storage 217</p> <p>8.2.1 Storage Hierarchy 217</p> <p>8.2.1.1 Secondary Storage 218</p> <p>8.2.1.2 The Five-Minute Rule 218</p> <p>8.2.1.3 Emerging Trends for Local Storage 219</p> <p>8.2.2 Distributed Storage 220</p> <p>8.2.2.1 Distributed Hash Tables 221</p> <p>8.2.2.2 The CAP Theorem and the PACELC Formulation 221</p> <p>8.2.2.3 The Hadoop Distributed File System 221</p> <p>8.2.2.4 Ceph 222</p> <p>8.2.3 Discussion 222</p> <p>8.3 Processing Media 222</p> <p>8.3.1 Metadata Extraction 223</p> <p>8.3.2 Batch Processing 223</p> <p>8.3.2.1 Map-Reduce and Hadoop 224</p> <p>8.3.2.2 Spark 225</p> <p>8.3.2.3 Comparison 226</p> <p>8.3.3 Stream Processing 226</p> <p>8.4 Multimedia Delivery 226</p> <p>8.4.1 Distributed In-Memory Buffering 227</p> <p>8.4.1.1 Memcached and Redis 227</p> <p>8.4.1.2 Alluxio 227</p> <p>8.4.1.3 Content Distribution Networks 228</p> <p>8.4.2 Metadata Retrieval and NoSQL Systems 228</p> <p>8.4.2.1 Key-Value Stores 229</p> <p>8.4.2.2 Document Stores 229</p> <p>8.4.2.3 Wide Column Stores 229</p> <p>8.4.2.4 Graph Stores 229</p> <p>8.4.3 Discussion 229</p> <p>8.5 Case Studies: Facebook 230</p> <p>8.5.1 Data Popularity: Hot, Warm or Cold 230</p> <p>8.5.2 Mentions Live 231</p> <p>8.6 Conclusions and Future Work 231</p> <p>8.6.1 Acknowledgments 232</p> <p>References 232</p> <p><b>9 Perceptual Hashing for Large-Scale Multimedia Search </b><b>239<br /> </b><i>LiWeng, I-Hong Jhuo and Wen-Huang Cheng</i></p> <p>9.1 Introduction 240</p> <p>9.1.1 Related work 240</p> <p>9.1.2 Definitions and Properties of Perceptual Hashing 241</p> <p>9.1.3 Multimedia Search using Perceptual Hashing 243</p> <p>9.1.4 Applications of Perceptual Hashing 243</p> <p>9.1.5 Evaluating Perceptual Hash Algorithms 244</p> <p>9.2 Unsupervised Perceptual Hash Algorithms 245</p> <p>9.2.1 Spectral Hashing 245</p> <p>9.2.2 Iterative Quantization 246</p> <p>9.2.3 K-Means Hashing 247</p> <p>9.2.4 Kernelized Locality Sensitive Hashing 249</p> <p>9.3 Supervised Perceptual Hash Algorithms 250</p> <p>9.3.1 Semi-Supervised Hashing 250</p> <p>9.3.2 Kernel-Based Supervised Hashing 252</p> <p>9.3.3 Restricted Boltzmann Machine-Based Hashing 253</p> <p>9.3.4 Supervised Semantic-Preserving Deep Hashing 255</p> <p>9.4 Constructing Perceptual Hash Algorithms 257</p> <p>9.4.1 Two-Step Hashing 257</p> <p>9.4.2 Hash Bit Selection 258</p> <p>9.5 Conclusion and Discussion 260</p> <p>References 261</p> <p><b>Part IV Applications of Large-Scale Multimedia Search </b><b>267</b></p> <p><b>10 Image Tagging with Deep Learning: Fine-Grained Visual Analysis </b><b>269<br /> </b><i>Jianlong Fu and Tao Mei</i></p> <p>10.1 Introduction 269</p> <p>10.2 Basic Deep Learning Models 270</p> <p>10.3 Deep Image Tagging for Fine-Grained Image Recognition 272</p> <p>10.3.1 Attention Proposal Network 274</p> <p>10.3.2 Classification and Ranking 275</p> <p>10.3.3 Multi-Scale Joint Representation 276</p> <p>10.3.4 Implementation Details 276</p> <p>10.3.5 Experiments on CUB-200-2011 277</p> <p>10.3.6 Experiments on Stanford Dogs 280</p> <p>10.4 Deep Image Tagging for Fine-Grained Sentiment Analysis 281</p> <p>10.4.1 Learning Deep Sentiment Representation 282</p> <p>10.4.2 Sentiment Analysis 283</p> <p>10.4.3 Experiments on SentiBank 283</p> <p>10.5 Conclusion 284</p> <p>References 285</p> <p><b>11 Visually Exploring Millions of Images using Image Maps and Graphs </b><b>289<br /> </b><i>Kai Uwe Barthel and Nico Hezel</i></p> <p>11.1 Introduction and Related Work 290</p> <p>11.2 Algorithms for Image Sorting 293</p> <p>11.2.1 Self-Organizing Maps 293</p> <p>11.2.2 Self-Sorting Maps 294</p> <p>11.2.3 Evolutionary Algorithms 295</p> <p>11.3 Improving SOMs for Image Sorting 295</p> <p>11.3.1 Reducing SOM Sorting Complexity 295</p> <p>11.3.2 Improving SOM Projection Quality 297</p> <p>11.3.3 Combining SOMs and SSMs 297</p> <p>11.4 Quality Evaluation of Image Sorting Algorithms 298</p> <p>11.4.1 Analysis of SOMs 298</p> <p>11.4.2 Normalized Cross-Correlation 299</p> <p>11.4.3 A New Image Sorting Quality Evaluation Scheme 299</p> <p>11.5 2D Sorting Results 301</p> <p>11.5.1 Image Test Sets 301</p> <p>11.5.2 Experiments 302</p> <p>11.6 Demo System for Navigating 2D Image Maps 304</p> <p>11.7 Graph-Based Image Browsing 306</p> <p>11.7.1 Generating Semantic Image Features 306</p> <p>11.7.2 Building the Image Graph 307</p> <p>11.7.3 Visualizing and Navigating the Graph 310</p> <p>11.7.4 Prototype for Image Graph Navigation 312</p> <p>11.8 Conclusion and Future Work 313</p> <p>References 313</p> <p><b>12 Medical Decision Support Using Increasingly Large Multimodal Data Sets </b><b>317<br /> </b><i>Henning Müller and Devrim Ünay</i></p> <p>12.1 Introduction 317</p> <p>12.2 Methodology for Reviewing the Literature in this chapter 320</p> <p>12.3 Data, Ground Truth, and Scientific Challenges 321</p> <p>12.3.1 Data Annotation and Ground Truthing 321</p> <p>12.3.2 Scientific Challenges and Evaluation as a Service 321</p> <p>12.3.3 Other Medical Data Resources Available 322</p> <p>12.4 Techniques used for Multimodal Medical Decision Support 323</p> <p>12.4.1 Visual and Non-Visual Features Describing the Image Content 323</p> <p>12.4.2 General Machine Learning and Deep Learning 323</p> <p>12.5 Application Types of Image-Based Decision Support 326</p> <p>12.5.1 Localization 326</p> <p>12.5.2 Segmentation 326</p> <p>12.5.3 Classification 327</p> <p>12.5.4 Prediction 327</p> <p>12.5.5 Retrieval 327</p> <p>12.5.6 Automatic Image Annotation 328</p> <p>12.5.7 Other Application Types 328</p> <p>12.6 Discussion on Multimodal Medical Decision Support 328</p> <p>12.7 Outlook or the Next Steps of Multimodal Medical Decision Support 329</p> <p>References 330</p> <p>Conclusions and Future Trends 337</p> <p>Index 339</p>
<p><b>Stefanos Vrochidis</b> is a Senior Researcher with the Information Technologies Institute (CERTH-ITI) in Greece. His research interests include multimedia retrieval, semantic multimedia analysis, multimodal big data analytics, web data mining, multimodal interaction and security applications. <p><b>Benoit Huet</b> is Assistant Professor in the Data Science Department of EURECOM, France. His current research interests include large scale multimedia content analysis, mining and indexing, multimodal fusion, and affective and socially-aware multimedia. <p><b>Edward Y. Chang</b> has acted as the President of AI Research and Healthcare at HTC since 2012. Prior to his current post, he was a director of research at Google from 2006 to 2012, and a professor at the University of California, Santa Barbara, from 1999 to 2006. He is an IEEE Fellow for his contribution to scalable machine learning. <p><b>Ioannis Kompatsiaris</b> is a Senior Researcher with the Information Technologies Institute (CERTH-ITI) in Greece, leading the Multimedia, Knowledge and Social Media Analytics Lab. His research interests include large-scale multimedia and social media analysis, knowledge structures and reasoning, eHealth, security and environmental applications.
<p><b>A timely overview of cutting edge technologies for multimedia retrieval, with a special emphasis on scalability</b> <p>The amount of multimedia data available every day is enormous and is growing at an exponential rate, creating a great need for new and more efficient approaches for large scale multimedia search. This book addresses that need, covering the area of multimedia retrieval and placing a special emphasis on scalability. It reports the recent works in large scale multimedia search, including research methods and applications, and is structured so that readers with basic knowledge can grasp the core message while still allowing experts and specialists to drill further down into the analytical sections. <p><i>Big Data Analytics for Large-Scale Multimedia Search</i> covers: representation learning; concept and event-based video search in large collections; big data multimedia mining; large scale video understanding; big multimedia data fusion; large-scale social multimedia analysis; privacy and audiovisual content; data storage and management for big multimedia; large scale multimedia search; multimedia tagging using deep learning; interactive interfaces for big multimedia and medical decision support applications using large multimodal data. <ul> <li>Addresses the area of multimedia retrieval and pays close attention to the issue of scalability</li> <li>Presents problem driven techniques with solutions that are demonstrated through realistic case studies and user scenarios</li> <li>Includes tables, illustrations, and figures</li> <li>Offers a Wiley-hosted book companion website that features links to open source algorithms, data sets and tools</li> </ul> <p><i>Big Data Analytics for Large-Scale Multimedia Search</i> is an excellent book for academics, industrial researchers, and developers interested in big multimedia data search retrieval. It will also appeal to consultants in computer science problems and professionals in the multimedia industry.

Diese Produkte könnten Sie auch interessieren:

Strategies to the Prediction, Mitigation and Management of Product Obsolescence
Strategies to the Prediction, Mitigation and Management of Product Obsolescence
von: Bjoern Bartels, Ulrich Ermel, Peter Sandborn, Michael G. Pecht
PDF ebook
116,99 €