Details

NoSQL Data Models


NoSQL Data Models

Trends and Challenges
1. Aufl.

von: Olivier Pivert

139,99 €

Verlag: Wiley
Format: PDF
Veröffentl.: 27.07.2018
ISBN/EAN: 9781119544135
Sprache: englisch
Anzahl Seiten: 288

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<p>The topic of NoSQL databases has recently emerged, to face the Big Data challenge, namely the ever increasing volume of data to be handled. It is now recognized that relational databases are not appropriate in this context, implying that new database models and techniques are needed. This book presents recent research works, covering the following basic aspects: semantic data management, graph databases, and big data management in cloud environments. The chapters in this book report on research about the evolution of basic concepts such as data models, query languages, and new challenges regarding implementation issues.</p>
<p>Foreword xi<br /><i>Anne LAURENT and Dominique LAURENT</i></p> <p>Preface xiii<br /><i>Olivier PIVERT</i></p> <p><b>Chapter 1. NoSQL Languages and Systems </b><b>1<br /></b><i>Kim NGUY</i><i>ỄN</i></p> <p>1.1. Introduction 1</p> <p>1.1.1. The rise of NoSQL systems and languages 1</p> <p>1.1.2. Overview of NoSQL concepts 4</p> <p>1.1.3. Current trends of French research in NoSQL languages 6</p> <p>1.2. Join implementations on top of MapReduce 7</p> <p>1.3. Models for NoSQL languages and systems 12</p> <p>1.4. New challenges for database research 16</p> <p>1.5. Bibliography 18</p> <p><b>Chapter 2. Distributed SPARQL Query Processing: A Case Study with Apache Spark </b><b>21<br /></b><i>Bernd AMANN, Olivier CURÉ and Hubert NAACKE</i></p> <p>2.1. Introduction 21</p> <p>2.2. RDF and SPARQL 22</p> <p>2.2.1. RDF framework and data model 22</p> <p>2.2.2. SPARQL query language 25</p> <p>2.3. SPARQL query processing 29</p> <p>2.3.1. SPARQL with and without RDF/S entailment 29</p> <p>2.3.2. Query optimization 30</p> <p>2.3.3. Triple store systems 33</p> <p>2.4. SPARQL and MapReduce 34</p> <p>2.4.1. MapReduce-based SPARQL processing 35</p> <p>2.4.2. Related work 39</p> <p>2.5. SPARQL on Apache Spark 41</p> <p>2.5.1. Apache Spark 41</p> <p>2.5.2. SPARQL on Spark 42</p> <p>2.5.3. Experimental evaluation 48</p> <p>2.6. Bibliography 53</p> <p><b>Chapter 3. Doing Web Data: from Dataset Recommendation to Data Linking </b><b>57<br /></b><i>Manel ACHICHI, Mohamed BEN ELLEFI, Zohra BELLAHSENE and Konstantin TODOROV</i></p> <p>3.1. Introduction 57</p> <p>3.1.1. The Semantic Web vision 57</p> <p>3.1.2. Linked data life cycles 58</p> <p>3.1.3. Chapter overview 61</p> <p>3.2. Datasets recommendation for data linking 62</p> <p>3.2.1. Process definition 63</p> <p>3.2.2. Dataset recommendation for data linking based on a Semantic Web index 64</p> <p>3.2.3. Dataset recommendation for data linking based on social networks 64</p> <p>3.2.4. Dataset recommendation for data linking based on domain-specific keywords 65</p> <p>3.2.5. Dataset recommendation for data linking based on topic modeling 65</p> <p>3.2.6. Dataset recommendation for data linking based on topic profiles 66</p> <p>3.2.7. Dataset recommendation for data linking based on intensional profiling 67</p> <p>3.2.8. Discussion on dataset recommendation approaches 68</p> <p>3.3. Challenges of linking data 69</p> <p>3.3.1. Value dimension 70</p> <p>3.3.2. Ontological dimension 74</p> <p>3.3.3. Logical dimension 77</p> <p>3.4. Techniques applied to the data linking process 78</p> <p>3.4.1. Data linking techniques 79</p> <p>3.4.2. Discussion 83</p> <p>3.5. Conclusion 86</p> <p>3.6. Bibliography 87</p> <p><b>Chapter 4. Big Data Integration in Cloud Environments: Requirements, Solutions and Challenges </b><b>93<br /></b><i>Rami SELLAMI and Bruno DEFUDE</i></p> <p>4.1. Introduction 93</p> <p>4.2. Big Data integration requirements in Cloud environments 96</p> <p>4.3. Automatic data store selection and discovery 99</p> <p>4.3.1. Introduction 99</p> <p>4.3.2. Model-based approaches 99</p> <p>4.3.3. Matching-oriented approaches 100</p> <p>4.3.4. Comparison 102</p> <p>4.4. Unique access for all data stores 103</p> <p>4.4.1. Introduction 103</p> <p>4.4.2. ODBAPI: A unified REST API for relational and NoSQL data stores 104</p> <p>4.4.3. Other works 105</p> <p>4.4.4. Comparison 107</p> <p>4.5. Unified data model and query languages 108</p> <p>4.5.1. Introduction 108</p> <p>4.5.2. Data models of classical data integration approaches 109</p> <p>4.5.3. A global schema to unify the view over relational and NoSQL data stores 110</p> <p>4.5.4. Other works 113</p> <p>4.5.5. Comparison 117</p> <p>4.6. Query processing and optimization 118</p> <p>4.6.1. Introduction 118</p> <p>4.6.2. Federated query language approaches 118</p> <p>4.6.3. Integrated query language approaches 121</p> <p>4.6.4. Comparison 124</p> <p>4.7. Summary and open issues 125</p> <p>4.7.1. Summary 125</p> <p>4.7.2. Open issues 127</p> <p>4.8. Conclusion 129</p> <p>4.9. Bibliography 129</p> <p><b>Chapter 5. Querying RDF Data: A Multigraph-based Approach </b><b>135<br /></b><i>Vijay INGALALLI, Dino IENCO and Pascal PONCELET</i></p> <p>5.1. Introduction 135</p> <p>5.2. Related work 137</p> <p>5.3. Background and preliminaries 137</p> <p>5.3.1. RDF data 138</p> <p>5.3.2. SPARQL query 140</p> <p>5.3.3. SPARQL querying by adopting multigraph homomorphism 142</p> <p>5.4. AMBER: A SPARQL querying engine 143</p> <p>5.5. Index construction 144</p> <p>5.5.1. Attribute index 144</p> <p>5.5.2. Vertex signature index 145</p> <p>5.5.3. Vertex neighborhood index 148</p> <p>5.6. Query matching procedure 149</p> <p>5.6.1. Vertex-level processing 151</p> <p>5.6.2. Processing satellite vertices 152</p> <p>5.6.3. Arbitrary query processing 154</p> <p>5.7. Experimental analysis 159</p> <p>5.7.1. Experimental setup 159</p> <p>5.7.2. Workload generation 160</p> <p>5.7.3. Comparison with RDF engines 161</p> <p>5.8. Conclusion 164</p> <p>5.9. Acknowledgment 164</p> <p>5.10. Bibliography 164</p> <p><b>Chapter 6. Fuzzy Preference Queries to NoSQL Graph Databases </b><b>167<br /></b><i>Arnaud CASTELLTORT, Anne LAURENT, Olivier PIVERT, Olfa SLAMA and Virginie THION</i></p> <p>6.1. Introduction 167</p> <p>6.2. Preliminary statements 168</p> <p>6.2.1. Graph databases 168</p> <p>6.2.2. Fuzzy set theory 174</p> <p>6.3. Fuzzy preference queries over graph databases 176</p> <p>6.3.1. Fuzzy preference queries over crisp graph databases 176</p> <p>6.3.2. Fuzzy preference queries over fuzzy graph databases 182</p> <p>6.4. Implementation challenges 193</p> <p>6.4.1. Modeling fuzzy databases 193</p> <p>6.4.2. Evaluation of queries with fuzzy preferences 193</p> <p>6.4.3. Scalability 195</p> <p>6.5. Related work 197</p> <p>6.6. Conclusion and perspectives 198</p> <p>6.7. Acknowledgment 199</p> <p>6.8. Bibliography 199</p> <p><b>Chapter 7. Relevant Filtering in a Distributed Content-based Publish/Subscribe System </b><b>203<br /></b><i>Cédric DU MOUZA and Nicolas TRAVERS</i></p> <p>7.1. Introduction 203</p> <p>7.2. Related work: novelty and diversity filtering 205</p> <p>7.3. A Publish/Subscribe data model 206</p> <p>7.3.1. Data model 206</p> <p>7.3.2. Weighting terms in textual data flows 207</p> <p>7.4. Publish/Subscribe relevance 208</p> <p>7.4.1. Items and histories 208</p> <p>7.4.2. Novelty 209</p> <p>7.4.3. Diversity 209</p> <p>7.4.4. An overview of the filtering process 210</p> <p>7.4.5. Choices of relevance 210</p> <p>7.5. Real-time integration of novelty and diversity 212</p> <p>7.5.1. Centralized implementation 212</p> <p>7.5.2. Distributed filtering 216</p> <p>7.6. TDV updates 221</p> <p>7.6.1. TDV computation techniques 221</p> <p>7.6.2. Incremental approach 223</p> <p>7.6.3. TDV in a distributed environment 225</p> <p>7.7. Experiments 228</p> <p>7.7.1. Implementation and description of datasets 229</p> <p>7.7.2. TDV updates 229</p> <p>7.7.3. Filtering rate 230</p> <p>7.7.4. Performance evaluation in the centralized environment 234</p> <p>7.7.5. Performance evaluation in a distributed environment 238</p> <p>7.7.6. Quality of filtering 240</p> <p>7.8. Conclusion 241</p> <p>7.9. Bibliography 242</p> <p>List of Authors 245</p> <p>Index 247</p>
<strong>Olivier Pivert</strong> is currently a full Professor of Computer Science at the National School of Applied Sciences and Technology, Lannion, France; and a Member of the Institute for Research in Computer Science and Random Systems where he heads the Shaman research team.

Diese Produkte könnten Sie auch interessieren:

AutoCAD 2013 For Dummies
AutoCAD 2013 For Dummies
von: Bill Fane, David Byrnes
PDF ebook
23,99 €
IBM Workplace Services Express For Dummies
IBM Workplace Services Express For Dummies
von: Stephen R. Londergan, Michael Loria
PDF ebook
16,99 €
Building Flash Web Sites For Dummies
Building Flash Web Sites For Dummies
von: Doug Sahlin
PDF ebook
18,99 €