- Intro
- Preface
- Organization
- Contents
- New Generation Data Warehouses Design
- Evaluation of Data Warehouse Design Methodologies in the Context of Big Data
- Abstract
- 1 Introduction
- 2 Methodology Classification
- 3 Metrics for Design Evaluation of Methodologies
- 3.1 Metrics for Methodology Evaluation
- 3.2 Metrics for Schema Quality Evaluation
- 4 Experimental Results
- 4.1 Methodology Evaluation
- 4.2 Schema Evaluation
- 5 Conclusion
- References
- Optimal Task Ordering in Chain Data Flows: Exploring the Practicality of Non-scalable Solutions
- 1 Introduction
- 2 Preliminaries
- 2.1 Problem Complexity
- 2.2 Chains in TPC-DI
- 3 Accurate Algorithms for Linear Execution Plans
- 3.1 Backtracking
- 3.2 Dynamic Programming
- 3.3 Topological Sorting
- 4 Evaluation of the Time Overhead
- 5 Related Work
- 6 Conclusions
- References
- Exploiting Mathematical Structures of Statistical Measures for Comparison of RDF Data Cubes
- 1 Introduction
- 2 Model and Data Representation
- 3 Structural Comparison of RDF Data Cubes
- 3.1 Computability and Comparability
- 3.2 Comparison Functionalities
- 3.3 Experimentation
- 4 Conclusion
- References
- S2D: Shared Distributed Datasets, Storing Shared Data for Multiple and Massive Queries Optimization in a Distributed Data Warehouse
- 1 Introduction
- 2 Related Work
- 3 Overview of Shared Distributed Datasets
- 3.1 Phase 1: The Logical Representation
- 3.2 Phase 2: The Physical Representation
- 4 Experimental Evaluation
- 4.1 Experimental Setup
- 4.2 Experimental Results and Discussion
- 5 Conclusion and Future Work
- References
- Cloud and NoSQL Databases
- Enforcing Privacy in Cloud Databases
- 1 Introduction
- 2 Non-cryptographic Methods
- 2.1 Differential Privacy
- 2.2 Data Anonymization
- 2.3 Data Fragmentation
- 3 Secret Sharing-Based Methods
- 3.1 Verifiable Secret Sharing
- 3.2 Order-Preserving Secret Sharing
- 3.3 Discussion
- 4 Index-Based Methods
- 4.1 Bucketization-Based Indexing
- 4.2 Order-Preserving Indexing
- 4.3 Searchable Encryption
- 4.4 Discussion
- 5 Secure Databases
- 5.1 CryptDB
- 5.2 MONOMI
- 5.3 Multi-valued Order Preserving Encryption (MV-OPE)
- 5.4 Secure Trusted Hardware
- 5.5 Discussion
- 6 Conclusion
- 6.1 Security
- 6.2 Query Post-processing
- 6.3 Storage Overhead
- 6.4 Computational Overhead
- 6.5 Wrap-up
This book constitutes the refereed proceedings of the 19th International Conference on Big Data Analytics and Knowledge Discovery, DaWaK 2017, held in Lyon, France, in August 2017. The 24 revised full papers and 11 short papers presented were carefully reviewed and selected from 97 submissions. The papers are organized in the following topical sections: new generation data warehouses design; cloud and NoSQL databases; advanced programming paradigms; non-functional requirements satisfaction; machine learning; social media and twitter analysis; sentiment analysis and user influence; knowledge discovery; and data flow management and optimization. .
(source: Nielsen Book Data)