Acknowledgments | xi | |
About the Author | xii | |
1 Introduction to the World of Big Data | 1 | (30) |
1.1 Understanding Big Data | 1 | (1) |
1.2 Evolution of Big Data | 2 | (1) |
1.3 Failure of Traditional Database in Handling Big Data | 3 | (1) |
| 4 | (3) |
| 7 | (1) |
1.6 Different Types of Data | 8 | (3) |
1.7 Big Data Infrastructure | 11 | (1) |
| 12 | (6) |
| 18 | (3) |
1.10 Big Data Applications | 21 | (1) |
| 21 | (10) |
| 24 | (7) |
2 Big Data Storage Concepts | 31 | (22) |
| 32 | (5) |
| 37 | (6) |
2.3 Distributed File System | 43 | (1) |
2.4 Relational and Non-Relational Databases | 43 | (4) |
2.5 Scaling Up and Scaling Out Storage | 47 | (6) |
| 48 | (5) |
| 53 | (30) |
3.1 Introduction to NoSQL | 53 | (1) |
| 54 | (1) |
| 54 | (2) |
| 56 | (1) |
| 56 | (1) |
| 57 | (1) |
| 57 | (19) |
3.8 Migrating from RDBMS to NoSQL | 76 | (7) |
| 77 | (6) |
4 Processing, Management Concepts, and Cloud Computing | 83 | (1) |
Part I Big Data Processing and Management Concepts | 83 | (10) |
| 83 | (2) |
4.2 Shared Everything Architecture | 85 | (1) |
4.3 Shared-Nothing Architecture | 86 | (2) |
| 88 | (1) |
4.5 Real-Time Data Processing | 88 | (1) |
| 89 | (1) |
4.7 Distributed Computing | 90 | (1) |
4.8 Big Data Virtualization | 90 | (3) |
Part II Managing and Processing Big Data in Cloud Computing | 93 | (254) |
| 93 | (1) |
4.10 Cloud Computing Types | 94 | (1) |
| 95 | (1) |
| 96 | (5) |
| 101 | (10) |
| 103 | (8) |
5 Driving Big Data with Hadoop Tools and Technologies | 111 | (50) |
| 111 | (3) |
| 114 | (5) |
| 119 | (10) |
| 129 | (9) |
| 138 | (3) |
| 141 | (1) |
| 141 | (2) |
| 143 | (1) |
| 144 | (1) |
| 145 | (1) |
| 146 | (1) |
| 146 | (3) |
| 149 | (2) |
| 151 | (1) |
5.15 Hadoop Distributions | 152 | (9) |
| 153 | (8) |
| 161 | (26) |
6.1 Terminology of Big Data Analytics | 161 | (1) |
| 162 | (4) |
6.3 Data Analytics Life Cycle | 166 | (4) |
6.4 Big Data Analytics Techniques | 170 | (5) |
| 175 | (3) |
| 178 | (1) |
6.7 Big Data Business Intelligence | 178 | (2) |
6.8 Big Data Real-Time Analytics Processing | 180 | (1) |
6.9 Enterprise Data Warehouse | 181 | (6) |
| 182 | (5) |
7 Big Data Analytics with Machine Learning | 187 | (14) |
7.1 Introduction to Machine Learning | 187 | (1) |
7.2 Machine Learning Use Cases | 188 | (1) |
7.3 Types of Machine Learning | 189 | (12) |
| 196 | (5) |
8 Mining Data Streams and Frequent Itemset | 201 | (58) |
| 201 | (5) |
| 206 | (4) |
8.3 Frequent Itemset Generation | 210 | (1) |
8.4 Itemset Mining Algorithms | 211 | (18) |
8.5 Maximal and Closed Frequent Itemset | 229 | (4) |
8.6 Mining Maximal Frequent Itemsets: the GenMax Algorithm | 233 | (3) |
8.7 Mining Closed Frequent Itemsets: the Charm Algorithm | 236 | (1) |
8.8 CHARM Algorithm Implementation | 236 | (3) |
| 239 | (1) |
| 240 | (1) |
8.11 Important Terms Used in Bayesian Network | 241 | (8) |
8.12 Density Based Clustering Algorithm | 249 | (1) |
| 249 | (1) |
8.14 Kernel Density Estimation | 250 | (4) |
| 254 | (1) |
8.16 Time Series Forecasting | 255 | (4) |
| 259 | (34) |
| 259 | (2) |
9.2 Distance Measurement Techniques | 261 | (2) |
9.3 Hierarchical Clustering | 263 | (3) |
9.4 Analysis of Protein Patterns in the Human Cancer-Associated Liver | 266 | (1) |
9.5 Recognition Using Biometrics of Hands | 267 | (7) |
9.6 Expectation Maximization Clustering Algorithm | 274 | (3) |
9.7 Representative-Based Clustering | 277 | (1) |
9.8 Methods of Determining the Number of Clusters | 277 | (7) |
9.9 Optimization Algorithm | 284 | (4) |
9.10 Choosing the Number of Clusters | 288 | (2) |
9.11 Bayesian Analysis of Mixtures | 290 | (1) |
| 290 | (1) |
9.13 Fuzzy C-Means Clustering | 291 | (2) |
10 Big Data Visualization | 293 | (54) |
10.1 Big Data Visualization | 293 | (1) |
10.2 Conventional Data Visualization Techniques | 294 | (3) |
| 297 | (12) |
10.4 Bar Chart in Tableau | 309 | (1) |
| 310 | (1) |
| 311 | (1) |
| 312 | (1) |
| 313 | (1) |
| 313 | (5) |
10.10 Installing R and Getting Ready | 318 | (3) |
10.11 Data Structures in R | 321 | (14) |
10.12 Importing Data from a File | 335 | (1) |
10.13 Importing Data from a Delimited Text File | 336 | (1) |
10.14 Control Structures in R | 337 | (4) |
| 341 | (6) |
Index | 347 | |