Index: Mlbase Component, 100
Index: Mlbase Component, 100
A
Big Data revolution
cloud computing, 22
Aerospike, 91, 217 competing definitions, 21
Aerospike query language (AQL), 218 industrial revolution, 22
AJAX. See Asynchronous JavaScript and IoT, 22
XML (AJAX) social networks and
Alternative persistence model, 92 smartphones, 22
Amazon Binary JSON (BSON), 157
ACID RDBMS, 46 Blockchain, 212
Dynamo, 14, 45–46 Bloom filters, 161
DynamoDB, 219 Boolean bit logic, 214
hashing, 47–48 B-tree index structure, 158–159
key–value stores, 51 Business intelligence (BI) practices, 193
NWR notation, 49–50
SOA, 45
Amazon Web Services (AWS), 15 C
Apache Cassandra , 218. See also Cassandra Cache-less architecture, 92
Apache HBase, 220 CAP theorem
Apache Kudu, 211. See also Hbase partition tolerance, 44
Append only file (AOF), 94 RAC solution, 44
Asynchronous JavaScript and Cascading Style Sheets (CSS), 54
XML (AJAX), 15 Cassandra, 211
Atomic, Consistent, Independent, and Durable cluster node, 120
(ACID) transactions, 9–10, 128 consistent hashing, 120–121
AWS. See Amazon Web Services (AWS) data model, 153, 155
gossip, 119
node adding, 122
B
order-preserving
Berkeley analytics data stack and spark partitioners, 124
AMPlab, 99 replicas, 124–125
BDAS, 100 snitches, 126
DAG, 101 virtual nodes, 122–123
Hadoop, 99–100 Cassandra consistency
JDBC-compliant database, 101 hinted handoff, 136–137
MapReduce, 99 LWT (see Lightweight
MLBase component, 100 transactions (LWT))
RDD, 101 read consistency, 135
spark architecture, 101 read repair, 136–137
spark processing replication factor, 134
elements, 102 timestamps and granularity, 137
spark SQL, 100 vector clocks, 138–140
spark streaming, 100 write consistency, 134–135
229
■ index
230
■ Index
232
■ Index
233
■ index
P
INGRES, 10
mainframe computer, 10
Pig Latin, 36 OODBMS, 11–13
PropertyFileSnitch, 126 OOP, 11–13
QUEL, 10
Q
relational database model (see Relational
theory)
Quantum query language (QQL), 214 SQL/DS, 10
Quantum search, 213 SQL language, 10
Quantum transactions, 213 transaction models, 9–10
QUEL, 10 Secondary index
B-tree indexes, 163
R
DIY, 163
global and local, 165
RAC. See Real Application Clusters (RAC) implementations, 166
RackInferringSnitch, 126 nonrelational operational
RDD. See Resilient distributed datasets (RDD) database systems, 163
RDF. See Resource Description Framework (RDF) Service-oriented architecture (SOA), 45
Real Application Clusters (RAC), 43, 109 Set-based query language (SQL)
Redis. See Remote dictionary server (Redis) advantages, 183
Relational storage model ANSI and ISO standard, 168
B-tree index structure, 158–159 Apache Drill framework, 188–190
Couchbase’s HB+-Trie, 160 Hive, 183–184
database architecture, 158 Impala, 184
index blocks, 159 N1QL, 185–187
RDBMS architectural pattern, 158 NoSQL, 190
Tokutek’s fractal tree index, 160 spark, 185
Relational theory types, 168
concepts, 8 Shard chunk, 113
normalized and un-normalized data, 8–9 Sharding
Remote dictionary server (Redis), 224 ACID transactions, 14
AOF, 94 drawbacks, 43
architectural components, 94 Facebook, 14
architecture, 94 memcached/replication architecture, 42
EMC, 93 Shared-disk database architecture, 109
key-value store, 95 Simple Oracle Data Access (SODA), 204
key-value store architecture, 93 Simple Oracle Document Access (SODA), 201
memory database system, 95 Single-level cell (SLC), 88
MongoDB, 95 SOA. See Service-oriented architecture (SOA)
snapshot, 93 SODA REST query, 205
virtual memory system, 94 Solid state disk (SSD)
Replica sets, 113–114 Aerospike, 91
Replication factor, 124 algorithms, 89
Resilient distributed datasets (RDD), 101 battery-backed RAM device, 88
Resource Description Framework (RDF), 68–69 DDR RAM, 88
Riak, 225 economics, 89–90
Row Level Versioned, 81 enabled databases, 90
NAND flash, 88
S
performance characteristics, 89
SLC and MLC, 88
SAP Hana, 95–96, 225 write amplification, 89
SCN. See System change number (SCN) SPARQL Protocol, 69
Second database revolution Splice Machine layers, 210
client-server computing, 11 SQL/DS, 10
IBM, 7 SSTables, 160
234
■ Index
235