NoSQL Unit 3
SELVA KUMAR S
B.M.S COLLEGE OF ENGINEERING
▪ NOSQL in CLOUD
▪ Exploring ready-to-use NoSQL databases in the cloud
▪ Leveraging Google AppEngine and its scalable data store
▪ Using Amazon SimpleDB
▪ Google and Amazon have achieved:
▪ High availability
▪ Ability to concurrently service millions of users
▪ Scaling out horizontally among multiple machines
▪ Spread across multiple data centers.
▪ Success stories of large-scale web applications like those from Google and Amazon have
proven the effectiveness of:
▪ Horizontally scaled environments
▪ NoSQL solutions
▪ On-demand availability
▪ The Google App Engine (GAE) provides a sandboxed
deployment environment for applications.
▪ Applications can be written in:
▪ Python
▪ Languages that run on the Java Virtual Machine (JVM)
▪ Google provides developers with a set of rich APIs and an
SDK to build applications for the app engine.
▪ Google App Engine (GAE) is a Platform as a Service (PaaS) cloud computing
platform for developing and hosting web applications in Google-managed data
centers.
▪ Google’s Platform to build web applications on Cloud.
▪ Easy to build.
▪ Easy to maintain.
▪ Easy to scale as the traffic and storage needs grow.
▪ Automatic scaling and load balancing.
▪ Transactional data store model.
▪ Free for up to 1 GB of storage and enough CPU and bandwidth to support 5 million
page views a month; 10 applications per Google account.
▪ Lower total cost of ownership
▪ Rich set of APIs
▪ Fully featured SDK for local development
▪ Ease of Deployment
▪ Java:
• App Engine runs Java apps on a Java 7 virtual machine
(it currently supports Java 6 as well).
• Uses the Java Servlet standard for web applications:
• WAR (Web Application ARchive) directory structure
• Servlet classes
• JavaServer Pages (JSP)
• Static and data files
• Deployment descriptor (web.xml)
• Other configuration files
▪ Python:
• Uses WSGI (Web Server Gateway Interface) standard.
• Python applications can be written using:
• Webapp2 framework
• Django framework
• Any Python code that uses the CGI (Common Gateway Interface)
standard.
▪ PHP (Experimental support):
• Local development servers are available to anyone for developing
and testing local applications.
▪ Google’s Go:
• Go is Google’s open-source programming language and environment.
• Tightly coupled with Google App Engine.
• Applications can be written using App Engine’s Go SDK.
▪ App Engine Datastore:
• NoSQL schema-less, object-based data storage with a query engine and
atomic transactions.
• A data object is called an “entity”; it has a kind (~ table name) and a set of
properties (~ column names).
• Accessed via Java JDO/JPA interfaces and Python datastore interfaces.
▪ Google Cloud Storage:
• RESTful service for storing and querying data.
• Fast, scalable, and highly available solution.
• Provides multiple layers of redundancy; all data is replicated to multiple
data centers.
• Provides different levels of access control.
• HTTP-based APIs.
▪ Use App Engine when:
▪ The app engine provides a SQL-like query language called GQL.
▪ GQL queries on entities and their properties.
▪ Entities manifest as objects in the GAE Python and the Java SDK.
▪ GQL is quite similar to the object-oriented query languages used to
query, filter, and get model instances and their properties.
from google.appengine.ext import db

class Person(db.Model):
    name = db.StringProperty()
    age = db.IntegerProperty()
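As an illustration, a GQL query over a kind like the Person model above might look like this (the kind and property names here are just examples):

```
SELECT * FROM Person WHERE age >= 18 ORDER BY age DESC
```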
address_k = db.Key.from_path('Employee', 'asalieri', 'Address', 1)
address = db.get(address_k)
▪ To update an existing entity:
▪ Modify the attributes of the object
▪ Call its put() method.
▪ The object data overwrites the existing entity.
▪ The entire object is sent to Datastore with every call to
put().
employee_k = db.Key.from_path('Employee', 'asalieri')
employee = db.get(employee_k)
# ...
employee.delete()
▪ Amazon SimpleDB is a ready-to-run database alternative to the app engine
data store.
▪ Amazon SimpleDB is a web service for running queries on structured data in
real time.
▪ Amazon SimpleDB requires no schema, automatically indexes your data and
provides a simple API for storage and access.
▪ This eliminates the administrative burden of data modeling, index
maintenance, and performance tuning.
▪ This service works in close conjunction with Amazon Simple Storage Service
(Amazon S3) and Amazon Elastic Compute Cloud (Amazon EC2), collectively
providing the ability to store, process and query data sets in the cloud.
▪ Domain
▪ Attributes
▪ Item
▪ A domain is like a table.
▪ An attribute is analogous to a field or column.
▪ An item is similar to a database row.
▪ We can change the structure of a domain easily, since it
has no schema.
▪ In addition, attributes are of string type and can contain
multiple values.
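The domain/item/attribute model can be sketched in plain Python as nested dictionaries, with each attribute holding a set of string values. This is an illustrative simulation of the concepts above, not the SimpleDB API; the domain and item names are made up.

```python
# A domain is a dict of items; each item maps attribute names to
# sets of string values (SimpleDB attributes are multi-valued strings).
mydomain = {
    "Item123": {
        "Title": {"The Right Stuff"},
        "Year": {"1983"},
        "Keyword": {"Book", "Hardcover"},
    },
    "Item456": {
        "Title": {"Hackers"},
        "Year": {"2010"},
        "Keyword": {"Paperback"},
    },
}

def select(domain, attr, value):
    """Return item names where any value of `attr` equals `value`."""
    return sorted(
        name for name, attrs in domain.items()
        if value in attrs.get(attr, set())
    )

print(select(mydomain, "Keyword", "Book"))  # ['Item123']
```

Because every attribute value is a string, range comparisons in real SimpleDB are lexicographic, which is why dates and numbers are usually zero-padded.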
▪ SimpleDB can be queried in one of the following ways:
▪ Making RESTful GET and POST requests over HTTP or
HTTPS.
▪ Issuing SQL-like queries from a programming language.
▪ For example, a single REST request can put
three attributes and values for an item
named Item123 into the domain
named MyDomain.
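In outline, such a request is an HTTP GET or POST carrying a PutAttributes action. The parameter scheme below follows the SimpleDB query API; the attribute names and values are illustrative, and the authentication parameters are elided:

```
https://sdb.amazonaws.com/
  ?Action=PutAttributes
  &DomainName=MyDomain
  &ItemName=Item123
  &Attribute.1.Name=Color&Attribute.1.Value=Blue
  &Attribute.2.Name=Size&Attribute.2.Value=Med
  &Attribute.3.Name=Price&Attribute.3.Value=0014.99
  &AWSAccessKeyId=...&Signature=...&Timestamp=...
```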
▪ Simple Queries:
▪ These are the usual queries we perform, as in any other database:
▪ Examples: select * from mydomain where Title = 'The Right Stuff'
select * from mydomain where Year > '1985'
▪ Range Queries:
▪ Amazon SimpleDB enables us to execute more than one comparison against
attribute values within the same predicate.
▪ This is most commonly used to specify a range of values.
▪ select * from mydomain where Year between '1975' and '2008'
▪ select * from mydomain where (Year > '1950' and Year < '1960') or Year like '193%'
or Year = '2007'
▪ Amazon SimpleDB allows you to associate multiple values with a
single attribute.
▪ Each attribute is considered individually against the comparison
conditions defined in the predicate.
▪ Example: select * from mydomain where Keyword = 'Book' and
Keyword = 'Hardcover'
▪ Each value is evaluated individually against the predicate
expression, so this query returns no items: no single Keyword value
can equal both 'Book' and 'Hardcover'.
▪ To retrieve items whose Keyword attribute contains both values, use
the intersection operator (next slide).
▪ Multiple attribute queries work by producing a set of item names
from each predicate and applying the intersection operator.
▪ The intersection operator only returns item names that appear in
both result sets.
▪ select * from mydomain where Keyword = 'Book' intersection
Keyword = 'Hardcover'
▪ The first predicate produces item names 100, 200, and 50. The second
produces 50.
▪ The result is item 50: the intersection operator returns only the item
names that appear in both result sets.
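The intersection semantics can be simulated in plain Python with sets. This is an illustration of the operator's behavior on the example above, not the SimpleDB API:

```python
# Item names matched by each predicate, per the example above.
first_predicate = {"100", "200", "50"}   # Keyword = 'Book'
second_predicate = {"50"}                # Keyword = 'Hardcover'

# The intersection operator keeps only names present in both sets.
result = first_predicate & second_predicate
print(result)  # {'50'}
```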
▪ Amazon performs query optimization on its own and lets
users simply store the data and query it.
▪ The 10 GB domain size limit was created with optimization in
mind.
▪ Users can optimize further by splitting data across
multiple domains.
▪ To improve performance, we can partition our
dataset among multiple domains to parallelize queries
and have them operate on smaller individual datasets.
▪ Scenarios where partitioning helps parallelize queries:
▪ Natural Partitions— The data set naturally partitions along some
dimension. For example, a university catalog might be partitioned
into "Grad", "UnderGrad" and "Staff" domains. Although we could
store all the catalog data in a single domain, partitioning can
improve overall performance.
▪ High Performance Application— This can be useful when the
application requires higher throughput than a single domain can
provide.
▪ Large Data Set—This can be useful when timeout limits are reached
because of the data size or query complexity.
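For the high-performance case, a partitioning scheme can be sketched by hashing items across domains and fanning each query out to every partition. The dictionaries below stand in for SimpleDB domains; all names are illustrative:

```python
import hashlib

NUM_DOMAINS = 4
# Each dict stands in for one SimpleDB domain.
domains = [dict() for _ in range(NUM_DOMAINS)]

def domain_for(item_name):
    """Pick a partition by hashing the item name."""
    digest = hashlib.md5(item_name.encode()).hexdigest()
    return int(digest, 16) % NUM_DOMAINS

def put(item_name, attrs):
    domains[domain_for(item_name)][item_name] = attrs

def select_all(attr, value):
    """Fan the query out to every domain and merge the results."""
    hits = []
    for d in domains:
        hits.extend(name for name, a in d.items() if a.get(attr) == value)
    return sorted(hits)

put("Item1", {"Year": "1983"})
put("Item2", {"Year": "1983"})
put("Item3", {"Year": "2007"})
print(select_all("Year", "1983"))  # ['Item1', 'Item2']
```

In a real deployment the per-domain queries would run in parallel, each over a smaller dataset, which is where the throughput gain comes from.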
▪ If we need aggregation, SimpleDB is not the right solution.
▪ It is built around the school of thought that the DB is just a key value
store, and aggregation should be handled by an aggregation
process that writes the results back to the key value store.
▪ A count() function was recently introduced to the set of functions.
▪ Since a single query returns at most 2,500 records, counts over
larger result sets must be accumulated across multiple paged requests.
▪ We cannot perform joins in SimpleDB: a query can execute against a
single domain only, and this is one of its limitations.
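Counting past the per-query cap can be sketched as a next-token loop that sums page counts on the client. This simulates the pagination pattern only; the function names are made up and this is not the SimpleDB API:

```python
PAGE_LIMIT = 2500

def select_count(records, next_token=0):
    """Return (count_for_this_page, next_token_or_None)."""
    page = records[next_token:next_token + PAGE_LIMIT]
    token = next_token + len(page)
    return len(page), (token if token < len(records) else None)

def total_count(records):
    """Accumulate counts page by page until no token remains."""
    total, token = 0, 0
    while token is not None:
        n, token = select_count(records, token)
        total += n
    return total

print(total_count(list(range(6001))))  # 6001 (three pages: 2500 + 2500 + 1001)
```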
▪ Amazon does not provide enough information about how indexes
are created or managed on SimpleDB, except for the fact that they
are automatically created and managed.
▪ SimpleDB users do not have any control over it.
▪ Following are some of the salient features of indexes:
▪ Domain keys are indexed.
▪ Data are indexed when we enter or modify them in the database.
▪ SimpleDB takes all data as input and indexes all the attributes.
▪ Asynchronous replication is supported.
▪ Amazon SimpleDB creates and manages multiple
geographically distributed replicas of the data
automatically.
▪ Every time we store a data item, multiple replicas are
created in different data centers within the region we
select.
▪ HBase provides a TableInputFormat, to which you provide a table scan; it splits
the rows resulting from the scan according to the regions in which those rows reside.
▪ The map process is passed an ImmutableBytesWritable that contains the row key
for a row and a Result that contains the columns for that row.
▪ The map process outputs its key/value pair based on its business logic, in whatever
form makes sense to your application.
▪ The reduce process builds its results and emits the row key as an
ImmutableBytesWritable and a Put command to store the results back to HBase.
▪ Finally, the results are stored in HBase by the HBase MapReduce infrastructure.
▪ InputFormat
▪ First it splits the input data, then returns a RecordReader instance that
defines the classes of the key and value objects and provides a next() method
used to iterate over each input record.
▪ Mapper
▪ In this step, each record read using the RecordReader is processed using the
map() method.
▪ Reducer
▪ The Reducer stage and class hierarchy is very similar to the Mapper stage. This
time we get the output of a Mapper class and process it after the data has been
shuffled and sorted.
▪ OutputFormat
▪ The final stage is the OutputFormat class, and its job is to persist the data in
various locations. There are specific implementations that allow output to files, or
to HBase tables in the case of the TableOutputFormat class. It uses a
TableRecordWriter to write the data into the specific HBase output table.
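The four stages can be illustrated with a small pure-Python simulation of a word count: split the input, map each record, shuffle and sort by key, reduce, and write the output. The functions below merely stand in for the Hadoop/HBase classes named above; this is not the Hadoop API:

```python
from collections import defaultdict

def splits(records, size=2):
    """"InputFormat" role: cut the input records into fixed-size splits."""
    return [records[i:i + size] for i in range(0, len(records), size)]

def map_fn(row_key, value):
    """"Mapper" role: emit (word, 1) for each word in a row's value."""
    for word in value.split():
        yield word, 1

def reduce_fn(key, values):
    """"Reducer" role: sum the counts for one key after shuffle/sort."""
    return key, sum(values)

def run(records):
    shuffled = defaultdict(list)            # shuffle/sort phase
    for split in splits(records):
        for row_key, value in split:
            for k, v in map_fn(row_key, value):
                shuffled[k].append(v)
    # "OutputFormat" role: write reduced results into a dict
    # (standing in for an HBase output table).
    return dict(reduce_fn(k, vs) for k, vs in sorted(shuffled.items()))

print(run([("r1", "hbase hadoop"), ("r2", "hbase")]))  # {'hadoop': 1, 'hbase': 2}
```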
▪ Apache Mahout is a project of the Apache Software Foundation which is
implemented on top of Apache Hadoop and uses the MapReduce paradigm.
▪ It is used to create implementations of scalable and distributed
machine learning algorithms focused on the areas of
▪ Clustering,
▪ Collaborative filtering and
▪ Classification.
▪ Mahout contains Java libraries for common math algorithms and operations
focused on statistics and linear algebra, as well as primitive Java
collections.
▪ To build a recommender engine, Mahout provides the following components:
• DataModel
• UserSimilarity
• ItemSimilarity
• UserNeighborhood
• Recommender
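How these components fit together can be sketched in plain Python. The function names below mirror the user-based roles (DataModel, UserSimilarity, UserNeighborhood, Recommender) but are illustrative, with a made-up similarity measure; this is not the Mahout API:

```python
# Toy user -> item -> preference data (the "DataModel" role).
data = {
    "alice": {"A": 5.0, "B": 3.0, "C": 4.0},
    "bob":   {"A": 5.0, "B": 3.0, "D": 4.0},
    "carol": {"A": 1.0, "D": 5.0},
}

def user_similarity(u, v):
    """"UserSimilarity" role: closeness of ratings on co-rated items."""
    common = set(data[u]) & set(data[v])
    if not common:
        return 0.0
    diff = sum(abs(data[u][i] - data[v][i]) for i in common)
    return 1.0 / (1.0 + diff)

def neighborhood(user, k=1):
    """"UserNeighborhood" role: the k most similar other users."""
    others = [u for u in data if u != user]
    return sorted(others, key=lambda v: user_similarity(user, v), reverse=True)[:k]

def recommend(user, k=1):
    """"Recommender" role: items the neighbors rated that `user` has not."""
    seen = set(data[user])
    candidates = {}
    for v in neighborhood(user, k):
        for item, pref in data[v].items():
            if item not in seen:
                candidates[item] = max(candidates.get(item, 0.0), pref)
    return sorted(candidates, key=candidates.get, reverse=True)

print(recommend("alice"))  # ['D'] -- bob is alice's nearest neighbor
```

An item-based recommender (the ItemSimilarity component) follows the same shape, but compares columns (items) instead of rows (users).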
DataModel datamodel = new FileDataModel(new File("input file"));
▪ What is HIVE?
▪ Create database mydb;
▪ Show databases;
▪ Use mydb;
▪ Create table customer(custId INT, custName String, mobile INT)
row format delimited
fields terminated by ',';
▪ Load data local inpath 'c:/temp/cust.txt' into table customer;
▪ Select * from customer;
▪ Select count(*) from customer;
▪ Create table out(custId INT, custName String, amount INT, product String)
row format delimited
fields terminated by ',';
▪ Insert overwrite table out
select a.custId, a.custName, b.amount, b.product
from customer a JOIN products b ON a.custId = b.custId;
▪ Select * from out limit 5;
▪ Insert overwrite table out1
select *, case
when age < 30 then 'young'
when age >= 30 and age < 50 then 'middle'
when age >= 50 then 'old'
else 'others'
end as level
from out;
▪ Insert overwrite table out2
select level, sum(amount) from out1 group by level;
hive> SELECT ratings.userid, ratings.rating, ratings.tstamp, movies.title, users.gender
    > FROM ratings JOIN movies ON (ratings.movieid = movies.movieid)
    > JOIN users ON (ratings.userid = users.userid)
    > LIMIT 5;
▪ An explain plan in Hive reveals the MapReduce behind a query.
hive> EXPLAIN SELECT COUNT(*) FROM ratings
    > WHERE movieid = 1 and rating = 5;
OK
ABSTRACT SYNTAX TREE:
(TOK_QUERY (TOK_FROM (TOK_TABREF ratings))
(TOK_INSERT (TOK_DESTINATION (TOK_DIR TOK_TMP_FILE))
(TOK_SELECT (TOK_SELEXPR (TOK_FUNCTIONSTAR COUNT)))
(TOK_WHERE (and (= (TOK_TABLE_OR_COL movieid) 1)
(= (TOK_TABLE_OR_COL rating) 5)))))
STAGE DEPENDENCIES:
Stage-1 is a root stage