Unit 5 Lecture 4
Unit 5 Lecture 4
1. Unstructured Database:
• It is an information that either does not have a predefined data model or which is not organized in a pre-defined
manner.
• Unstructured information is typically text-heavy, but mat contain data such as dates, numbers and facts as well
examples include e-mail message, word processing, documents, vedios, photos, audio files, presentation web pages.
Semi-structured data is the data which does not conforms to a data model but has some structure. It lacks a fixed
or rigid schema. It is the data that does not reside in a rational database but that have some organizational
properties that make it easier to analyse. With some process, we can store them in the relational database
3. Structured Database:
A data structure is a kind of repository that organizes information for that purpose. In a database, for example, each field
is discrete and its information can be retrieved either separately or along with data from other fields, in a variety of
combinations. The power of the database is its ability to make data comprehensive, so that it yields useful information. A
database query language, such as SQL (standard query language), allows a database administrator to interact with the
database.
Structured data contrasts with unstructured and semi-structured data. The three can be considered to exist on a
continuum, with unstructured data being the least formatted and structured data being the most formatted. Data is
increasingly amenable to processing as it is increasingly structured.
Dynamo Table
In DynamoDB, tables, items, and attributes are the core components that you work with. A table is a collection
of items, and each item is a collection of attributes. DynamoDB uses primary keys to uniquely identify each item in
a table and secondary indexes to provide more querying flexibility. You can use DynamoDB Streams to capture
data modification events in DynamoDB tables.
3. Attributes – Each item is composed of one or more attributes. An attribute is a fundamental data
element, something that does not need to be broken down any further. For example, an item in
a People table contains attributes called PersonID, LastName, FirstName, and so on. For
a Department table, an item might have attributes such as DepartmentID, Name, Manager, and so on.
Attributes in DynamoDB are similar in many ways to fields or columns in other database systems.
• The following diagram shows a table named People with some example items and attributes.
Read Request Units and Write Request Units
For on-demand mode tables, you don't need to specify how much read and write throughput you expect your
application to perform. DynamoDB charges you for the reads and writes that your application performs on your tables
in terms of read request units and write request units.
•One read request unit represents one strongly consistent read request, or two eventually consistent read requests, for
an item up to 4 KB in size. Two read request units represent one transactional read for items up to 4 KB. If you need
to read an item that is larger than 4 KB, DynamoDB needs additional read request units. The total number of read
request units required depends on the item size, and whether you want an eventually consistent or strongly consistent
read. For example, if your item size is 8 KB, you require 2 read request units to sustain one strongly consistent read, 1
read request unit if you choose eventually consistent reads, or 4 read request units for a transactional read request.
•One write request unit represents one write for an item up to 1 KB in size. If you need to write an item
that is larger than 1 KB, DynamoDB needs to consume additional write request units. Transactional write
requests require 2 write request units to perform one write for items up to 1 KB. The total number of write
request units required depends on the item size. For example, if your item size is 2 KB, you require 2 write
request units to sustain one write request or 4 write request units for a transactional write request.
DynamoDB pricing
1. Managed service − Amazon DynamoDB is a managed service. There is no need to hire experts to manage
NoSQL installation. Developers need not worry about setting up, configuring a distributed database cluster,
managing ongoing cluster operations, etc. It handles all the complexities of scaling, partitions and re-partitions
data over more machine resources to meet I/O performance requirements.
2. Scalable − Amazon DynamoDB is designed to scale. There is no need to worry about predefined limits to the
amount of data each table can store. Any amount of data can be stored and retrieved. DynamoDB will spread
automatically with the amount of data stored as the table grows.
3. Fast − Amazon DynamoDB provides high throughput at very low latency. As datasets grow, latencies remain
stable due to the distributed nature of DynamoDB's data placement and request routing algorithms.
Benefits of Amazon DynamoDB
4. Durable and highly available − Amazon DynamoDB replicates data over at least 3 different data centers’
results. The system operates and serves data even under various failure conditions.
5. Flexible: Amazon DynamoDB allows creation of dynamic tables, i.e. the table can have any number of
attributes, including multi-valued attributes.
6. Cost-effective: Payment is for what we use without any minimum charges. Its pricing structure is simple
and easy to calculate.
Important Questions