Introduction To Big Data
Introduction To Big Data
Government Officials
1
1. Introduction to Big Data
What is Big Data
Big Data is a phrase used to mean a massive volume of both
structured and unstructured data that is so large it is difficult to
process using traditional database and software techniques.1
Professionally, Big Data is a field that studies various means of
extracting, analyzing, or dealing with sets of data that are so
complex to be handled by traditional data-processing systems.2
3
4
5V's of Big Data
Velocity: The data is increasing at a very fast rate. It is estimated
that the volume of data will double in every 2 years.
Variety: Now a days data are not stored in rows and column only.
Data is structured as well as unstructured.
Volume: The amount of data which we deal with is of very large size
of Peta bytes (10^15 bytes)
5
Structured & Unstructured Data
6
Structured Data
. Source : https://www.bigdataframework.org/data-types-structured-vs-unstructured-
data/#:~:text=Structured%20data%20is%20data%20that,Excel%20files%20or%20SQL%20databases.
7
Structured Data
8
Unstructured Data
➢ Unstructured data is information that either does not have a
predefined data model or is not organized in a pre-defined manner.
9
10
Structured & Unstructured Data
11
Structured & Unstructured Data
12
What is Artificial Intelligence
Source :https://en.wikipedia.org/wiki/Artificial_intelligence
13
What is Artificial Intelligence
14
What Artificial Intelligence can do
Self-Driving Car
Image source: https://towardsdatascience.com/what-is-big-data-and-what-artificial-intelligence-can-do-d3f1d14b84ce:
15
How Big Data and AI are connected
➢ There is a massive amount of data online and offline ,ranging from people,
their routine, their preferences, etc. to non-living things, their properties,
their uses, etc.
➢ This huge stockpile of data, when properly harnessed, can give valuable
insights and business analytics to the sector/ industry where the data set
belongs. Thus, artificially intelligent algorithms are written to benefit from
large and complex data
Source :https://www.analyticsinsight.net/using-artificial-intelligence-in-big-data/
16
How Big Data and AI are connected
17
How Big Data and AI are connected
Source :https://www.analyticsinsight.net/using-artificial-intelligence-in-big-data/
18
How Big Data and AI are connected
19
How Big Data and AI are connected
20
How Big Data and AI are connected
AI helps farmers to count and monitor their produce through every growth
stage till maturity.
AI can identify weak points or defects long before they spread to other areas
of these huge acres of land.
For Example:, satellite systems or drones are used by the AI for viewing and
extracting the data.
Source :https://www.analyticsinsight.net/using-artificial-intelligence-in-big-data/
21
How Big Data and AI are connected
22
How Big Data and AI are connected
23
How Big Data and AI are connected
The Securities Exchange Commission (SEC) is using network analytics and
natural language processing to foil illegal trading activities in financial
markets.
They are also used for early fraud warning, card fraud detection, archival
and analysis of audit trails, reporting enterprise credit, customer data
transformation, etc.
Source :https://www.analyticsinsight.net/using-artificial-intelligence-in-big-data/
24
How Big Data and AI are connected
25
Introduction to Search Algorithms in AI
Source :https://medium.com/datadriveninvestor/searching-algorithms-for-artificial-intelligence-
85d58a8e4a42#:~:text=Essentially%2C%20artificial%20intelligence%20can%20find,blind%2C%20informed%2C%20and%20optimal.
26
Search Algorithm Terminologies:
27
Search Algorithm Terminologies:
3. Actions: It gives the description of all the available actions to the agent.
6. Solution: It is an action sequence which leads from the start node to the
goal node.
7. Optimal Solution: If a solution has the lowest cost among all solutions.
Source :https://www.javatpoint.com/search-algorithms-in-ai.
28
Properties of Search Algorithms:
Following are the four essential properties of search algorithms to compare the
efficiency of these algorithms:
❖ Space Complexity: It is the maximum storage space required at any point during
the search, as the complexity of the problem.
Source :https://www.javatpoint.com/search-algorithms-in-ai.
29
Types of Search Algorithms
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
30
Types of Search Algorithms
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
31
Informed Search Algorithms
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
32
Informed Search Algorithms
➢ A* Search
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
33
Uninformed Search Algorithms
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
34
Uninformed Search Algorithms
➢ Breadth-first Search
➢ Depth-first Search
➢ Depth-limited Search
➢ Iterative deepening depth-first search
➢ Uniform cost search
➢ Bidirectional Search
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
35
Uninformed Search Algorithms
Breadth-First Search(BFS)
➢ In breadth first search, the tree or the graph is traversed breadth wise i.e. it
starts from a node called search key and then explores all the neighboring nodes
of the search key at that depth-first and then moves to the next level nodes.
➢ It is implemented using the queue data structure that works on the concept of
first in first out (FIFO).
➢ The time complexity of BFS is O(bd) where b (branching factor) is the average
number of child nodes for any given node and d stands for depth.
➢ The disadvantage of this algorithm is that it requires a lot of memory space
because it has to store each level of nodes for the next one. It may also check
duplicate nodes.
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
36
Uninformed Search Algorithms
Breadth-First Search(BFS)
Example: If search starts from root node “S” to reach goal node “K” then it will traverse
S---> A--->B---->C--->D---->G--->H--->E---->F---->I---->K .
It traverses level wise i.e. explores the shallowest node first.
37
Uninformed Search Algorithms
Depth-First Search(DFS)
➢ In depth first search, the tree or the graph is traversed depth-wise i.e. it starts
from a node called search key and nodes along the branch then backtracks.
➢ It is implemented using a stack data structure that works on the concept of last in
first out (LIFO).
➢ The time complexity of DFS is O(nm ) where n stands for number of nodes and m
stands for maximum depth of any node. It stores nodes linearly hence less space
requirement.
➢ The major disadvantage is that this algorithm may go in an infinite loop.
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
38
Uninformed Search Algorithms
Depth-First Search(DFS)
Example: If search starts from root node “S” to reach goal node “K” then it will traverse
S---> A--->B---->D---->E--->C--->G
It traverses depth wise i.e. explores the deepest node first.
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
39
Uninformed Search Algorithms
Source:https://www.javatpoint.com/ai-uninformed-search-algorithms.
40
Uninformed Search Algorithms
Disadvantages:
Source:https://www.javatpoint.com/ai-uninformed-search-algorithms.
41
Uninformed Search Algorithms
Example: If search starts from root node “S” to reach goal node “J” then it will traverse
S---> A--->C---->D---->B--->I--->J
The pre-determined limit is set to level 2
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
42
Uninformed Search Algorithms
43
Uninformed Search Algorithms
Advantages:
➢ Uniform cost search is optimal because at every state the path with the least cost
is chosen.
Disadvantages:
➢ It does not care about the number of steps involve in searching and only
concerned about path cost. Due to which this algorithm may be stuck in an infinite
loop.
Source: https://www.javatpoint.com/ai-uninformed-search-algorithms.
44
Uninformed Search Algorithms
Uniform-cost Search Algorithm:
Example: If search starts from root node “S” to reach goal node “G” then it will traverse
S---> A--->D---->G
The algorithm choses the path with lowest cost first.
Source :https://www.educba.com/search-algorithms-in-ai/?source=leftnav.
45
Uninformed Search Algorithms
➢ This Search algorithm combines the benefits of Breadth-first search's fast search
and depth-first search's memory efficiency.
➢ The iterative search algorithm is useful uninformed search when search space is
large, and depth of goal node is unknown.
Source:https://www.javatpoint.com/ai-uninformed-search-algorithms.
46
Uninformed Search Algorithms
Advantages:
➢ It combines the benefits of BFS and DFS search algorithm in terms of fast search
and memory efficiency.
Disadvantages:
➢ The main drawback of IDDFS is that it repeats all the work of the previous phase.
Source:https://www.javatpoint.com/ai-uninformed-search-algorithms.
47
Uninformed Search Algorithms
Iterative deepening depth-first Search:
Example: If search starts from root node “S” to reach goal node “G” then it will traverse
1'st Iteration-----> A
2'nd Iteration----> A, B, C
3'rd Iteration------>A, B, D, E, C, F, G
4'th Iteration------>A, B, D, H, I, E, C, F, K, G
In the fourth iteration, the algorithm will find the goal node.
48
Uninformed Search Algorithms
Source:https://www.javatpoint.com/ai-uninformed-search-algorithms.
49
Uninformed Search Algorithms
Advantages:
Disadvantages:
Source:https://www.javatpoint.com/ai-uninformed-search-algorithms.
50
Uninformed Search Algorithms
Bidirectional Search Algorithm:
In the below search tree, bidirectional search algorithm is applied. This algorithm divides one
graph/tree into two sub-graphs. It starts traversing from node 1 in the forward direction and
starts from goal node 16 in the backward direction.
51
Agents in Artificial Intelligence
52
Agents in Artificial Intelligence
What is an Agent?
An agent can be anything that perceive its environment through sensors and
act upon that environment through actuators. An Agent runs in the cycle of
perceiving, thinking, and acting. An agent can be:
• Human-Agent: A human agent has eyes, ears, and other organs which
work for sensors and hand, legs, vocal tract work for actuators.
• Robotic Agent: A robotic agent can have cameras, infrared range finder,
NLP for sensors and various motors for actuators.
• Software Agent: Software agent can have keystrokes, file contents as
sensory input and act on those inputs and display output on the screen.
53
Agents in Artificial Intelligence
54
Agents in Artificial Intelligence
Terminologies
Sensor: Sensor is a device which detects the change in the environment and
sends the information to other electronic devices. An agent observes its
environment through sensors.
Effectors: Effectors are the devices which affect the environment. Effectors
can be legs, wheels, arms, fingers, wings, fins, and display screen.
55
Agents in Artificial Intelligence
56
Intelligent Agents
An intelligent agent is an autonomous entity which act upon an
environment using sensors and actuators for achieving goals. An
intelligent agent may learn from the environment to achieve their
goals. A thermostat is an example of an intelligent agent.
57
Rational Agent
A rational agent is an agent which has clear preference, models
uncertainty, and acts in a way to maximize its performance measure with
all possible actions.
58
Rationality
The rationality of an agent is measured by its performance measure.
Rationality can be judged on the basis of following points:
Note: Rationality differs from Omniscience because an Omniscient agent knows the actual
outcome of its action and act accordingly, which is not possible in reality.
59
Structure of an AI Agent
The task of AI is to design an agent program which implements the agent
function. The structure of an intelligent agent is a combination of architecture and
agent program. It can be viewed as:
Following are the main three terms involved in the structure of an AI agent:
f:P* → A
60
PEAS Representation
PEAS is a type of model on which an AI agent works upon. When we
define an AI agent or rational agent, then we can group its properties
under PEAS representation model. It is made up of four words:
P: Performance measure
E: Environment
A: Actuators
S: Sensors
Here performance measure is the objective for the success of an agent's behavior.
61
Example of Agents with their PEAS
representation
Agent Performance Environment Actuators Sensors
measure
1. Medical •Healthy •Patient •Tests Keyboard
Diagnose patient •Hospital •Treatments (Entry of
•Minimized cost •Staff symptoms)
62