Chapter 1
Chapter 1
Artificial
intelligence
Course Outcome
Describe Artificial Intelligence, Machine
learning and deep learning, Data visualization
and Data storytelling.
AI =
itng
• Problem solving
Ability to formulate a problem in a suitable representation, to plan or its solution and
to know when new information is needed and how to obtain it.
cont.
ces o cusom
Robots
– Exploration
– Transportation/Navigation
– Industrial Automation (e.g., Process Control, Assembly Tasks, Executive Tasks)
• Security
• Other (Agriculture, Fishing, Mining, Sanitation, Construction, etc.)
• Military
• Household
• Other (Agriculture, Fishing, Mining, Sanitation, Construction, etc.)
• Military
• Household
• Games
• For emergency services
• For Entertainment
• For providing services to customers
Components of AI
Level Core Compound Coarse components
Induction
Logic Proposition Knowledge Knowledge based
Tautology Reasoning systems, Heuristic
Model Logic Control Search Theorem
Search Proving
Temporal Multi Agent system
Cognitio Learning Belief Desire Co-operation, Co-
n Adaptation Intention ordination AI
Self- Programming
organization
Vision
Functio Memory Utterance Natural Language
nal Perception Speech Processing
Types of AI
Application of AI
Data Visualization
• Definition: The graphical representation of data to
help users understand patterns, trends, and outliers.
• Importance:
Simplifies complex data
Highlights key insights
Aids in decision-making
Enhances storytelling
Why is Data Visualization
Important?
• Key Points:
• Makes data more accessible and understandable
• Helps identify trends and patterns quickly
• Assists in presenting data in a more persuasive and
compelling way
• Supports faster decision-making
• Facilitates better communication across teams
Data Types in Data Visualization
Definition: Data types refer to the kind of data being visualized
and influence the choice of visualization techniques.
1. Categorical Data- Categories with a defined order (e.g., education levels, customer
satisfaction ratings).
2. Numerical Data-Countable numbers (e.g., number of people).
3. Time Series Data-Data collected or recorded at regular time intervals. (eg.line chart
showing sales over a year).
4. Geospatial Data-Data representing locations or geographic features.(eg. Maps)
5. Text Data-Unstructured data involving words or strings(eg. customer feedback)
6. Boolean Data-Binary data indicating two possible outcomes (True/False, Yes/No).
7. Hierarchical Data-Data organized in a tree-like structure with parent-child
relationships(eg. Tree maps)
8. Relational Data-: Data showing relationships between variables or entities.(eg.
network graph showing connections between people or items.)
Types of Example Appropriate Description
Variables Scale
Quantitative/ 1.3, 5.7, 83, Arbitrary numerical values. These
1.5 ×10–2 Continuous
numerical can be integers, rational numbers, or
continuous real numbers.
Quantitative/ Numbers in discrete units. These are
numerical discrete most commonly but not necessarily
1, 2, 3, 4 Discrete integers. For example, the numbers
0.5, 1.0, 1.5 could also be treated as
discrete if intermediate values
cannot exist in the given dataset.
Qualitative/ Categories without order. These are
categorical discrete and uniquecategories that
unordered dog, cat, fish Discrete have no inherent
order. These variables are also called
factors.
Qualitative/ Categories with order. These are
categorical ordered discrete and unique categories with
good, fair, Discrete an order. For example, “fair” always
poor lies between
The quick “good” and “poor.” These variables
Text brown fox None, or are also called
Free-form text.ordered factors.as
Can be treated
Jan. 5 2018,
jumps over the Continuous
discrete Specific days
categorical if and/or
needed.times. Also,
Date or time 8:03am
lazy dog. or generic dates, such as July 4 or Dec.
Discrete 25 (without year).
First 8 rows of a dataset listing daily temperature normal for four
weather stations
Common Visualizations:
• Bar Charts
• Column Charts
• Line Graphs
• Area Charts
Amounts
Histograms and density plots provide the most intuitive visualizations of a distribution
• Boxplots, violin plots, strip charts, and since plots are useful when we want to visualize many
distributions at once.
• Stacked histograms and overlapping densities allow a more in-depth comparison of a smaller
number of distributions.
• Ridgeline plots can be a useful alternative to violin plots and are often useful when visualizing
very large numbers of distributions or changes in distributions over.
Proportions
Comparison:
o Descriptive Analysis: Summarizes data (What happened?).
o Predictive Analysis: Predicts future outcomes (What could happen?).
o Explanatory Analysis: Explains the reasons behind trends and
relationships (Why did this happen?).
Explanatory Analysis
Explanatory analysis, there are a few things to think about and be extremely clear on before
visualizing any data or creating content. First, to whom are you communicating?
Who -
Your audience - The more specific you can be about who your audience is, the better position you
will be in for successful communication.
You - It’s also helpful to think about the relationship that you have with your audience and how you
expect that they will perceive you.
What-
Action - You should always want your audience to know or do something.
Mechanism - How will you communicate to your audience?
HOW
Data becomes supporting evidence of the story you will build and tell.
Machine Learning
Definition: Machine learning is a branch of science that deals with programming the systems in
such a way that they automatically learn and improve with experience.
Types of Machine Learning:
o Supervised Learning: The model is trained using labeled data (input-output pairs).