0% found this document useful (0 votes)
11 views16 pages

417 AI Handbook Class9 Acquiring Processing Interpreting

Uploaded by

ankithalder135
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views16 pages

417 AI Handbook Class9 Acquiring Processing Interpreting

Uploaded by

ankithalder135
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

2.2.1.

Types of data
Artificial Intelligence is crucial, with data serving as its foundation. We come across different types of
information every day. Some common types of data include:

Textual Data (Qualitative Data) Numeric Data (Quantitative Data)

● It is made up of words and phrases ● It is made up of numbers


● It is used for Natural Language Processing (NLP) ● It is used for Statistical Data
● Search queries on the internet are an example ● Any measurements, readings, or values
of textual data would count as numeric data
● Example: “Which is a good park nearby?” ● Example: Cricket Score, Restaurant Bill

Numeric Data is further classified as:


● Continuous data is numeric data that is continuous. E.g., height, weight, temperature, voltage
● Discrete data is numeric data that contains only whole numbers and cannot be fractional
E.g. the number of students in the class – it can only be a whole number, not in decimals
Types of Data used in three domains of AI:

2.2.2 Data Acquisition/Acquiring Data

Data Acquisition, also known as acquiring data, refers to the procedure of gathering data. This involves
searching for datasets suitable for training AI models. The process typically comprises three key steps:
Acquiring Data – Sample Data Discovery

Let’s say we want to collect data for making a CV model


for a self-driving car

● We will require pictures of roads and the objects on


roads
● We can search and download this data from the
internet
● This process is called data discovery

Acquiring Data – Sample Data Augmentation

● Data augmentation means increasing the amount of data


by adding copies of existing data with small changes
● The image given here does not change, but we get data
on the image by changing different parameters like color
and brightness
● New data is added by slightly changing the existing data

Acquiring Data – Sample Data Generation

● Data generation refers to generating or


recording data using sensors
● Recording temperature readings of a building
is an example of data generation
● Recorded data is stored in a computer in a
suitable form

Sources of Data
Various Sources for Acquiring Data:
● Primary Data Sources — Some of the sources for primary data include surveys, interviews,
experiments, etc. The data generated from the experiment is an example of primary data.
Here is an excel sheet showing the data collected for students of a class.
● Secondary Data Sources—Secondary data collection obtains information from external sources,
rather than generating it personally. Some sources for secondary data collection include:

2.2.3 Best Practices for Acquiring Data


Checklist of factors that make data good or bad

Data acquisition from websites

Ethical concerns in data acquisition


While gathering data and choosing datasets, certain ethical issues can be addressed before they occur
2.2.4 Features of Data and Data Preprocessing
Usability of Data
There are three primary factors determining the usability of data:
1. Structure- Defines how data is stored.

2. Cleanliness- Clean data is free from duplicates, missing values, outliers, and other anomalies that
may affect its reliability and usefulness for analysis. In this particular example, duplicate values are
removed after cleaning the data.

3. Accuracy- Accuracy indicates how well the data matches real-world values, ensuring reliability.
Accurate data closely reflects actual values without errors, enhancing the quality and
trustworthiness of the dataset.
In this particular example, we are comparing data gathered from measuring the length of a small box
in centimeters.

Kaggle assigns a usability score to the data sets that are present on the website based on scores
given by the users of that data.

What kind of data is more usable, according to you?


_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________
If we have a lot of data which is not clean, is it good for AI?
_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________________________________
_____________________________________________________________

Features of Data
Data features are the characteristics or properties of the data. They describe each piece of information in
a dataset. For example, in a table of student records, features could include things like the student's name,
age, or grade. In a photo dataset, features might be the colors present in each image. These features help
us understand and analyze the data.

In AI models, we need two types of features: independent and dependent.


Independent features are the input to the model—they're the information we provide to make
predictions.

Dependent features, on the other hand, are the outputs or results of the model—they're what we're
trying to predict.
2.2.5 Data Processing and Data Interpretation
Data processing and interpretation have become very important in today’s world
Can you answer this?
▪ Niki has 7 candies, and Ruchi has 4 candies
▪ How many candies do Niki and Ruchi have in total?
▪ We can answer this question using data processing
▪ Who should get more candies so that both Niki and Ruchi
have an equal number of candies?
▪ How many candies should they get?
▪ We can answer this question using data interpretation

Data Processing
▪ Data processing helps computers understand raw data.
▪ Use of computers to perform different operations on data is
included under data processing.

Data Interpretation
▪ It is the process of making sense out of data that has been
processed.
▪ The interpretation of data helps us answer critical questions
using data.

Understanding some keywords related to Data More than 60% of


Students would be
Acquire Data- Acquiring data is to collect data from various data
interested in Sports!
sources.

Data Processing- After raw data is collected, data is processed to derive meaningful
information from it.
Data Analysis – Data analysis is to examine each component of the data
in order to draw conclusions.

Data Interpretation – It is to be able to explain what these


findings/conclusions mean in a given context.

Data Presentation- In this step, you select, organize, and group ideas and
evidence in a logical way.

Acquire Process Analyze Interpret Present

Methods of Data Interpretation

How to interpret Data?

Based on the two types of data, there are two ways to interpret data-
● Quantitative Data Interpretation
● Qualitative Data Interpretation
Qualitative Data Interpretation
● Qualitative data tells us about the emotions
and feelings of people

● Qualitative data interpretation is focused on


insights and motivations of people

Reviews by customers –
Pizza Qualitative data Jim and his
toppings friends are regular customers
here are so Veg Veg farmhouse
tasty! farmhouse pizza is a popular choice
pizza is the
best here!

Data Collection Methods – Qualitative Data Interpretation

Record keeping: This method uses existing reliable documents and other similar sources of information
as the data source. It is similar to going to a library.
Observation: In this method, the participant – their behavior and emotions – are observed carefully
Case Studies: In this method, data is collected from case studies.
Focus groups: In this method, data is collected from a group discussion on relevant topic.
Longitudinal Studies: This data collection method is performed on the same data source repeatedly over
an extended period.
One-to-One Interviews: In this method, data is collected using a one-to-one interview.

Activity – Trend Analysis

Purpose:
▪ This activity will engage youth with longitudinal studies – a study conducted over a considerable
amount of time to identify trends and patterns
▪ The ability to identify trends and patterns in datasets allows us to make informed decisions
about different tasks in our lives

Activity Guidelines
Let’s do a small activity based on Identifying trends.
● Visit the link: https://trends.google.com/trends/?geo=IN (Google Trends)
● Explore the website
● Check what is trending in the year 2022 – Global
▪ Make a list of trending sports (top 5)
▪ Make a list of trending movies (top 5)
● Check what is trending globally in the year 2022

List of trending athletes (top 5) List of trending movies (top 5)

5 Steps to Qualitative Data Analysis


1. Collect Data
2. Organize
3. Set a code to the Data Collected
4. Analyze your data
5. Reporting

Quantitative Data Interpretation

Counter – Number of
Cumulative Grade Point website visit
Average (CGPA)
Cumulative Grade Point
Average (CGPA)

The average height


Recording the
of students will be
height of students
important to build
in a class
suitable tables and
chairs for students

▪ Quantitative data interpretation is made on numerical data


▪ It helps us answer questions like “when,” “how many,” and “how often”
▪ For example – (how many) numbers of likes on the Instagram post
Data Collection Methods -Quantitative Data Interpretation
Interviews: Quantitative interviews play a key role in collecting information.
Polls: A poll is a type of survey that asks simple questions to respondents. Polls are usually limited to one
question.
Observations: Quantitative data can be collected through observations in a particular time period
Longitudinal Studies: A type of study conducted over a long time
Survey: Surveys can be conducted for a large number of people to collect quantitative data.

4 Steps to Quantitative Data Analysis


1. Relate measurement scales with variables
2. Connect descriptive statistics with data
3. Decide a measurement scale
4. Represent data in an appropriate format

Let’s summarize Qualitative and Quantitative data interpretation


Qualitative & Quantitative Data Interpretation

Qualitative Data Interpretation Quantitative Data Interpretation


Categorical Numerical
Provides insights into feelings and Provides insights into quantity
emotions
Answers how and why Answers when, how many or how often
Methods – Interviews, Focus Groups Methods – Assessment, Tests, Polls,
Surveys
Example question – Why do students like Example question – How many students
attending online classes? like attending online classes?

Types of Data Interpretation


There are three ways in which data can be presented:

Data Interpretation Types

Textual Tabular Graphical

Textual DI
▪ The data is mentioned in the text form, usually in a paragraph.
▪ Used when the data is not large and can be easily comprehended by reading.
▪ Textual presentation is not suitable for large data.
▪ Example:
In the Science Olympiad class of 45 Students, 3 students obtained the More than 60% of
perfect score of 50. 10 students got a score of 45 and above, 15 students scored more
students got a score of 40 and above, 8 students got a score of 30 and than 80% Marks in
above, 6 students got a score of 20 and above and 3 got 19 and below. Olympiad!
Tabular DI
▪ Data is represented systematically in the form of rows and columns.
▪ Title of the Table (Item of Expenditure) contains the description of the table content.
▪ Column Headings (Year; Salary; Fuel and Transport; Bonus; Interest on Loans; Taxes) contains the
description of information contained in columns.

Graphical DI
Bar Graphs
In a Bar Graph, data is represented using vertical and horizontal bars.

Pie Charts
▪ Pie Charts have the shape of a pie and each slice of the pie represents the portion of the entire
pie allocated to each category
▪ It is a circular chart divided into various sections (think of a cake cut into slices)
▪ Each section of the pie chart is proportional to the corresponding value
Distribution of Math Score

Perfect Score(=50)
7%7% 45 and Above(>=45)
13% 22%
18% 40 and Above(>=40)

33% Between 30-39


Between 20-29
19 and Below(<=19)

Line Graphs
▪ A line graph is created by connecting various data points.
▪ It shows the change in quantity over time.
Activity: Visualize and Interpret Data

Duration: 40 Minutes
Purpose
▪ This activity will engage youth with data visualization and interpretation
▪ visualization makes it easier for us to extract useful information contained in the dataset

Activity Guidelines
● The table shows the details of a class consisting of 50 students and their scores ranging in the listed
categories for 5 subjects: Math, Physics, Chemistry, Social Science, and Biology

Student Performance
Marks Range Math Physics Chemistry Social Science Biology
Less than 20 6 3 1 0 0
Between 20-29 14 11 9 15 8
Between 30-40 17 20 21 22 19
Between 41-44 8 10 14 10 16
45 and Above 5 6 5 3 7
Total Students 50 50 50 50 50

● Copy the table in an Excel sheet and create the following visualizations for the given data:
▪ Make a bar graph showing the marks distribution for all 5 subjects
▪ Make a pie chart showing the marks distribution for Physics
▪ Make a line chart displaying the marks distribution for Chemistry
Importance of Data Interpretation

Informed Decision Making A decision is only as good as the knowledge it is based on

Remember the height example:


Since the average height of students is known, school can custom design the chairs and tables
according to the requirement of the class
Reduced Cost Identifying needs can lead to reduction in cost

Remember the restaurant example:


Restaurant owner could decide to drop/modify some dishes of the menu which aren’t popular or
have got bad reviews

Identifying Needs We can identify needs of people by data interpretation

Remember the restaurant example:


Veg Farmhouse Pizza is a popular choice among age group 8-10

Quiz Time: AI Quiz


Session Preparation
Logistics: For a class of 40 Students [Pair Activity]
Materials Required:
ITEMS QUANTITY
COMPUTERS 20

Brief:
The following are questions for the quiz. You can either go for a Pen/Paper Quiz or you can visit any
open-sourced, free, online portal; one of which is Kahoot, and create your quiz there. For Kahoot: Go to
https://kahoot.com/ and create your login ID on it. Then, add your own kahoot in it simply by adding all
the given questions into it. Once created, you can initiate the quiz from your ID and students can
participate in it by putting in the Game pin.

Quiz Questions
1. What are the basic building blocks of qualitative data?
a. Individuals
b. Units
c. Categories
d. Measurements
2. Which among these is not a type of data interpretation?
a. Textual
b. Tabular
c. Graphical
d. Raw data
3. Quantitative data is numerical in nature.
a. True
b. False
4. A Bar Graph is an example of?
a. Textual
b. Tabular
c. Graphical
d. None of the above
5. _____________ relates to the manipulation of data to produce meaningful insights.
a. Data Processing
b. Data Interpretation
c. Data Analysis
d. Data Presentation

2.3 Project Interactive Data Dashboard & Presentation


Lesson Title: Project Interactive Data Dashboard and Approach: Session + Activity
Presentation
Summary: In this module, you will reflect on your learnings from the previous units till learnt.
You will further engage in an activity on data collection and data visualization using the visual
data analytics platform, Tableau.
Learning Objectives
● Demonstrating comprehension and retention of learnings from previous units
● Apply acquired knowledge to select and employ appropriate data visualization methods

Learning Outcomes
● Summarize the topics learned previously
● Recognize the importance of data visualization
● Discover different methods of data visualization

Pre-requisites:
● Meet the learning outcomes of units till learnt
● Basic computer skills.
Key-concepts
● Mapping AI Project Cycle.
● Data Literacy.
● Sources of data.
● Data acquisition.
● Usability of data.
● Data processing and interpretation.
● Data visualization using Tableau.

Icebreaker Activity

Tic-Tac-Toe

Purpose:

You might also like