0% found this document useful (0 votes)

10 views7 pages

Glossary

The document is a glossary of terms related to machine learning and data processing, defining key concepts such as aggregation, automation, and classification. It includes explanations of various terms like false positives, data labeling, and supervised learning, providing insights into how machine learning models operate and are evaluated. This resource serves as a reference for understanding fundamental terminology in the field of machine learning.

Uploaded by

Yueyao Wang

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views7 pages

Glossary

Uploaded by

Yueyao Wang

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

12/10/2019 Glossary

Glossary
Aggregation
When you combine data from many different sources or times in order to lower the possibility of a
single individual being identified.

Augment
When a machine, software, or function extends a person’s abilities or potential while maintaining
their agency.

Automate
When a machine, software, or function performs a task without user involvement.

Binary Classification
Binary classification: when an ML model predicts if an example falls into one category or another
based on a set of features.

Classification
When a machine learning model identifies an object. In response to an identification question, the
simplest classification is “yes” or “no”. For example, if a model was shown a picture of a cat, it
could classify it as “Cat”, or “Not a cat”. More complex classifications are sorting items into one of
several groups.

Confidence Level, Model Confidence

The confidence level for a model is a statistical measure of how certain a prediction or outcome is.

Context Errors
Situations when the product output doesn’t make sense in the user’s current context. Often, this
output serves
Google is perceived
cookiesas
to irrelevant by the
analyse traffic user.
to this site. Information about your use of our site is shared with Google
for that purpose. See details OK, got it

https://pair.withgoogle.com/glossary/ 1/7
12/10/2019 Glossary

Counterfactuals
Rationale for why something is classified as not within the given class. Usually in the form of a
statement of how the world would have to be different for a desirable outcome to occur.

Data Collection and Labeling

How product teams get the data they need and apply meaningful labels to it. For example:
acquiring millions of images of cats and dogs correctly labeled as “cat” or “dog”.

Data Distribution
Shows frequency of specific values within a dataset. For example, your could find that your data
includes a high number of certain values, and lower numbers of others. Usually follows “normal”
distribution, or a Gaussian curve.

Data Examples
Lines in a dataset or specific pieces of data, such as a photo of a shoe or run route.

Data Features
An individual measurable property or characteristic of an observable entity. Feature should be
informative, discriminating, and independent.

Data Labels
Human-added descriptions for a piece of data, or example.

Explicit Data Collection

When you request information from users outright, like in feedback forms.

Explicit Feedback
Information solicited from users from within your app. For example: rating systems, review
requests, forms, or surveys.

False Negatives
Google serves cookies to analyse traffic to this site. Information about your use of our site is shared with Google
for that purpose. See details OK, got it

https://pair.withgoogle.com/glossary/ 2/7
12/10/2019 Glossary

When the ML algorithm classifies an object as not in a certain category, when it actually is. For
example, if it was searching for sneakers, and it didn’t return several true images of sneakers.

False Positives
When the machine learning algorithm classifies an object as belonging to a certain category, but it
is not in that category. For example, if the algorithm incorrectly identified a sneaker as a llama.

Features
Distinct data sources or machine learning calculations that influence a prediction or outcome.

Folk Theories
Invented (and usually false) ideas of how a product works based on existing mental models and
assumptions.

General System Explanations

Descriptions of general system functionality, i.e. how and why it uses inputs to generate outputs.

Heuristic-Based
Based on static if-then functions, or rules based on desired situation-result pairs. If a certain
situation arises, the software produces a specific result, every time.

Implicit Data Collection

When you gather information about users passively, usually through logging behavior.

Implicit Feedback
Information about user behaviors, preferences, and needs that’s gathered from their interactions
within your application or product. Often uses logging — records of what people do within your
app.

Inter-rater Reliability
Also known as inter-rater agreement, or concordance, is a score of much consensus there is
between
Google different
serves raters
cookies performing
to analyse the
traffic to same
this task.
site. Information about your use of our site is shared with Google
for that purpose. See details OK, got it

https://pair.withgoogle.com/glossary/ 3/7
12/10/2019 Glossary

Labeling/Labeled
A label is the description that is either given to a piece of data by a human or derived from user
actions. For example, labeling a photo as “sneakers”, or run route as “hilly”.

ML Model
Mathematical algorithm that learns the statistical relationships among examples to make
predictions in the future.

Machine Learning
Techniques and methods to program computers to execute tasks without super-specific rules. ML
can help machines recognize patterns and adjust to unique situations.

Machine Learning (ML) Systems

Techniques and methods to develop AI, by getting computers to do something without being
programmed with super-specific rules. ML can help machines recognize patterns and adjust to
unique situations.

Mental Model
Users’ internal explanations of how something works. They shape how users interact with a
product or feature and it’s perceived value.

N-Best, N-Best Classifications, N-Best Lists

Refers to showing a certain number, “n”, top solutions or suggestions, such as the top 5 matches
for an image search.

Network Effect
When a person starts or stops using a product or service because the majority of their network is
using it or not.

Overfitting
When a model is optimized for predictive power for a training dataset that is narrower than the ML
model’sserves
Google intended use.
cookies to analyse traffic to this site. Information about your use of our site is shared with Google
for that purpose. See details OK, got it

https://pair.withgoogle.com/glossary/ 4/7
12/10/2019 Glossary

Partial Explanations
Messages that explain one aspect of how the system works. Ideally, this is the most important
aspect to the user.

Precision
The proportion of true positives correctly categorized out of all the true and false positives.

Predictive Power
A percentage that refers to an ML models’ ability to correctly predict outcomes given a certain
input. A model with predictive power of 100 gives the correct prediction every time, 0 is purely
random.

Probabilistic
Situations where there are multiple possible outcomes, each having varying degrees of certainty of
its occurrence.

Progressive Disclosures
A practice in UX when more information is revealed in subsequent screens or interactions.

Qualitative Feedback
Non-numeric feedback about how a user feels about a certain experience. Can include measures
of satisfaction, happiness, verbal responses or other qualities.

Quantitative Feedback
Feedback that is numeric or converted to a number. Both implicit and explicit feedback
mechanisms can be quantitative. This feedback can be fed back into your model for tuning.

Raters
The people who label the data used to train machine learning algorithms, specifically supervised
learning models.

Recall
Google serves cookies to analyse traffic to this site. Information about your use of our site is shared with Google
for that purpose. See details OK, got it

https://pair.withgoogle.com/glossary/ 5/7
12/10/2019 Glossary

The proportion of true positives correctly categorized out of all the true positives and false
negatives.

Redaction
When some pieces of a dataset or profile are removed to lower the possibility of identifying a
single user based on their data profile. You can redact certain features of data to shrink the data
profile, or redact examples for a certain amount of time.

Regressions
Also known as. linear regression algorithms, which try to find the best-fit line for a plot of data
points on a graph. As new data points appear over time, the algorithm adjusts the line to fit.

Reward Function
Mathematical equation that your ML algorithm uses to optimize outputs. The function weighs
some results as better than others, and optimizes for certain outcomes.

Second-order Effects
When the aggregate or outcomes or behaviors over time produces additional, unexpected
outcomes.

Specific Output Explanations

Descriptions of how a system arrives at a specific output based on a certain input.

Supervised Learning
When you “teach” your algorithm on training data. Often this is based on examples manually
labeled by humans to show “right” and “wrong” answers.

Test Data
Datasets that you use to test your ML model to make sure its predictions work on data it hasn’t
encountered before.

Training Data
Google serves cookies to analyse traffic to this site. Information about your use of our site is shared with Google
for that purpose.
Datasets that youSee
usedetails
to teachOK, got it
your ML model which outcomes correspond to which inputs.
https://pair.withgoogle.com/glossary/ 6/7
12/10/2019 Glossary

Transparency
Providing information about how a product works, including data sources, terms and conditions,
privacy, permissions, and rationale behind system output.

True Negatives
When the machine learning algorithm classifies an object as NOT in a certain category and it is
indeed not in that specific category. For example, it correctly classifies a llama as “not a sneaker”.

True Positives
When the machine learning algorithm classifies an object in a certain category, and the object is in
that category.

Tuning
When developers adjust their machine learning algorithm based on feedback or errors to improve
accuracy and performance.

Underfitting
When a model has a low predictive power across a more varied dataset.

Google serves cookies to analyse traffic to this site. Information about your use of our site is shared with Google
for that purpose. See details OK, got it

https://pair.withgoogle.com/glossary/ 7/7

Ebook 2023 Glossary AI Terms
No ratings yet
Ebook 2023 Glossary AI Terms
22 pages
ML Merged
No ratings yet
ML Merged
433 pages
Experience AI - Glossary of Terms
No ratings yet
Experience AI - Glossary of Terms
12 pages
AI Glossary Second Edit PDF
No ratings yet
AI Glossary Second Edit PDF
30 pages
Modelling
No ratings yet
Modelling
69 pages
Unit 3
No ratings yet
Unit 3
97 pages
Machine Learning Batch 8 2021
100% (1)
Machine Learning Batch 8 2021
73 pages
Glosario Sobre Aprendizaje Automático - Machine Learning - Google For Developers
No ratings yet
Glosario Sobre Aprendizaje Automático - Machine Learning - Google For Developers
270 pages
Statistics Interview Questions
100% (1)
Statistics Interview Questions
7 pages
Module 1 ML Mumbai University
No ratings yet
Module 1 ML Mumbai University
47 pages
The Machine Learning Glossary
No ratings yet
The Machine Learning Glossary
21 pages
ML Terminologies PDF
100% (1)
ML Terminologies PDF
44 pages
ML 02 Dataset-Feature Selection PDF
No ratings yet
ML 02 Dataset-Feature Selection PDF
44 pages
03 ML Essentials
No ratings yet
03 ML Essentials
52 pages
FML - KNN
No ratings yet
FML - KNN
64 pages
Ai CH 2
No ratings yet
Ai CH 2
43 pages
AIch 5
No ratings yet
AIch 5
50 pages
SEC Presentation
No ratings yet
SEC Presentation
22 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
39 pages
dbms-10 Marks
No ratings yet
dbms-10 Marks
32 pages
Glossary - Experience AI
No ratings yet
Glossary - Experience AI
17 pages
Introduction Class
No ratings yet
Introduction Class
134 pages
Ethics, Uses and Abuses of ML
No ratings yet
Ethics, Uses and Abuses of ML
11 pages
cs329s 2022 02 Slides MLSD
No ratings yet
cs329s 2022 02 Slides MLSD
99 pages
ML Chap 2
No ratings yet
ML Chap 2
60 pages
July4 SaketAnand FriendlyIntroToML
No ratings yet
July4 SaketAnand FriendlyIntroToML
84 pages
Key Elements of Machine Learning
No ratings yet
Key Elements of Machine Learning
9 pages
EE353 - 769 06 Intro To ML
No ratings yet
EE353 - 769 06 Intro To ML
27 pages
Basics of ML and Evaluation
No ratings yet
Basics of ML and Evaluation
42 pages
Ai Notes
No ratings yet
Ai Notes
8 pages
Machine - Learning - Unit - 1
No ratings yet
Machine - Learning - Unit - 1
70 pages
Unit III 1
No ratings yet
Unit III 1
21 pages
Machine Learning QB
No ratings yet
Machine Learning QB
15 pages
Machine Learning Note
No ratings yet
Machine Learning Note
40 pages
Module 4
No ratings yet
Module 4
28 pages
PSCS511 - Machine Learning
No ratings yet
PSCS511 - Machine Learning
23 pages
Machine Learning Most Important Question For Mid Term Ipu University
No ratings yet
Machine Learning Most Important Question For Mid Term Ipu University
36 pages
2021 Machine Learning Intro
No ratings yet
2021 Machine Learning Intro
43 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
36 pages
Strategy Deck
No ratings yet
Strategy Deck
16 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
ML Week 3
No ratings yet
ML Week 3
6 pages
Lecture 3 - 1-ML and Data Systems Fundamentals
No ratings yet
Lecture 3 - 1-ML and Data Systems Fundamentals
48 pages
CSC413 Lecture Note
No ratings yet
CSC413 Lecture Note
32 pages
Asset v1 ACCA+ML001+2T2021+Type@Asset+Block@Glossary
No ratings yet
Asset v1 ACCA+ML001+2T2021+Type@Asset+Block@Glossary
5 pages
Basic Concepts of Machine Learning For Beginners 1732109263
No ratings yet
Basic Concepts of Machine Learning For Beginners 1732109263
102 pages
Disruptive Technologies AI Lecture 2
No ratings yet
Disruptive Technologies AI Lecture 2
12 pages
Ch7 Introduction To Machine Learning
No ratings yet
Ch7 Introduction To Machine Learning
29 pages
5.1 Large Scale ML
No ratings yet
5.1 Large Scale ML
10 pages
Machine Learning Lecture 1
No ratings yet
Machine Learning Lecture 1
10 pages
Lesson 4 - Introduction Machine Learning
No ratings yet
Lesson 4 - Introduction Machine Learning
44 pages
The Ultimate Guide To Algorithmic Trading
No ratings yet
The Ultimate Guide To Algorithmic Trading
17 pages
Designing A Learning System
No ratings yet
Designing A Learning System
21 pages
Lecture - 2 Classification (Machine Learning Basic and KNN)
No ratings yet
Lecture - 2 Classification (Machine Learning Basic and KNN)
94 pages
ML Glossary
No ratings yet
ML Glossary
44 pages
Machine Learning Notes From AWS
No ratings yet
Machine Learning Notes From AWS
5 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
ML-2 Guided Project Report
No ratings yet
ML-2 Guided Project Report
63 pages
ML Midterm Cheatsheet
No ratings yet
ML Midterm Cheatsheet
2 pages
Machine Learning HC
No ratings yet
Machine Learning HC
4 pages
Machine Learning
No ratings yet
Machine Learning
48 pages
Machine Learning & Data Mining
No ratings yet
Machine Learning & Data Mining
4 pages
Healthcare Management System Using Python Full Stack K
No ratings yet
Healthcare Management System Using Python Full Stack K
41 pages
Project Report
No ratings yet
Project Report
29 pages
U20cs604 Machine Learning Unit III
No ratings yet
U20cs604 Machine Learning Unit III
23 pages
Deep Learning Practical File
No ratings yet
Deep Learning Practical File
36 pages
Experiment 4
No ratings yet
Experiment 4
6 pages
Spam Detection in Emails Using Machine Learning
No ratings yet
Spam Detection in Emails Using Machine Learning
56 pages
BERT and RoBERTa For Sarcasm Detection - Optimizing Performance Through Advanced Fine-Tuning
No ratings yet
BERT and RoBERTa For Sarcasm Detection - Optimizing Performance Through Advanced Fine-Tuning
11 pages
AI For Earthquake Prediction
No ratings yet
AI For Earthquake Prediction
14 pages
Paper Id - ICCCAI25 - 188
No ratings yet
Paper Id - ICCCAI25 - 188
8 pages
Big Data Analytics With Java 1st Edition Rajat Mehta
No ratings yet
Big Data Analytics With Java 1st Edition Rajat Mehta
65 pages
Module 3
No ratings yet
Module 3
53 pages
openSAP Sac5 Week 4 Unit 7 PREDKEYINT Exercise
No ratings yet
openSAP Sac5 Week 4 Unit 7 PREDKEYINT Exercise
18 pages
1 s2.0 S092523122300766X Main
No ratings yet
1 s2.0 S092523122300766X Main
10 pages
AI in Healthcare Paper For Assignment 2
No ratings yet
AI in Healthcare Paper For Assignment 2
11 pages
ML Unit 1 Solution
No ratings yet
ML Unit 1 Solution
18 pages
FDS Viva
No ratings yet
FDS Viva
46 pages
A Complete Guide To Data Augmentation - DataCamp
No ratings yet
A Complete Guide To Data Augmentation - DataCamp
18 pages
IEEE Zeta Rho Chapter - Artificial Intelligence-Based Fault Detection and Localization For Underground Cables - Slides
No ratings yet
IEEE Zeta Rho Chapter - Artificial Intelligence-Based Fault Detection and Localization For Underground Cables - Slides
26 pages
Collaborative Learning For Cyberattack Detection in Blockchain Networks
No ratings yet
Collaborative Learning For Cyberattack Detection in Blockchain Networks
12 pages
Boruta Feature Selection in R - DataCamp
No ratings yet
Boruta Feature Selection in R - DataCamp
18 pages
Exploring Natural Language Processing in Model-To-Model Transformations
No ratings yet
Exploring Natural Language Processing in Model-To-Model Transformations
17 pages
Is Grad-CAM Explainable in Medical Images?
No ratings yet
Is Grad-CAM Explainable in Medical Images?
13 pages
IoT Module 5 Notes
No ratings yet
IoT Module 5 Notes
6 pages
UNIT-1 Polynomial Regression
No ratings yet
UNIT-1 Polynomial Regression
7 pages
Prediction of Stock Price Trend Based On Wavelet Neural Network and RS Attributes Reduction
No ratings yet
Prediction of Stock Price Trend Based On Wavelet Neural Network and RS Attributes Reduction
4 pages
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
From Everand
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
Steven Taylor
No ratings yet
Getting Paid To Test AI
From Everand
Getting Paid To Test AI
Michael Smith
No ratings yet
Mastering Machine Learning: A Comprehensive Guide to Success
From Everand
Mastering Machine Learning: A Comprehensive Guide to Success
Rick Spair
No ratings yet

Glossary

Uploaded by

Glossary

Uploaded by

12/10/2019 Glossary

Confidence Level, Model Confidence

Data Collection and Labeling

Explicit Data Collection

General System Explanations

Implicit Data Collection

Machine Learning (ML) Systems

N-Best, N-Best Classifications, N-Best Lists

Specific Output Explanations

You might also like