Glossary Terms From Module 2

This document defines glossary terms from module 2 of a course, including box plots, CSV files, database files, data sources, extracting, filtering, grouping, hypotheses, info(), Int64, JSON files, merging, slicing, sorting, strings, and terms from previous modules such as bias, cleaning, data visualization, discovering, exploratory data analysis (EDA), joining, PACE workflow, presenting, structuring, and validating.

Uploaded by

rariaseum

Glossary terms from module 2

Terms and definitions from Course 3, Module 2

Box plot: A data visualization that depicts the locality, spread, and skew of groups of values within
quartiles
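
As a rough sketch of the numbers a box plot draws, the quartiles can be computed with Python's standard statistics module. The sample values below are made up for illustration, and plotting libraries may use a slightly different quartile method.

```python
import statistics

# Hypothetical sample values, for illustration only.
values = [2, 4, 4, 5, 7, 9, 10, 12, 15]

# quantiles(n=4) returns the three cut points: Q1, the median (Q2), and Q3.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# The interquartile range (IQR) is the length of the box.
iqr = q3 - q1

print(f"min={min(values)} Q1={q1} median={median} Q3={q3} max={max(values)} IQR={iqr}")
```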

CSV file: A simple text file of comma-separated values that is easy to import into or store in other
software, platforms, and databases
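
A minimal sketch of reading CSV data with Python's standard csv module. The column names and values are hypothetical, and an in-memory string stands in for a file on disk.

```python
import csv
import io

# Hypothetical CSV text; in practice this would come from a file on disk.
csv_text = "name,score\nAda,90\nGrace,85\n"

# DictReader uses the first row as the column names.
rows = list(csv.DictReader(io.StringIO(csv_text)))

# Every value is read as a string, so convert types explicitly.
scores = [int(row["score"]) for row in rows]

print(rows[0]["name"], scores)
```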

Database (DB) file: A file type used to store data, often in tables, indexes, or fields

Data source: The location where data originates

Extracting: The process of retrieving data out of data sources for further data processing or storage
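
One way to picture extracting is a query against a database. The sketch below uses Python's standard sqlite3 module with an in-memory database standing in for a database file; the table and column names are assumptions made for the example.

```python
import sqlite3

# An in-memory SQLite database stands in for a database file here;
# the table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("north", 120), ("south", 80), ("north", 45)],
)

# Extract the rows out of the data source for further processing.
extracted = conn.execute(
    "SELECT region, amount FROM sales ORDER BY amount"
).fetchall()
conn.close()

print(extracted)
```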

Filtering: The process of selecting a smaller part of a dataset based on specified values and using it
for viewing or analysis
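
In pandas, filtering is often done with a Boolean mask. This is a small sketch with made-up data, not the only way to filter.

```python
import pandas as pd

# Hypothetical data, for illustration only.
df = pd.DataFrame({"city": ["Lima", "Oslo", "Cairo"], "temp_c": [19, 4, 28]})

# The comparison builds a True/False mask; indexing with it keeps the True rows.
warm = df[df["temp_c"] > 15]

print(warm)
```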

First-party data: Data that was gathered from inside your own organization

Grouping: The process of aggregating individual observations of a variable into groups
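
A short pandas sketch of grouping, assuming a hypothetical sales table: rows that share a region are collected into one group, and each group is then summarized with a sum.

```python
import pandas as pd

# Hypothetical data, for illustration only.
df = pd.DataFrame(
    {"region": ["north", "south", "north"], "amount": [120, 80, 45]}
)

# Group the rows by region, then total the amounts within each group.
totals = df.groupby("region")["amount"].sum()

print(totals)
```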

Hypothesis: A theory or an explanation, based on evidence, that has not yet been refuted

info(): A pandas DataFrame method that gives the total number of entries, along with the data type
(called the Dtype in pandas) of each column
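
A minimal sketch of calling this method on a small, made-up pandas DataFrame. The printed summary also includes non-null counts and memory usage.

```python
import pandas as pd

# Hypothetical data, for illustration only.
df = pd.DataFrame({"name": ["Ada", "Grace"], "score": [90, 85]})

# Prints the number of entries, each column's non-null count, and its Dtype.
df.info()

# The same data types are also available directly as a Series.
column_types = df.dtypes
```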

Int64: A standard 64-bit integer data type, representing whole numbers between roughly negative nine
quintillion and positive nine quintillion
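
The exact limits of a signed 64-bit integer follow from its width: one bit holds the sign, leaving 63 bits for the magnitude. A quick check in plain Python:

```python
# A signed 64-bit integer spans -(2**63) through 2**63 - 1.
INT64_MIN = -(2 ** 63)
INT64_MAX = 2 ** 63 - 1

# Printed with thousands separators to make the "nine quintillion" scale visible.
print(f"{INT64_MIN:,} to {INT64_MAX:,}")
```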

JSON file: A data storage file saved in JavaScript Object Notation (JSON), a text format built from key-value pairs and ordered lists
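
A small sketch of parsing JSON text with Python's standard json module. The record is hypothetical, and a string stands in for the contents of a file.

```python
import json

# Hypothetical JSON text; in practice this would be read from a .json file.
json_text = '{"name": "Ada", "scores": [90, 85], "active": true}'

# loads() turns JSON objects into dicts, arrays into lists, and true into True.
record = json.loads(json_text)

# dumps() converts the Python object back into JSON text.
round_trip = json.dumps(record, sort_keys=True)

print(record["scores"], round_trip)
```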

Merging: A method to combine two (or more) different data frames along one or more specified key
columns
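
A brief pandas sketch of merging two made-up data frames on a shared column. The names customer_id, name, and total are assumptions for the example, and an inner merge is only one of several options.

```python
import pandas as pd

# Hypothetical data frames that share a customer_id column.
customers = pd.DataFrame(
    {"customer_id": [1, 2, 3], "name": ["Ada", "Grace", "Linus"]}
)
orders = pd.DataFrame({"customer_id": [1, 1, 3], "total": [20, 35, 12]})

# An inner merge keeps only the customer_id values found in both frames.
merged = customers.merge(orders, on="customer_id", how="inner")

print(merged)
```

Customer 2 has no orders, so that row does not appear in the inner merge; how="left" would keep it with a missing total.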

Second-party data: Data that was gathered outside your organization but directly from the original
source

Slicing: A method for breaking information down into smaller parts to facilitate efficient examination
and analysis from different viewpoints
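
In pandas, one common form of slicing is selecting a range of rows or a subset of columns. The sketch below uses made-up data and shows two of several possible approaches.

```python
import pandas as pd

# Hypothetical data, for illustration only.
df = pd.DataFrame(
    {
        "region": ["north", "south", "east", "west"],
        "amount": [120, 80, 45, 60],
        "units": [3, 2, 1, 2],
    }
)

# Rows by position: start included, end excluded.
first_two_rows = df.iloc[0:2]

# All rows, but only the named columns.
two_columns = df.loc[:, ["region", "amount"]]

print(first_two_rows)
print(two_columns)
```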

Sorting: The process of arranging data into a meaningful order for analysis
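
A minimal pandas sketch of sorting, assuming a hypothetical table of scores ordered from highest to lowest.

```python
import pandas as pd

# Hypothetical data, for illustration only.
df = pd.DataFrame({"name": ["Ada", "Grace", "Linus"], "score": [85, 92, 78]})

# sort_values returns a new, reordered data frame; df itself is unchanged.
ranked = df.sort_values("score", ascending=False)

print(ranked)
```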

String: A sequence of characters and punctuation that contains textual information

Third-party data: Data gathered outside your organization and aggregated

Terms and definitions from previous modules

B
Bias: In data structuring, bias refers to organizing data results into groupings, categories, or variables
that are misrepresentative of the whole dataset

C
Cleaning: The process of removing errors that may distort your data or make it less useful; one of the
six practices of Exploratory Data Analysis (EDA)

D
Data visualization: A graph, chart, diagram, or dashboard that is created as a representation of
information

Discovering: The process data professionals use to familiarize themselves with the data so they can
start conceptualizing how to use it; one of the six practices of EDA

E
Exploratory data analysis (EDA): The process of investigating, organizing, and analyzing datasets
and summarizing their main characteristics, often by employing data wrangling and visualization
methods; the six main practices of EDA are: discovering, structuring, cleaning, joining, validating,
and presenting

J
Joining: The process of augmenting data by adding values from other datasets; one of the six
practices of EDA

P
PACE: A workflow data professionals can use to remain focused on the end goal of any given
dataset; stands for plan, analyze, construct, and execute

Presenting: The process of making a cleaned dataset available to others for analysis or further
modeling; one of the six practices of EDA

S
Structuring: The process of taking raw data and organizing or transforming it to be more easily
visualized, explained, or modeled; one of the six practices of EDA

V
Validating: The process of verifying that the data is consistent and high quality; one of the six
practices of EDA

