Python Data Analysis Vocabulary List

Uploaded by CD Monib

Python & Data Analysis Vocabulary List

Python Vocabulary

Variable: A name that refers to a stored value (e.g., x = 5)

Data Type: The kind of data (string, integer, float, boolean, etc.)

String: Text data inside quotes (e.g., 'Hello')

Integer (int): Whole numbers like 5, 100

Float: Decimal numbers like 3.14

Boolean: True or False values

List: A collection of items inside [] (e.g., [1, 2, 3])

Tuple: Like a list, but cannot be changed after creation (immutable); written inside () (e.g., (1, 2, 3))

Dictionary (dict): Data stored as key-value pairs {key: value}

Set: A collection of unique items (no duplicates), written inside {} (e.g., {1, 2, 3}); an empty set is written set(), because {} creates an empty dictionary
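
The four collection types above can be compared side by side. This is a minimal sketch; the variable names and values are illustrative and do not come from the original list.

```python
# Illustrative values only; none of these names appear in the original list.
numbers = [1, 2, 3]               # list: ordered and mutable
numbers.append(4)                 # lists can grow after creation

point = (10, 20)                  # tuple: ordered but immutable
try:
    point[0] = 99                 # attempt to change a tuple item
except TypeError as err:
    tuple_error = str(err)        # tuples do not support item assignment

ages = {"Ana": 31, "Ben": 27}     # dict: key-value pairs
ages["Cy"] = 45                   # add a new key

unique = {1, 2, 2, 3}             # set: the duplicate 2 is dropped
empty_set = set()                 # {} would create an empty dict instead

print(numbers, point, ages, unique, type({}).__name__)
```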

Operator: Symbols like +, -, *, / used in calculations

Conditional Statement: Code that makes decisions using if, elif, else

Loop: Code that repeats (e.g., for loop, while loop)

Function (def): Reusable block of code, defined using def

Argument / Parameter: A parameter is a name listed in a function's definition; an argument is the actual value passed in when the function is called

Return: The statement that sends a result from a function back to the code that called it
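
The function-related terms fit together as in the following sketch. The function name rectangle_area and its values are hypothetical, chosen only for illustration.

```python
def rectangle_area(width, height=1.0):
    """Return the area of a rectangle.

    `width` and `height` are parameters; `height` has a default value.
    """
    return width * height         # return sends the result to the caller


# 3 and 4 are the arguments passed in for this call.
area = rectangle_area(3, 4)

# Keyword argument; height falls back to its default of 1.0.
strip = rectangle_area(width=5)

print(area, strip)
```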

Indentation: Leading spaces or tabs that mark which lines belong to a block (such as the body of a loop or function); Python requires it to be consistent

Class: A blueprint for creating objects

Object: An instance of a class

Attribute: A variable that belongs to a class or object

Method: A function defined inside a class and called on its objects
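
A short sketch can tie class, object, attribute, and method together. Counter is a hypothetical class written only for this illustration.

```python
class Counter:
    """A hypothetical class used only to illustrate the terms."""

    def __init__(self, start=0):
        self.value = start        # attribute: data stored on the object

    def increment(self, step=1):  # method: a function defined in the class
        self.value += step
        return self.value


clicks = Counter()                # object: an instance of the Counter class
clicks.increment()                # value becomes 1
clicks.increment(step=2)          # value becomes 3
print(clicks.value)
```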

Module: A Python file containing functions/classes

Package: A collection of Python modules

Import: Bringing in external code using import

Library: A collection of ready-made code for specific tasks (e.g., Pandas, NumPy)

Exception: An error detected during program execution

Try-Except Block: Code that catches and handles exceptions so the program can continue instead of crashing
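
As a minimal sketch of exception handling, the hypothetical helper below catches one specific built-in exception, ZeroDivisionError, and returns None instead of stopping the program.

```python
def safe_divide(a, b):
    """Hypothetical helper: divide a by b, or return None if b is zero."""
    try:
        return a / b              # may raise ZeroDivisionError
    except ZeroDivisionError:
        return None               # handle the error instead of crashing


print(safe_divide(6, 3))
print(safe_divide(1, 0))
```

Catching a specific exception type, rather than every possible error, keeps unrelated bugs visible.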

Lambda Function: A small anonymous function written in one expression with the lambda keyword

List Comprehension: A quick way to create lists with a loop in one line

Recursion: A function calling itself
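
Lambda functions, list comprehensions, and recursion are easiest to see in code. The names below are illustrative assumptions, not part of the original list.

```python
# Lambda: an anonymous one-expression function, here bound to a name.
square = lambda x: x * x

# List comprehension: build a list with a loop in one line.
squares = [square(n) for n in range(5)]
evens = [n for n in range(10) if n % 2 == 0]   # with a filter condition


def factorial(n):
    """Recursive factorial; assumes n is a non-negative integer."""
    if n <= 1:                    # base case stops the recursion
        return 1
    return n * factorial(n - 1)   # the function calls itself


print(squares, evens, factorial(5))
```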

Decorator: A function that wraps another function to add behavior without changing that function's own code

Iterable: Any object you can loop over (like list, tuple)

Iterator: An object that produces items one at a time and remembers its position during iteration

Generator: A function that produces values one at a time using yield, instead of returning them all at once
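
The last few terms (decorator, iterable, iterator, generator) can be sketched together. count_calls, greet, and countdown are hypothetical names used only for this example.

```python
import functools


def count_calls(func):
    """Hypothetical decorator that counts how often func is called."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        wrapper.calls += 1
        return func(*args, **kwargs)
    wrapper.calls = 0
    return wrapper


@count_calls
def greet(name):
    return f"Hello, {name}"


def countdown(start):
    """Generator: yields start, start - 1, ..., 1, one value at a time."""
    while start > 0:
        yield start
        start -= 1


greet("Ana")
greet("Ben")

it = iter([10, 20])               # a list is iterable; iter() gives an iterator
first = next(it)                  # the iterator remembers its position
remaining = list(it)              # only the items not yet consumed

print(greet.calls, list(countdown(3)), first, remaining)
```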

Data Analysis Vocabulary

DataFrame: A table of data with rows and columns (like an Excel sheet); the main data structure in Pandas

Series: A one-dimensional labeled array in Pandas; each column of a DataFrame is a Series

Index: Row labels in a DataFrame

CSV: Comma-Separated Values, a plain-text file format for tabular data

Excel File (.xlsx): Excel spreadsheet file format

JSON: JavaScript Object Notation, a readable data format

Data Cleaning: Fixing or removing bad data (e.g., null values, duplicates)

Missing Data (NaN): Values that are empty or not available; Pandas marks them as NaN (Not a Number)

Duplicate: Repeated data entries
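
Data cleaning with Pandas might look like the sketch below. The DataFrame contents are made up for illustration, and filling missing scores with the column mean is only one possible choice.

```python
import pandas as pd

# Hypothetical data with one duplicated row and one missing value.
df = pd.DataFrame({
    "name": ["Ana", "Ben", "Ben", "Cy"],
    "score": [90.0, 85.0, 85.0, None],
})

missing_per_column = df.isna().sum()      # count NaN values in each column
deduped = df.drop_duplicates()            # remove repeated rows

# Replace the remaining NaN with the mean of the known scores.
filled = deduped.fillna({"score": deduped["score"].mean()})

print(missing_per_column)
print(filled)
```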

Merge: Combining two DataFrames based on common columns

Join: Combining datasets SQL-style (inner, left, right, or outer); in Pandas, DataFrame.join matches rows on the index by default

GroupBy: Grouping data and performing calculations (sum, mean, count)

Aggregation: Summarizing data (like total sales, average)
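
Merging and grouping often appear together. The following sketch assumes two small, made-up tables (orders and customers) that share a customer_id column.

```python
import pandas as pd

# Hypothetical tables; the column names are assumptions for illustration.
orders = pd.DataFrame({
    "customer_id": [1, 2, 1, 3],
    "amount": [20.0, 35.0, 15.0, 50.0],
})
customers = pd.DataFrame({
    "customer_id": [1, 2, 3],
    "city": ["Oslo", "Lima", "Oslo"],
})

# Merge: combine rows that share the same customer_id (inner join by default).
merged = orders.merge(customers, on="customer_id")

# GroupBy + aggregation: total and average amount per city.
summary = merged.groupby("city")["amount"].agg(["sum", "mean"])

print(summary)
```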

Pivot Table: A tool to summarize data by rows and columns

Reshape: Changing the structure of a DataFrame

Filter: Selecting data rows that meet certain conditions

Sort: Arranging data in order (ascending/descending)

Indexing: Accessing specific rows/columns in data

Slicing: Selecting a contiguous portion of data (e.g., the first five rows with data[0:5])
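
Filtering, sorting, indexing, and slicing can each be shown in one line with Pandas. The product table below is invented for this sketch.

```python
import pandas as pd

# Hypothetical product table.
df = pd.DataFrame({
    "product": ["A", "B", "C", "D"],
    "price": [5, 12, 8, 20],
})

expensive = df[df["price"] > 7]                       # filter: rows meeting a condition
by_price = df.sort_values("price", ascending=False)   # sort: highest price first
first_two = df.iloc[0:2]                              # slicing: rows at positions 0 and 1
price_of_c = df.loc[2, "price"]                       # indexing: row label 2, column "price"

print(expensive, by_price, first_two, price_of_c, sep="\n")
```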

Correlation: A measure of the strength and direction of the relationship between two variables, usually expressed as a coefficient from -1 to 1

Outlier: A data point that is very different from others

Skewness: A measure of how asymmetric a distribution is (whether its tail stretches further to the left or to the right)

Kurtosis: Measure of whether data has heavy/light tails (extreme values)

Standard Deviation: Measure of how spread out numbers are

Variance: The average squared distance of values from the mean; equal to the square of the standard deviation

Normalization: Scaling data to a fixed range, usually 0 to 1 (min-max scaling)

Standardization: Scaling data to have mean=0 and std=1
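
The four measures above can be worked through on a small sample. This sketch uses Python's built-in statistics module and the population formulas (dividing by n); the data values are made up. Pandas' std() divides by n - 1 by default, so it would give a slightly different result on the same data.

```python
import statistics

# Hypothetical sample values.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

mean = statistics.fmean(data)             # arithmetic mean
variance = statistics.pvariance(data)     # population variance
std = statistics.pstdev(data)             # population standard deviation

# Normalization (min-max): rescale so the smallest value is 0 and the largest is 1.
low, high = min(data), max(data)
normalized = [(x - low) / (high - low) for x in data]

# Standardization: rescale so the mean is 0 and the standard deviation is 1.
standardized = [(x - mean) / std for x in data]

print(mean, variance, std)
print(normalized)
print(standardized)
```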

Histogram: Graph showing frequency distribution

Scatter Plot: Graph showing relationship between two variables

Box Plot: Graph showing data distribution with median, quartiles, and outliers

Bar Chart: Graph using bars to represent data values

Line Chart: Graph using lines to show trends over time

EDA (Exploratory Data Analysis): Summarizing and visualizing data to find patterns before modeling

Feature: An input column (variable) in a dataset, used to make predictions

Target Variable: The output you want to predict

Train/Test Split: Dividing data for model training and testing
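
A train/test split can be sketched in plain Python as below. The helper train_test_split_rows and its 25% test fraction are assumptions for illustration; in practice, scikit-learn offers train_test_split in sklearn.model_selection for this purpose.

```python
import random


def train_test_split_rows(rows, test_fraction=0.25, seed=0):
    """Hypothetical helper: shuffle rows, then split off a test portion."""
    shuffled = list(rows)                     # copy so the input is not modified
    random.Random(seed).shuffle(shuffled)     # fixed seed makes the split repeatable
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


rows = list(range(20))                        # stand-in for 20 data rows
train, test = train_test_split_rows(rows)

print(len(train), len(test))
```

Shuffling before splitting avoids a test set made up only of the last rows, which matters when the data is sorted.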

Overfitting: When a model learns the training data too closely, including its noise, and so predicts poorly on new data

Underfitting: When a model is too simple to capture the patterns in the data, so it performs poorly even on the training data

Model: A mathematical formula created to analyze or predict data

Machine Learning: Teaching computers to learn patterns from data

Algorithm: A set of steps to solve a problem (e.g., Linear Regression)

Common Python Data Libraries

Pandas: For data manipulation and analysis

NumPy: For numerical computations and arrays

Matplotlib: For data visualization (charts, plots)

Seaborn: For statistical plots with attractive default styles; built on Matplotlib

Scikit-learn: For machine learning models

Statsmodels: For statistical analysis

OpenPyXL: For reading and writing Excel (.xlsx) files

SQLAlchemy: For connecting to and querying SQL databases

