0% found this document useful (0 votes)

15 views64 pages

Class Xii Information Practices PPT On Data Handling Using Pandas-I

The document outlines a blueprint for a course on data handling using Pandas, SQL database querying, computer networks, and societal impacts, totaling 100 marks. It provides detailed explanations of Python modules, libraries, and data structures, particularly focusing on Pandas Series and DataFrames, their creation, properties, and operations. Additionally, it covers data import/export methods and differences between Series and DataFrames.

Uploaded by

Avni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views64 pages

Class Xii Information Practices PPT On Data Handling Using Pandas-I

Uploaded by

Avni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 64

Blue Print:

Unit Unit Name Marks

1 Data Handling using Pandas and Data 30

Visualization

2 Database Query using SQL 25

3 Introduction to Computer Networks 7

4 Societal Impacts 8

Practical 30

Total 100
Unit 1
Data Handling using Pandas and Data
Visualization

(Data Handling using Pandas –I)

Module: Module is a file which contains python functions.
It is
.py file which has python executable code or statements.
Package: Package is namespace which contains
multiple packages or modules. It is a directory which
contains a special file init .py.
init .py file denotes Python the file that contains
init .py as package.
Library: It is collection of various packages. There is no
difference between package and python library
conceptually.

Framework: It is a collection of various libraries which

architects the code flow.
Pandas:
Pandas is the most popular open source python library
used for data analysis.
We can analyze the data in pandas in two ways-
● Series
● Da taframes
nstallation of pandas:

pip install pandas

Series:
Series is 1-Dimensional array defined in python pandas
to store any data type.

Syntax:

<Series Name>=<pd>.Series(<list name>, ...)

Example:
5 15 16 4 34

Properties of Series:
• Series will contain homogeneous data type.
• Size of the series immutable
• Values in the series are mutable.
Creation of Series:
We can create a pandas series in following ways-

● From arrays
● From Lists
● From Dictionaries
● From scalar value
From Lists :

Output:
From arrays :

Output:
From Dictionary:

Output:
From Scalar Value:

Output:
Mathematical Operations on Series:
Mathematical Operations on Series (cont…):

Output:
Head and Tail functions on Series:
head and tail functions returns first and last n rows
respectively. Syntax:
<Series name>.head(n)
<Series
name>.tail(n) n-number
of rows
Default value of n is 5
Selection, Indexing and Slicing on Series:
Selection: We can select a value from the series by using
its corresponding index.
Syntax:
<Series name>[<index number>]

Output:
Indexing:
Series.index attribute is used to get or set the index
labels for the given series.

Syntax:
<Series name>.index
Indexing (cont...):

Output:
Slicing:
Slicing operation on the series split the series based
on the given parameters.
Syntax:
<Series
name>[<start>:<stop>:<step>]
Note: start,stop,step are optional
Default values: start=0, stop=n-1,
step=1 Note: slicing will take default
index
Data Frames
Data Frames:
Data Frames is a two-dimensional(2-D) data structure
defined in pandas which consist of rows and columns.
Data Frames stores an ordered collection of columns that
can store data of different types.

Example:
S.No. Name Age Marks

1 Ravi 25 99

2 Kunal 26 98
Characteristics of Data Frames:
➢ It has two indices (two axes)
○ Row index (axis=0) ->known as index
○ Column index (axis=1) ->known as column-name
➢ Value in the Data Frame will be identifiable
by the combination of row index and
column index.
➢ Indices can be of any type
➢ Column can have data of different types.
➢ Value is mutable
➢ Size is mutable
Creation of Data Frames:
Syntax:
<Data Frame Name>=
pandas.DataFrame(
<2D data structure>,
<columns=<column sequence>,
<index=<index sequence>,.....)
We can create Data Frame in many ways, such as-
(i) Two dimensional dictionaries
(ii)Two dimensional ndarrays(NumPy arrays)
(iii) Series type object
(iv)Another Dataframe object
(v)Text/CSV files
Creating Data frame from List:

Output:
Creating Data frame from array:

Output:
Creating Data frame from Series:

Output:
Creating Data frame from another Data frame:

Output:
(i) Two dimensional dictionaries
We can create Dataframe from Two dimensional
dictionaries-

➢ Creating Dataframe from list of dictionaries

➢ Creating Dataframe from dictionary of Series

Creating Dataframe from list of dictionaries:

Output:
Creating Data frame from dictionary of Series:

Output:
(v) Text/CSV files:
We can Create Dataframe from Text/CSV Files by
using read_csv() function.
Syntax:
<data frame name>
=pandas.read_csv(filepath_or_buffer, sep=',',
delimiter=None, header='infer', names=None,
index_col=None, usecols=None, …)
(v) Text/CSV files (cont..):

Output:
Accessing values in dataframe:
Accessing a particular value:
<Data frame name>[<column name>][<index>]

Accessing a group of values:

<Data frame name>.loc[<index>],[<column name>]
Accessing values in dataframe (cont…):

Output:
NaN variable in Python:
NaN , standing for not a number, is a numeric data type
used to represent any value that is undefined or
unpresentable. For example, 0/0 is undefined as a real
number and is, therefore, represented by NaN.
Iteration on Dataframes:

In Pandas Dataframe we can iterate an element in two

ways:

● Iterating over rows

● Iterating over columns
Iterating over rows :

To iterate over the rows of the DataFrame, we can use

the following functions −
● iterrows() − iterate over the rows as (index,series)
pairs
● iteritems() − to iterate over the (key,value) pairs
● itertuples() − iterate over the rows as namedtuples
iterrows():

Output:
iteritems():

Output:
itertuples():

Output:
Iterating over Columns :In order to iterate over columns,
we need to create a list of dataframe columns and then
iterating through that list to pull out the data frame
columns.
Operations on rows and columns:

● Add
● Select
● Delete
● Rename
Column selection:

Output:
Column addition:

Output:
Column Deletion:

Output:
Column Rename:

Output:
Row selection:

Output:
Row Addition:

Output:
Row Deletion:

Output:
Row Rename:

Output:
Head and Tail functions in Data Frames:

head(n):
Returns the first n rows.
tail(n):
Returns last n rows.
Default value for n is 5
Indexing using Labels in Data Frames: We can make one
of the columns as row index label for the data frame by
using the function set_index().

Output:
Boolean indexing in Data Frames: Boolean indexing helps
us to select the data from the Data Frames using a
boolean vector.
Joining, Merging and Concatenation on Data Frames:
Merge:
pandas.merge() method is used for merging two data
frames. It will have three arguments.
● Data frame names
● how - how will take any of the three values i.e.,
left,right or inner
● on - on the common column name
Merge (cont..):
Join:The join method uses the index of the
dataframes. Use <dataframe 1>.join(<dataframe
2>) to join
Concatenation:Concatenate uses pandas.concat(<List of
data frames>).
Importing/Exporting Data between CSV files and Data
Frames:
Import data from CSV file to Data Frame:We can import
data from CSV File to Data Frame by using read_csv()
function.

Output:
Export data from Data Frame to CSV File:We can export
data from Data Frame to CSV File by using to_csv()
function.
Syntax:
<data frame name>.to_csv(<File
Python module- A python module is a python script file(.py file) containing variables, python classes, functions,
statements etc.
Python Library/package- A Python library is a collection of modules that together cater to a specific type of need or
application. The advantage of using libraries is that we can directly use functions/methods for performing specific
type of application instead of rewriting the code for that particular use. They are used by using the import command
as-
import libraryname
at the top of the python code/script file.
Some examples of Python Libraries-
1. Python standard library-It is a collection of library which is normally distributed along with Python installation.
Some of them are-
a. math module- provides mathematical functions
b. random module- provides functions for generating pseudo-random numbers.
c. statistics module- provides statistical functions
2. Numpy (Numerical Python) library- It provides functions for working with large multi-dimensional arrays(ndarrays)
and matrices. NumPy provides a large set of mathematical functions that can operate quickly on the entries of the
ndarray without the need of loops.
3. Pandas (PANel + DAta) library- Pandas is a fast, powerful, flexible and easy to use open source data analysis and
manipulation tool. Pandas is built on top of NumPy, relying on ndarray and its fast and efficient array based
mathematical functions.
4. Matplotlib library- It provides functions for plotting and drawing graphs.

Data Structure- Data structure is the arrangement of data in such a way that permits efficient access and
modification.
Pandas Data Structures- Pandas offers the following data structures-
a) Series - 1D array
b) DataFrame - 2D array
c) Panel - 3D array (not in syllabus)
Series- Series is a one-dimensional array with homogeneous data.
Key features of Series-
• A Series has only one dimension, i.e. one axis 1D Data values
• Each element of the Series can be associated with an index/label that can be used to access the data value. By
default the index starts with 0,1,2,3… but it can be set to any other data type also.
• Series is data mutable i.e. the data values can be changed in-place in memory
• Series is size immutable i.e. once a series object is created in memory with a fixed number of elements, then the
number of elements cannot be changed in place. Although the series object can be assigned a different set of values
it will refer to a different location in memory. • All the elements of the Series are homogenous data i.e. their data
type is the same.

Differences between Series and DataFrame

Structure

The most obvious difference between a Series and a DataFrame is their structure. A Series is a one-dimensional
object, while a DataFrame is two-dimensional. This means that a Series has only one index, while a DataFrame has
both row and column indexes.

Dimensions

Another key difference between Series and DataFrame is their dimensions. A Series has only one dimension, while a
DataFrame has two. This means that a Series has only one axis, while a DataFrame has both row and column axes.
Data Types

While both Series and DataFrame can hold any data type, they have some differences in how they handle data
types. A Series can hold only one data type at a time, while a DataFrame can hold multiple data types in different
columns. This means that a DataFrame can be thought of as a collection of Series, where each column is a Series.

Operations

Series and DataFrame also have some differences in the types of operations that can be performed on them. For
example, arithmetic operations can be performed directly on a Series, but not on a DataFrame. To perform
arithmetic operations on a DataFrame, you need to specify the columns or rows that you want to operate on.

Technical Brief Stats Concepts 19c
No ratings yet
Technical Brief Stats Concepts 19c
27 pages
Pandas Basics
No ratings yet
Pandas Basics
84 pages
VP-2025JV0P10083-000-O94-001 - 1 - (Installation Manuals)
No ratings yet
VP-2025JV0P10083-000-O94-001 - 1 - (Installation Manuals)
32 pages
Odata Interview Question
20% (5)
Odata Interview Question
4 pages
On Data Handling Using Pandas-I
100% (2)
On Data Handling Using Pandas-I
63 pages
1 Data Handling Using Pandas 1
No ratings yet
1 Data Handling Using Pandas 1
63 pages
On Data Handling Using Pandas-I
100% (2)
On Data Handling Using Pandas-I
64 pages
Python Data Frame New
No ratings yet
Python Data Frame New
32 pages
Class XII IP Key Points (Python Pandas)
No ratings yet
Class XII IP Key Points (Python Pandas)
5 pages
Unit 4
No ratings yet
Unit 4
36 pages
The Pandas Library
No ratings yet
The Pandas Library
39 pages
Pandas Dataframe Export The CSV File
No ratings yet
Pandas Dataframe Export The CSV File
9 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
Data Handling Using Pandas-I-ORG
No ratings yet
Data Handling Using Pandas-I-ORG
44 pages
04-Data Manipulation With Pandas
No ratings yet
04-Data Manipulation With Pandas
28 pages
Lab 9
No ratings yet
Lab 9
9 pages
Pandas
No ratings yet
Pandas
29 pages
Class XII Data Handlinng Using PandasI
No ratings yet
Class XII Data Handlinng Using PandasI
46 pages
1 Data Handlinng Using Pandas-I
No ratings yet
1 Data Handlinng Using Pandas-I
46 pages
Pandas Class 12 Ncertttt
No ratings yet
Pandas Class 12 Ncertttt
48 pages
DataFrame Ac Win Final
No ratings yet
DataFrame Ac Win Final
30 pages
Pandas
No ratings yet
Pandas
13 pages
Data Handing Using Pandas-I
100% (2)
Data Handing Using Pandas-I
46 pages
Data Frames
No ratings yet
Data Frames
60 pages
Data Handlinng Using Pandas
No ratings yet
Data Handlinng Using Pandas
46 pages
18 Pandas
No ratings yet
18 Pandas
33 pages
Data Handlinng Using Pandas-I
No ratings yet
Data Handlinng Using Pandas-I
46 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
60 pages
Pandas
No ratings yet
Pandas
25 pages
Python 3rd Unit Question and Answer
No ratings yet
Python 3rd Unit Question and Answer
25 pages
Introduction To Pandas For Data Analysis
No ratings yet
Introduction To Pandas For Data Analysis
6 pages
Loki Temp PPT Pandas 2
No ratings yet
Loki Temp PPT Pandas 2
31 pages
Exp1 - Manipulating Datasets Using Pandas
No ratings yet
Exp1 - Manipulating Datasets Using Pandas
15 pages
Pandas
No ratings yet
Pandas
7 pages
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
No ratings yet
Pandas-Creating Series & Dataframes (DR V Gowri, Srmist)
47 pages
Python Pandas ch-2
No ratings yet
Python Pandas ch-2
56 pages
Week 4.1
No ratings yet
Week 4.1
16 pages
Panda
No ratings yet
Panda
46 pages
All Document Reader 1715619870900
No ratings yet
All Document Reader 1715619870900
6 pages
FDS Module 2 Notes
No ratings yet
FDS Module 2 Notes
24 pages
What Is Pandas
No ratings yet
What Is Pandas
9 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Cheat Sheet
No ratings yet
Cheat Sheet
10 pages
IP XII Ch2 Data Handling (DataFrame) H
No ratings yet
IP XII Ch2 Data Handling (DataFrame) H
9 pages
Chapter 1 - Part 2 - DataFrame
No ratings yet
Chapter 1 - Part 2 - DataFrame
48 pages
Python Pandas New Sylabus
No ratings yet
Python Pandas New Sylabus
53 pages
Unit 4.2
No ratings yet
Unit 4.2
24 pages
Data Series
No ratings yet
Data Series
3 pages
UNIT - 3 Pandas
No ratings yet
UNIT - 3 Pandas
21 pages
04 Introduction To Python-1
No ratings yet
04 Introduction To Python-1
29 pages
Pandas
No ratings yet
Pandas
12 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
10 pages
Pandas DataFrameObject
No ratings yet
Pandas DataFrameObject
4 pages
Pandas DataFrame
No ratings yet
Pandas DataFrame
70 pages
Pandas Python
No ratings yet
Pandas Python
11 pages
Pandas
No ratings yet
Pandas
5 pages
Pandas DataFrame Notes
67% (3)
Pandas DataFrame Notes
13 pages
UNIT II Notes
No ratings yet
UNIT II Notes
23 pages
Dataframe Ip
No ratings yet
Dataframe Ip
75 pages
DAP 3 Module
No ratings yet
DAP 3 Module
62 pages
Python Pandas Demo PDF
100% (2)
Python Pandas Demo PDF
23 pages
Algorithms and Data Structures: An Easy Guide to Programming Skills
From Everand
Algorithms and Data Structures: An Easy Guide to Programming Skills
Rigdon Jonathan
No ratings yet
Ian Talks Python A-Z
From Everand
Ian Talks Python A-Z
Ian Eress
No ratings yet
Daryl Kim Tech Resume
No ratings yet
Daryl Kim Tech Resume
2 pages
Qetero Service Booking Platform
No ratings yet
Qetero Service Booking Platform
23 pages
Certificacion 3M ISO9001
100% (1)
Certificacion 3M ISO9001
2 pages
SCMG
No ratings yet
SCMG
2 pages
Unit 4 - Cloud Programming Models
100% (2)
Unit 4 - Cloud Programming Models
21 pages
Student Researchers Guide New Template 1 Qualitative
No ratings yet
Student Researchers Guide New Template 1 Qualitative
3 pages
Learn Dutch On The Web Recommendations
No ratings yet
Learn Dutch On The Web Recommendations
3 pages
COSEC Reports Time Attendance
No ratings yet
COSEC Reports Time Attendance
74 pages
Admit Card: Important Points
No ratings yet
Admit Card: Important Points
1 page
Datasheet FSR
No ratings yet
Datasheet FSR
10 pages
MILLIPEDE Concept
No ratings yet
MILLIPEDE Concept
23 pages
The Evolution of Traditional To New Media
No ratings yet
The Evolution of Traditional To New Media
3 pages
Series: Small To Medium Displacement Vane Pump
No ratings yet
Series: Small To Medium Displacement Vane Pump
2 pages
Basic Exception Handling
No ratings yet
Basic Exception Handling
7 pages
Process Analysis and Simulation in Chemical Engineering
No ratings yet
Process Analysis and Simulation in Chemical Engineering
5 pages
Database Programming With SQL 16-1: Working With Sequences Practice Activities
No ratings yet
Database Programming With SQL 16-1: Working With Sequences Practice Activities
3 pages
5 Mva GTP For Export Job
No ratings yet
5 Mva GTP For Export Job
3 pages
Conf Ospf
No ratings yet
Conf Ospf
3 pages
WinterTech Inventions Volume 1
No ratings yet
WinterTech Inventions Volume 1
14 pages
Class 12 Competency Based Question - Computer Science Chap 8 (2024-25)
No ratings yet
Class 12 Competency Based Question - Computer Science Chap 8 (2024-25)
25 pages
Jenkins End To End
No ratings yet
Jenkins End To End
6 pages
BNYS Prospectus 2020 21
No ratings yet
BNYS Prospectus 2020 21
33 pages
MARK 301 Articles Summary
No ratings yet
MARK 301 Articles Summary
21 pages
III Term Paper EM
No ratings yet
III Term Paper EM
5 pages
Tech 301 To 400
No ratings yet
Tech 301 To 400
4 pages
DIP Lab-4
No ratings yet
DIP Lab-4
5 pages
Introduction To Cellular Mobile Radio Systems
No ratings yet
Introduction To Cellular Mobile Radio Systems
83 pages

Class Xii Information Practices PPT On Data Handling Using Pandas-I

Uploaded by

Class Xii Information Practices PPT On Data Handling Using Pandas-I

Uploaded by

Blue Print:

Unit Unit Name Marks

1 Data Handling using Pandas and Data 30

2 Database Query using SQL 25

3 Introduction to Computer Networks 7

(Data Handling using Pandas –I)

Framework: It is a collection of various libraries which

pip install pandas

<Series Name>=<pd>.Series(<list name>, ...)

➢ Creating Dataframe from list of dictionaries

➢ Creating Dataframe from dictionary of Series

Accessing a group of values:

In Pandas Dataframe we can iterate an element in two

● Iterating over rows

To iterate over the rows of the DataFrame, we can use

Differences between Series and DataFrame

You might also like