0% found this document useful (0 votes)

13 views3 pages

Data Manipulation With Pandas - Python Data Science Handbook

good python book3

Uploaded by

nicholasdevera

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views3 pages

Data Manipulation With Pandas - Python Data Science Handbook

good python book3

Uploaded by

nicholasdevera

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

2/18/25, 6:43 PM Data Manipulation with Pandas | Python Data Science Handbook

This is an excerpt from the Python Data Science Handbook (http://shop.oreilly.com/product/0636920034919.do) by Jake
VanderPlas; Jupyter notebooks are available on GitHub (https://github.com/jakevdp/PythonDataScienceHandbook).

The text is released under the CC-BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/3.0/us/legalcode), and code
is released under the MIT license (https://opensource.org/licenses/MIT). If you find this content useful, please consider
supporting the work by buying the book (http://shop.oreilly.com/product/0636920034919.do)!

Data Manipulation with Pandas

< Structured Data: NumPy's Structured Arrays (02.09-structured-data-

numpy.html) | Contents (index.html) | Introducing Pandas Objects (03.01-
introducing-pandas-objects.html) >

Open in Colab

(https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/note
Introduction-to-Pandas.ipynb)

In the previous chapter, we dove into detail on NumPy and its ndarray object,
which provides efficient storage and manipulation of dense typed arrays in
Python. Here we'll build on this knowledge by looking in detail at the data
structures provided by the Pandas library. Pandas is a newer package built on
top of NumPy, and provides an efficient implementation of a DataFrame .
DataFrame s are essentially multidimensional arrays with attached row and
column labels, and often with heterogeneous types and/or missing data. As well
as offering a convenient storage interface for labeled data, Pandas implements
a number of powerful data operations familiar to users of both database
frameworks and spreadsheet programs.

As we saw, NumPy's ndarray data structure provides essential features for the
type of clean, well-organized data typically seen in numerical computing tasks.
While it serves this purpose very well, its limitations become clear when we
need more flexibility (e.g., attaching labels to data, working with missing data,
etc.) and when attempting operations that do not map well to element-wise
broadcasting (e.g., groupings, pivots, etc.), each of which is an important piece
of analyzing the less structured data available in many forms in the world
around us. Pandas, and in particular its Series and DataFrame objects, builds
on the NumPy array structure and provides efficient access to these sorts of
"data munging" tasks that occupy much of a data scientist's time.

In this chapter, we will focus on the mechanics of using Series , DataFrame ,

and related structures effectively. We will use examples drawn from real
datasets where appropriate, but these examples are not necessarily the focus.

https://jakevdp.github.io/PythonDataScienceHandbook/03.00-introduction-to-pandas.html 1/3
2/18/25, 6:43 PM Data Manipulation with Pandas | Python Data Science Handbook

# Installing and Using Pandas

Installation of Pandas on your system requires NumPy to be installed, and if
building the library from source, requires the appropriate tools to compile the C
and Cython sources on which Pandas is built. Details on this installation can be
found in the Pandas documentation (http://pandas.pydata.org/). If you
followed the advice outlined in the Preface (00.00-preface.html) and used the
Anaconda stack, you already have Pandas installed.

Once Pandas is installed, you can import it and check the version:

In [1]: import pandas

pandas.__version__

Out[1]: '0.18.1'

Just as we generally import NumPy under the alias np , we will import Pandas
under the alias pd :

In [2]: import pandas as pd

This import convention will be used throughout the remainder of this book.

# Reminder about Built-In

Documentation
As you read through this chapter, don't forget that IPython gives you the ability
to quickly explore the contents of a package (by using the tab-completion
feature) as well as the documentation of various functions (using the ?
character). (Refer back to Help and Documentation in IPython (01.01-help-and-
documentation.html) if you need a refresher on this.)

For example, to display all the contents of the pandas namespace, you can type

In [3]: pd.<TAB>

And to display Pandas's built-in documentation, you can use this:

In [4]: pd?

More detailed documentation, along with tutorials and other resources, can be
found at http://pandas.pydata.org/ (http://pandas.pydata.org/).

https://jakevdp.github.io/PythonDataScienceHandbook/03.00-introduction-to-pandas.html 2/3
2/18/25, 6:43 PM Data Manipulation with Pandas | Python Data Science Handbook

< Structured Data: NumPy's Structured Arrays (02.09-structured-data-

numpy.html) | Contents (index.html) | Introducing Pandas Objects (03.01-
introducing-pandas-objects.html) >

Open in Colab

(https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/note
Introduction-to-Pandas.ipynb)

https://jakevdp.github.io/PythonDataScienceHandbook/03.00-introduction-to-pandas.html 3/3

Introduction To Pandas
No ratings yet
Introduction To Pandas
2 pages
Python Data Science Handbook - Python Data Science Handbook
0% (5)
Python Data Science Handbook - Python Data Science Handbook
4 pages
Pandas Series - Notes For PA3
No ratings yet
Pandas Series - Notes For PA3
9 pages
Python Pandas Tutorial
96% (28)
Python Pandas Tutorial
178 pages
Ass1 DSBDA Writeup
No ratings yet
Ass1 DSBDA Writeup
8 pages
Python Data Science Packages Guide
No ratings yet
Python Data Science Packages Guide
11 pages
Lab Python Numpy Opencv
No ratings yet
Lab Python Numpy Opencv
45 pages
Day 10 Pandas For Data Science Part 1
No ratings yet
Day 10 Pandas For Data Science Part 1
38 pages
Manipulating and Analyzing Data With Pandas
No ratings yet
Manipulating and Analyzing Data With Pandas
50 pages
Pandas Guide for Data Science
No ratings yet
Pandas Guide for Data Science
42 pages
Python Data Science Handbook Python Data Science Handbook
0% (1)
Python Data Science Handbook Python Data Science Handbook
5 pages
Enrolled: (Self-Paced) Starts Jul 15, 2020
No ratings yet
Enrolled: (Self-Paced) Starts Jul 15, 2020
8 pages
Eda U2
No ratings yet
Eda U2
61 pages
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual Matt Harrison Instant Download
No ratings yet
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual Matt Harrison Instant Download
135 pages
UNIT II Material
No ratings yet
UNIT II Material
34 pages
Learning Pandas Library
100% (2)
Learning Pandas Library
271 pages
Practical Guide To Pandas For Data Science
100% (1)
Practical Guide To Pandas For Data Science
26 pages
Unit 4
No ratings yet
Unit 4
36 pages
NumPy & Pandas
No ratings yet
NumPy & Pandas
27 pages
Python Data Science Handbook - Python Data Science Handbook
No ratings yet
Python Data Science Handbook - Python Data Science Handbook
4 pages
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
100% (18)
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
208 pages
Module 2 Pandas 1
No ratings yet
Module 2 Pandas 1
79 pages
Week1 - Introduction To Machine Learning and Toolkit
No ratings yet
Week1 - Introduction To Machine Learning and Toolkit
102 pages
Pandas Assignment
No ratings yet
Pandas Assignment
12 pages
Resources ML
No ratings yet
Resources ML
2 pages
01 Data Handling Using Pandas I
No ratings yet
01 Data Handling Using Pandas I
19 pages
Python Data Science Handbook
No ratings yet
Python Data Science Handbook
7 pages
Python Pandas Tutorial
No ratings yet
Python Pandas Tutorial
6 pages
Unit III Part 2 1725700061785
No ratings yet
Unit III Part 2 1725700061785
85 pages
Pandas Introduction
No ratings yet
Pandas Introduction
4 pages
Python Pandas Tutorial PDF
100% (1)
Python Pandas Tutorial PDF
13 pages
Python Pandas Tutorial For Beginners
No ratings yet
Python Pandas Tutorial For Beginners
203 pages
Data Manipulation With Pandas and NumPy - Lect 3
No ratings yet
Data Manipulation With Pandas and NumPy - Lect 3
20 pages
MSBA315 Intro To Python For ML
No ratings yet
MSBA315 Intro To Python For ML
3 pages
Practical - 3 (Ai)
No ratings yet
Practical - 3 (Ai)
12 pages
101 - Introducing DataFrames - Python
No ratings yet
101 - Introducing DataFrames - Python
2 pages
Python Data Analysis with Pandas
No ratings yet
Python Data Analysis with Pandas
30 pages
Report
No ratings yet
Report
18 pages
Module 4
No ratings yet
Module 4
57 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
63 pages
Wa0005.
No ratings yet
Wa0005.
29 pages
01 Introduction To Python
No ratings yet
01 Introduction To Python
36 pages
Unit III - Notes
No ratings yet
Unit III - Notes
12 pages
Python Pandas Beginner's Guide
No ratings yet
Python Pandas Beginner's Guide
45 pages
RAW Data
No ratings yet
RAW Data
22 pages
Python Libraries for Data Science
No ratings yet
Python Libraries for Data Science
96 pages
Python-Numpy & Pandas
No ratings yet
Python-Numpy & Pandas
78 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
4 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
4 pages
PPS - Unit 5 (Imp Topics)
No ratings yet
PPS - Unit 5 (Imp Topics)
7 pages
Pandas Learndatasci
No ratings yet
Pandas Learndatasci
86 pages
Practical 7
No ratings yet
Practical 7
8 pages
Python Pandas
100% (1)
Python Pandas
96 pages
DV Lab2 Updated
No ratings yet
DV Lab2 Updated
12 pages
Attachment 3 Python For Data Analysis Lyst9850
No ratings yet
Attachment 3 Python For Data Analysis Lyst9850
31 pages
DSLab2020 - Week 1 Exercises
No ratings yet
DSLab2020 - Week 1 Exercises
30 pages
Python Pandas - I
No ratings yet
Python Pandas - I
32 pages
FDS Exp4
No ratings yet
FDS Exp4
5 pages
CHED Memorandum Order CMO Guidelines For Student Internship Abroad Program SIAP PDF
100% (2)
CHED Memorandum Order CMO Guidelines For Student Internship Abroad Program SIAP PDF
9 pages
Motivation Master Public Health
No ratings yet
Motivation Master Public Health
1 page
Activity: Let's Brainstorm: Fo Un Da Tio Na L Co Ur Se in en TR Ep Re Ne Ur
100% (1)
Activity: Let's Brainstorm: Fo Un Da Tio Na L Co Ur Se in en TR Ep Re Ne Ur
1 page
Madhuri Resume
No ratings yet
Madhuri Resume
2 pages
Revised ASSESSMENT 2 SYLLABUS 2023 2024 1
No ratings yet
Revised ASSESSMENT 2 SYLLABUS 2023 2024 1
21 pages
EAPP Q1 W5 Mod5 Outlining Technique
No ratings yet
EAPP Q1 W5 Mod5 Outlining Technique
14 pages
Vignette
No ratings yet
Vignette
3 pages
Match Analysis Manchester City Vs Bayern Munich - Bayern
No ratings yet
Match Analysis Manchester City Vs Bayern Munich - Bayern
5 pages
Notary Renewal Petition
No ratings yet
Notary Renewal Petition
3 pages
Lesson - 20-Cri-170
No ratings yet
Lesson - 20-Cri-170
20 pages
Word of Life Gopher Buddies Preschool Ministry Program Overview
No ratings yet
Word of Life Gopher Buddies Preschool Ministry Program Overview
12 pages
Detailed Lesson Plan in Math V I. Objectives: ST ST
100% (1)
Detailed Lesson Plan in Math V I. Objectives: ST ST
6 pages
Psy 210.full Notes
100% (1)
Psy 210.full Notes
56 pages
Customer Service ESL Worksheet
50% (2)
Customer Service ESL Worksheet
4 pages
Ch-5 Therapeutic Approaches - PPT 4
No ratings yet
Ch-5 Therapeutic Approaches - PPT 4
7 pages
Scoring Keys For Form F12
No ratings yet
Scoring Keys For Form F12
3 pages
Ucc Programmes and Cutoff Points
100% (1)
Ucc Programmes and Cutoff Points
3 pages
Bloom Gardner Systems of Equations-Kristans
No ratings yet
Bloom Gardner Systems of Equations-Kristans
2 pages
07 r05310304 Kinematics of Machinery
No ratings yet
07 r05310304 Kinematics of Machinery
9 pages
Esp8 Q4-Mod.21
100% (1)
Esp8 Q4-Mod.21
49 pages
8b - Final Reflection Paper Internship I
No ratings yet
8b - Final Reflection Paper Internship I
5 pages
Engleza Cls A 9 A B Bar2023
No ratings yet
Engleza Cls A 9 A B Bar2023
3 pages
Local Self Government
No ratings yet
Local Self Government
11 pages
Super Secret Sauce - March 2025
No ratings yet
Super Secret Sauce - March 2025
5 pages
Teen Motherhood Challenges
0% (1)
Teen Motherhood Challenges
3 pages
Sample Reflection Paper Thesis
100% (3)
Sample Reflection Paper Thesis
7 pages
Management Principles Course
No ratings yet
Management Principles Course
1 page
Mohamed Abdelgaber Mostafa Alashkar
No ratings yet
Mohamed Abdelgaber Mostafa Alashkar
2 pages
b17749785 PDF
No ratings yet
b17749785 PDF
144 pages

Data Manipulation With Pandas - Python Data Science Handbook

Uploaded by

Data Manipulation With Pandas - Python Data Science Handbook

Uploaded by

2/18/25, 6:43 PM Data Manipulation with Pandas | Python Data Science Handbook

Data Manipulation with Pandas

< Structured Data: NumPy's Structured Arrays (02.09-structured-data-

In this chapter, we will focus on the mechanics of using Series , DataFrame ,

# Installing and Using Pandas

In [1]: import pandas

In [2]: import pandas as pd

# Reminder about Built-In

And to display Pandas's built-in documentation, you can use this:

< Structured Data: NumPy's Structured Arrays (02.09-structured-data-

You might also like