0% found this document useful (0 votes)

53 views

Read and Write CSV and XLS Files

The document discusses various methods for reading, writing, and manipulating dataframes using Pandas in Python. It shows how to read and write CSV and Excel files, group and aggregate data by columns, concatenate and merge multiple dataframes, and use numerical indexing with .loc and .iloc to select subsets of data.

Uploaded by

Sagar Khode

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views

Read and Write CSV and XLS Files

Uploaded by

Sagar Khode

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Read and write CSV and XLS files

In [1]:

import pandas as pd
df = pd.read_csv('weather_data.csv')
df

Out[1]:

day temperature windspeed event

0 1/1/2017 32 6 Rain

1 1/2/2017 35 7 Sunny

2 1/3/2017 28 2 Snow

3 1/4/2017 24 7 Snow

4 1/5/2017 32 4 Rain

5 1/6/2017 31 2 Sunny

In [0]:
#INSTALL: pip3 install xlrd

#read excel file

df = pd.read_excel('weather_data.xlsx')
df

In [0]:
#write DF to csv
df.to_csv('new.csv')
df.to_csv('new_noIndex.csv', index=False)

In [0]:
# INSTALL: pip3 install openpyxl

#write DF to Excel
df.to_excel('new.xlsx', sheet_name='weather_data')

GROUP-BY
In [0]:

import pandas as pd
df = pd.read_csv('weather_data_cities.csv')
df #weather by cities

Out[0]:

day city temperature windspeed event

0 1/1/2017 new york 32 6 Rain

1 1/2/2017 new york 36 7 Sunny

2 1/3/2017 new york 28 12 Snow

3 1/4/2017 new york 33 7 Sunny

4 1/1/2017 mumbai 90 5 Sunny

5 1/2/2017 mumbai 85 12 Fog

6 1/3/2017 mumbai 87 15 Fog

day city temperature windspeed event
7 1/4/2017 mumbai 92 5 Rain

8 1/1/2017 paris 45 20 Sunny

9 1/2/2017 paris 50 13 Cloudy

10 1/3/2017 paris 54 8 Cloudy

11 1/4/2017 paris 42 10 Cloudy

In [0]:

g = df.groupby('city')
g

Out[0]:

<pandas.core.groupby.DataFrameGroupBy object at 0x106d495f8>

In [0]:
for city, city_df in g:
print(city)
print(city_df)

mumbai
day city temperature windspeed event
4 1/1/2017 mumbai 90 5 Sunny
5 1/2/2017 mumbai 85 12 Fog
6 1/3/2017 mumbai 87 15 Fog
7 1/4/2017 mumbai 92 5 Rain
new york
day city temperature windspeed event
0 1/1/2017 new york 32 6 Rain
1 1/2/2017 new york 36 7 Sunny
2 1/3/2017 new york 28 12 Snow
3 1/4/2017 new york 33 7 Sunny
paris
day city temperature windspeed event
8 1/1/2017 paris 45 20 Sunny
9 1/2/2017 paris 50 13 Cloudy
10 1/3/2017 paris 54 8 Cloudy
11 1/4/2017 paris 42 10 Cloudy

In [0]:
#or to get specific group
g.get_group('new york')

Out[0]:

day city temperature windspeed event

0 1/1/2017 new york 32 6 Rain

1 1/2/2017 new york 36 7 Sunny

2 1/3/2017 new york 28 12 Snow

3 1/4/2017 new york 33 7 Sunny

In [0]:
#Find maximum temperature in each of the cities
print(g.max())

day temperature windspeed event

city
mumbai 1/4/2017 92 15 Sunny
new york 1/4/2017 36 12 Sunny
paris 1/4/2017 54 20 Sunny
In [0]:

print(g.mean())

temperature windspeed
city
mumbai 88.50 9.25
new york 32.25 8.00
paris 47.75 12.75

In [0]:
print(g.describe())

temperature \
count mean std min 25% 50% 75% max
city
mumbai 4.0 88.50 3.109126 85.0 86.50 88.5 90.50 92.0
new york 4.0 32.25 3.304038 28.0 31.00 32.5 33.75 36.0
paris 4.0 47.75 5.315073 42.0 44.25 47.5 51.00 54.0

windspeed
count mean std min 25% 50% 75% max
city
mumbai 4.0 9.25 5.057997 5.0 5.00 8.5 12.75 15.0
new york 4.0 8.00 2.708013 6.0 6.75 7.0 8.25 12.0
paris 4.0 12.75 5.251984 8.0 9.50 11.5 14.75 20.0

concatenate Data Frames

In [0]:
import pandas as pd
india_weather = pd.DataFrame({
"city": ["mumbai","delhi","banglore"],
"temperature": [32,45,30],
"humidity": [80, 60, 78]
})

india_weather

Out[0]:

city humidity temperature

0 mumbai 80 32

1 delhi 60 45

2 banglore 78 30

In [0]:

us_weather = pd.DataFrame({
"city": ["new york","chicago","orlando"],
"temperature": [21,14,35],
"humidity": [68, 65, 75]
})
us_weather

Out[0]:

city humidity temperature

0 new york 68 21

1 chicago 65 14

2 orlando 75 35
In [0]:

#concate two dataframes

df = pd.concat([india_weather, us_weather])
df

Out[0]:

city humidity temperature

0 mumbai 80 32

1 delhi 60 45

2 banglore 78 30

0 new york 68 21

1 chicago 65 14

2 orlando 75 35

In [0]:
#if you want continuous index
df = pd.concat([india_weather, us_weather], ignore_index=True)
df

Out[0]:

city humidity temperature

0 mumbai 80 32

1 delhi 60 45

2 banglore 78 30

3 new york 68 21

4 chicago 65 14

5 orlando 75 35

In [0]:
df = pd.concat([india_weather, us_weather],axis=1)
df

Out[0]:

city humidity temperature city humidity temperature

0 mumbai 80 32 new york 68 21

1 delhi 60 45 chicago 65 14

2 banglore 78 30 orlando 75 35

Merge DataFrames
In [0]:
temperature_df = pd.DataFrame({
"city": ["mumbai","delhi","banglore", 'hyderabad'],
"temperature": [32,45,30,40]})
temperature_df

Out[0]:

city temperature

0 mumbai 32

1 delhi 45
1 delhi 45
city temperature
2 banglore 30

3 hyderabad 40

In [0]:

humidity_df = pd.DataFrame({
"city": ["delhi","mumbai","banglore"],
"humidity": [68, 65, 75]})
humidity_df

Out[0]:

city humidity

0 delhi 68

1 mumbai 65

2 banglore 75

In [0]:

#merge two dataframes with out explicitly mention index

df = pd.merge(temperature_df, humidity_df, on='city')
df

Out[0]:

city temperature humidity

0 mumbai 32 65

1 delhi 45 68

2 banglore 30 75

In [0]:

#OUTER-JOIN
df = pd.merge(temperature_df, humidity_df, on='city', how='outer')
df

Out[0]:

city temperature humidity

0 mumbai 32 65.0

1 delhi 45 68.0

2 banglore 30 75.0

3 hyderabad 40 NaN

Numerical Indexing (.loc vs iloc)

In [0]:
import pandas as pd
import numpy as np

In [0]:
df = pd.DataFrame([1,2,3,4,5,6,7,8,9,19], index=[49,48,47,46,45, 1, 2, 3, 4, 5])
df

Out[0]:
0

49 1

48 2

47 3

46 4

45 5

1 6

2 7

3 8

4 9

5 19

In [0]:

s.loc[:2]

---------------------------------------------------------------------------
NameError Traceback (most recent call last)
<ipython-input-3-a7a6a418f874> in <module>()
----> 1 s.loc[:2]

NameError: name 's' is not defined

In [0]:

s.iloc[:2]

Out[0]:

49 1
48 2
dtype: int64

In [0]:
s.loc[45]

Out[0]:
5

In [0]:

s.iloc[45]

---------------------------------------------------------------------------
IndexError Traceback (most recent call last)
<ipython-input-20-a6772688a529> in <module>()
----> 1 s.iloc[45]

/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/site-
packages/pandas/core/indexing.py in __getitem__(self, key)
1326 else:
1327 key = com._apply_if_callable(key, self.obj)
-> 1328 return self._getitem_axis(key, axis=0)
1329
1330 def _is_scalar_access(self, key):

/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/site-
packages/pandas/core/indexing.py in _getitem_axis(self, key, axis)
1747
1748 # validate the location
-> 1749 self._is_valid_integer(key, axis)
1750
1751 return self._get_loc(key, axis=axis)
/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/site-
packages/pandas/core/indexing.py in _is_valid_integer(self, key, axis)
1636 l = len(ax)
1637 if key >= l or key < -l:
-> 1638 raise IndexError("single positional indexer is out-of-bounds")
1639 return True
1640

IndexError: single positional indexer is out-of-bounds

In [0]:

Silt Trap Calculation (Msma I)
100% (2)
Silt Trap Calculation (Msma I)
3 pages
Review of The Adam and Eve Story by Chan Thomas
100% (7)
Review of The Adam and Eve Story by Chan Thomas
17 pages
Methodology For Pollution Control of Lakes and Management
No ratings yet
Methodology For Pollution Control of Lakes and Management
19 pages
Arid Landforms
100% (1)
Arid Landforms
43 pages
Pandas Group by PDF
No ratings yet
Pandas Group by PDF
7 pages
Python
No ratings yet
Python
3 pages
Panda 2
No ratings yet
Panda 2
2 pages
pandas_workshop - Jupyter Notebook
No ratings yet
pandas_workshop - Jupyter Notebook
5 pages
Dataframes - Jupyter Notebook
No ratings yet
Dataframes - Jupyter Notebook
9 pages
MLT Use Case
No ratings yet
MLT Use Case
13 pages
hanoi 2019 và 2020-descriptive statistics
No ratings yet
hanoi 2019 và 2020-descriptive statistics
7 pages
DS_task-2
No ratings yet
DS_task-2
6 pages
Project Information-Gain
No ratings yet
Project Information-Gain
5 pages
hcm temp 2019 và 2020-descriptive statistics
No ratings yet
hcm temp 2019 và 2020-descriptive statistics
6 pages
explainable-ai-driven-rainfall-prediction-using-dl
No ratings yet
explainable-ai-driven-rainfall-prediction-using-dl
66 pages
Acknowledgement
No ratings yet
Acknowledgement
25 pages
MLRecord
No ratings yet
MLRecord
24 pages
Dma 89
No ratings yet
Dma 89
21 pages
Practical 2.ipynb - Colaboratory
No ratings yet
Practical 2.ipynb - Colaboratory
2 pages
DF Ques1
No ratings yet
DF Ques1
2 pages
wether excel
No ratings yet
wether excel
5 pages
Tcs EDA Question
0% (1)
Tcs EDA Question
5 pages
CSE315:Introduction To Data Science: WEEK-8
No ratings yet
CSE315:Introduction To Data Science: WEEK-8
27 pages
World Air Quality Analysis
No ratings yet
World Air Quality Analysis
15 pages
Project Anish
No ratings yet
Project Anish
27 pages
Series Dataframeboardques
No ratings yet
Series Dataframeboardques
15 pages
Explore Weather Trends
No ratings yet
Explore Weather Trends
6 pages
Class XII-IP-Practical File 1
No ratings yet
Class XII-IP-Practical File 1
28 pages
DSBDA1 - Jupyter Notebook
No ratings yet
DSBDA1 - Jupyter Notebook
11 pages
PROGRAM NUMBER 22pdf
No ratings yet
PROGRAM NUMBER 22pdf
2 pages
DATAFRAME
No ratings yet
DATAFRAME
11 pages
Pandas Plots
No ratings yet
Pandas Plots
14 pages
HW4 Weather
No ratings yet
HW4 Weather
6 pages
01 - Python Pandas 1 & 2
No ratings yet
01 - Python Pandas 1 & 2
5 pages
I.P (Python Qs)
No ratings yet
I.P (Python Qs)
13 pages
PR 1
No ratings yet
PR 1
7 pages
DSBDA GRP A Print
No ratings yet
DSBDA GRP A Print
65 pages
Class 10 Ai Practical
No ratings yet
Class 10 Ai Practical
7 pages
Python Libraries Cheat Sheets
No ratings yet
Python Libraries Cheat Sheets
6 pages
Dengue Case Prediction Using Machine Learning: Import As Import As Import As Import As Import
No ratings yet
Dengue Case Prediction Using Machine Learning: Import As Import As Import As Import As Import
137 pages
Weather
No ratings yet
Weather
1 page
41b Data Wrangling, Grouping and Aggregation
No ratings yet
41b Data Wrangling, Grouping and Aggregation
31 pages
RainFall - Prediction - Ipynb - Colaboratory
No ratings yet
RainFall - Prediction - Ipynb - Colaboratory
7 pages
Data Science Journal
No ratings yet
Data Science Journal
3 pages
Class 12-IP-Practical File 2024 1
No ratings yet
Class 12-IP-Practical File 2024 1
71 pages
1[1][1]
No ratings yet
1[1][1]
6 pages
air-quality-randomforest
No ratings yet
air-quality-randomforest
5 pages
Practical File Questions With Answers
No ratings yet
Practical File Questions With Answers
7 pages
Latihan Python
No ratings yet
Latihan Python
15 pages
Introduction to Matplotlib
No ratings yet
Introduction to Matplotlib
58 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Assessment On Tuple
No ratings yet
Assessment On Tuple
8 pages
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Lab 7
No ratings yet
Lab 7
6 pages
Document 1
No ratings yet
Document 1
16 pages
What Is Pandas-Python? Introduction and Installation
No ratings yet
What Is Pandas-Python? Introduction and Installation
2 pages
Fundamental - Python
No ratings yet
Fundamental - Python
3 pages
12 IP Pandas DataFrame - Question Bank
No ratings yet
12 IP Pandas DataFrame - Question Bank
10 pages
Bot_shreyasi_Assignment solutions
No ratings yet
Bot_shreyasi_Assignment solutions
5 pages
Weather Live
No ratings yet
Weather Live
2 pages
PRACTICAL FILE IP - Copy (1)
No ratings yet
PRACTICAL FILE IP - Copy (1)
27 pages
Rainfall - Prediction - Ipynb - Colaboratory
No ratings yet
Rainfall - Prediction - Ipynb - Colaboratory
10 pages
Profound Python Libraries
From Everand
Profound Python Libraries
Onder Teker
No ratings yet
Exam AZ-800: Administering Windows Server Hybrid Core Infrastructure Preparation
From Everand
Exam AZ-800: Administering Windows Server Hybrid Core Infrastructure Preparation
Georgio Daccache
No ratings yet
Assignment No.8: 8.1 Title
No ratings yet
Assignment No.8: 8.1 Title
6 pages
Laboratory Practice I (410246)
No ratings yet
Laboratory Practice I (410246)
28 pages
Oral Questions LP-II: Data Mining (Rapid Miner)
No ratings yet
Oral Questions LP-II: Data Mining (Rapid Miner)
6 pages
LP1 1
No ratings yet
LP1 1
129 pages
#4767 Proposal Guideline
No ratings yet
#4767 Proposal Guideline
1 page
Q1a) What Is Big Data? Explain Characteristics of Big Data (4M) Ans
No ratings yet
Q1a) What Is Big Data? Explain Characteristics of Big Data (4M) Ans
16 pages
Slope Maintenance Manual
100% (1)
Slope Maintenance Manual
111 pages
Atlas of Middle-Earth accordiing to ICE v1.8
No ratings yet
Atlas of Middle-Earth accordiing to ICE v1.8
91 pages
Initial Environmental Examination Report For The Improvements of WTP, Intake, Weir and Sludge Treatment Plant - Gatambe
No ratings yet
Initial Environmental Examination Report For The Improvements of WTP, Intake, Weir and Sludge Treatment Plant - Gatambe
91 pages
Geography of Antarctica Extreme Points of Antarctica List of Antarctic and Subantarctic Islands
No ratings yet
Geography of Antarctica Extreme Points of Antarctica List of Antarctic and Subantarctic Islands
8 pages
Geologic Time Scale
No ratings yet
Geologic Time Scale
56 pages
Climate Change
No ratings yet
Climate Change
3 pages
Environment Vocab Exercises
No ratings yet
Environment Vocab Exercises
2 pages
Module 2 - Landscape - Elemnts - Principles - Materials High
No ratings yet
Module 2 - Landscape - Elemnts - Principles - Materials High
26 pages
IGCSE Geography Syllabus
No ratings yet
IGCSE Geography Syllabus
9 pages
Mangroves Notes
No ratings yet
Mangroves Notes
4 pages
Tunnel Report44
No ratings yet
Tunnel Report44
33 pages
Notes On Natural Resources
No ratings yet
Notes On Natural Resources
18 pages
EnvironmentAL Essay
No ratings yet
EnvironmentAL Essay
12 pages
Bab - 11 Klassifikasi Tanah
No ratings yet
Bab - 11 Klassifikasi Tanah
35 pages
Basin Modelling
No ratings yet
Basin Modelling
6 pages
Geotechnical Engg
No ratings yet
Geotechnical Engg
259 pages
Lotusarise Com Important Rivers in India Upsc Srsltid=AfmBOooTBBHX 7astlbWzlUnF...
No ratings yet
Lotusarise Com Important Rivers in India Upsc Srsltid=AfmBOooTBBHX 7astlbWzlUnF...
49 pages
Embankment Assignment Two
No ratings yet
Embankment Assignment Two
25 pages
Tayo ch1
No ratings yet
Tayo ch1
6 pages
Download the PDF of Living Physical Geography 1st Edition Gervais Solutions Manual to read all chapters
100% (14)
Download the PDF of Living Physical Geography 1st Edition Gervais Solutions Manual to read all chapters
42 pages
Hydraulic Structure 1
No ratings yet
Hydraulic Structure 1
211 pages
Effect of Land Use Land Cover Changes On Land Surface Temperature During 1984-2020: A Case Study of Baghdad City Using Landsat Image
No ratings yet
Effect of Land Use Land Cover Changes On Land Surface Temperature During 1984-2020: A Case Study of Baghdad City Using Landsat Image
25 pages
20 Volcanoes in Philippines
No ratings yet
20 Volcanoes in Philippines
5 pages
Veblen Et Al, 1996
No ratings yet
Veblen Et Al, 1996
125 pages
Journey To CTR Notes Wblanks
No ratings yet
Journey To CTR Notes Wblanks
4 pages
The-Future-100-2023 Page 11
No ratings yet
The-Future-100-2023 Page 11
1 page

Read and Write CSV and XLS Files

Uploaded by

Read and Write CSV and XLS Files

Uploaded by

Read and write CSV and XLS files

day temperature windspeed event

#read excel file

day city temperature windspeed event

0 1/1/2017 new york 32 6 Rain

1 1/2/2017 new york 36 7 Sunny

2 1/3/2017 new york 28 12 Snow

3 1/4/2017 new york 33 7 Sunny

4 1/1/2017 mumbai 90 5 Sunny

5 1/2/2017 mumbai 85 12 Fog

6 1/3/2017 mumbai 87 15 Fog

8 1/1/2017 paris 45 20 Sunny

9 1/2/2017 paris 50 13 Cloudy

10 1/3/2017 paris 54 8 Cloudy

11 1/4/2017 paris 42 10 Cloudy

<pandas.core.groupby.DataFrameGroupBy object at 0x106d495f8>

day city temperature windspeed event

0 1/1/2017 new york 32 6 Rain

1 1/2/2017 new york 36 7 Sunny

2 1/3/2017 new york 28 12 Snow

3 1/4/2017 new york 33 7 Sunny

day temperature windspeed event

concatenate Data Frames

city humidity temperature

city humidity temperature

#concate two dataframes

city humidity temperature

city humidity temperature

city humidity temperature city humidity temperature

0 mumbai 80 32 new york 68 21

#merge two dataframes with out explicitly mention index

city temperature humidity

city temperature humidity

Numerical Indexing (.loc vs iloc)

NameError: name 's' is not defined

IndexError: single positional indexer is out-of-bounds

You might also like