Data Types in Pandas by Jaume Boguñá
Data Types in Pandas by Jaume Boguñá
Jaume Boguñá
Dive into Python
Data Types in Pandas
Numeric Types
int64 64-bit integer
Jaume Boguñá
Jaume Boguñá
Jaume Boguñá
Data type conversions are essential when working with different types of
data such as integers, floats, strings, and datetime objects.
1. astype()
2. pd.to_datetime()
Jaume Boguñá
Parameters
DataFrame.astype(
dtype,
copy=None,
errors='raise'
Jaume Boguñá
import pandas as pd
data = {
"Duration": ['50', '40', '45'],
"Pulse": [109, 117, 110],
"Calories": [409.1, 479.5, 340.8]
}
df = pd.DataFrame(data)
df.dtypes
Duration object
Pulse int64
Calories float64
Jaume Boguñá
df
Duration Pulse Calories
0 50 109 409.1
1 40 117 479.5
2 45 110 340.8
df
Duration Pulse Calories
0 50 109 409.1
1 40 117 479.5
2 45 110 340.8
Jaume Boguñá
Parameters
pd.to_datetime(arg,
errors='raise',
dayfirst=False,
yearfirst=False,
utc=False,
format=None,
exact=<no_default>,
unit=None,
infer_datetime_format=<no_default>,
origin='unix',
cache=True
)
Jaume Boguñá
import pandas as pd
Jaume Boguñá
import pandas as pd
# Convert to datetime
pd.to_datetime(timestamp_in_seconds, unit='s')
2023-01-01 00:00:00
Jaume Boguñá
Jaume Boguñá
import pandas as pd
# Unix timestamp
timestamp_in_seconds = [1609459200, 1609545600]
# Convert to UTC
utc_time = pd.to_datetime(timestamp_in_seconds,
unit='s', utc=True)
DatetimeIndex(['2021-01-01 00:00:00+00:00', '2021-
01-02 00:00:00+00:00'], dtype='datetime64[ns,
UTC]', freq=None)
Jaume Boguñá
import pandas as pd
# Repeated dates
repeated_dates = ['2024-10-03'] * 100000
Jaume Boguñá
Jaume Boguñá
Aerospace Engineer | Data Scientist