0% found this document useful (0 votes)

28 views11 pages

MLStack Cafe 2

Uploaded by

Ankita Kurle

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views11 pages

MLStack Cafe 2

Uploaded by

Ankita Kurle

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

MLStack.

Cafe - Kill Your Data Science & ML Interview

MLStack.Cafe - Kill Your Data Science & ML

Interview

Q1: How to create new columns derived from existing columns in

Pandas? ☆

Topics: Pandas

Answer:

We create a new column by assigning the output to the DataFrame with a new column name in between the
[] .
Let's say we want to create a new column 'C' whose values are the multiplication of column 'B' with
column 'A' . The operation will be easy to implement and will be element-wise, so there's no need to loop
over rows.

import pandas as pd

# Create example data

df = pd.DataFrame({
"A": [420, 380, 390],
"B": [50, 40, 45]
})

df["C"] = df["A"] * df["B"]

Also other mathematical operators ( + , - , \* , / ) or logical operators ( < , > , = , … ) work element-wise.
But if we need more advanced logic, we can use arbitrary Python code via apply() .
Depending on the case, we can use rename with a dictionary or function to rename row labels or column
names according to the problem.

Q2: How do you count unique values per group with Pandas? ☆

Topics: Pandas

Problem:

You are given the following dataframe:

>>> data = {'ID': [123, 123, 123, 456, 456, 456, 456, 789, 789],
"domain":['vk.com', 'vk.com', 'twitter.com', 'vk.com','facebook.com',
'vk.com','google.com','twitter.com','vk.com']
}

>>> df = pd.DataFrame(data)
>>> df
ID domain
0 123 vk.com
1 123 vk.com
2 123 twitter.com
3 456 vk.com
4 456 facebook.com
5 456 vk.com
6 456 google.com

Page 1 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

7 789 twitter.com
8 789 vk.com

You are required to count unique ID values in every domain .

Solution:

We can use the nunique() function:

>>> df = df.groupby('domain')['ID'].nunique()
>>> df
domain
facebook.com 1
google.com 1
twitter.com 2
vk.com 3
Name: ID, dtype: int64

Q3: How are iloc() and loc() diﬀerent? ☆☆

Topics: Pandas

Answer:

DataFrame.iloc is a method used to retrieve data from a Data frame, and it is an integer position-based
locator (from 0 to length-1 of the axis), but may also be used with a boolean array. It takes input as
integer, arrays of integers, a slice object, boolean array and functions.

df.iloc[0]
df.iloc[-5:]
df.iloc[:, 2] # the : in the first position indicates all rows
df.iloc[:3, :3] # The upper-left 3 X 3 entries (assuming df has 3+ rows and columns)

DataFrame.loc gets rows (and/or columns) with particular labels. It takes input as a single label, list of
arrays and slice objects with labels.

df = pd.DataFrame(index=['a', 'b', 'c'], columns=['time', 'date', 'name'])

df.loc['a'] # equivalent to df.iloc[0]
df.loc['b':, 'date'] # equivalent to df.iloc[1:, 1]

Q4: What are the operations that Pandas Groupby method is based
on ? ☆☆

Topics: Pandas

Answer:

Splitting the data into groups based on some criteria.

Applying a function to each group independently.
Combining the results into a data structure.

Page 2 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

Q5: Describe how you will get the names of columns of a

DataFrame in Pandas ☆☆

Topics: Pandas

Answer:

By Simply iterating over columns, and printing the values.

for col in data.columns:

print(col)

Using .columns() method with the dataframe object, this returns the column labels of the DataFrame.

list(data.columns)

Using the column.values() method to return an array of index.

list(data.columns.values)

Using sorted() method, which will return the list of columns sorted in alphabetical order.

sorted(data)

Q6: In Pandas, what do you understand as a bar plot and how can
you generate a bar plot visualization ☆☆

Topics: Pandas

Answer:

A Bar Plot is a plot that presents categorical data with rectangular bars with lengths proportional to the
values that they represent.
A bar plot shows comparisons among discrete categories.
One axis of the plot shows the speciﬁc categories being compared, and the other axis represents a measured
value.

# Code Sample for how to plot

df.plot.bar(x='x_values’', y='y_values')

Page 3 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

Q7: How would you iterate over rows in a DataFrame in Pandas?

☆☆

Topics: Pandas

Answer:

DataFrame.iterrows is a generator which yields both the index and row (as a Series):

import pandas as pd

df = pd.DataFrame({'c1': [10, 11, 12], 'c2': [100, 110, 120]})

for index, row in df.iterrows():

print(row['c1'], row['c2'])

10 100
11 110
12 120

Q8: How to check whether a Pandas DataFrame is empty? ☆☆

Topics: Pandas

Answer:

You can use the attribute df.empty to check whether it's empty or not:

if df.empty:
print('DataFrame is empty!')

Page 4 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

Q9: If we have a date column in our dataset, then how will you
perform Feature Engineering using Python? ☆☆

Topics: Pandas Dimensionality Reduction Feature Engineering

Answer:

From a date column, we can get lots of important features such as:

day of the week,

day of the month,
day of the quarter, and
day of the year, etc.

Moreover, we can extract the date, month, and year from that column also.

All these features can impact our prediction and make our model robust. For example, in a case study, the sales
of the business can be impacted by the month or day of the week.

To perform this kind of feature engineering in Python, we must convert the data type associated with the date
column in a datetime type using the Pandas library as follows,

# convert date_column to datetime type

df.date_column = pd.to_datetime(df.date_column)

Now to extract the month, the day of the month, and the hour we use the following commands,

# extract month feature

months = df.date_column.dt.month

# extract day of month feature

day_of_months = df.date_column.dt.day

# extract hour feature

hours = df.date_column.dt.hour

Q10: How can you sort the DataFrame? ☆☆

Topics: Pandas

Answer:

The function used for sorting in pandas is called DataFrame.sort_values() . It is used to sort a DataFrame by its
column or row values. The function comes with a lot of parameters, but the most important ones to consider for
sort are:

by : The optional by parameter is used to specify the column/row(s) which are used to determine the sorted
order.
axis : speciﬁes whether sort for row ( 0 ) or columns ( 1 ),

ascending : speciﬁes whether to sort the dataframe in ascending or descending order. The default value is
ascending. To sort in descending order, we need to specify ascending=False .

Example

Page 5 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

>>> df = pd.DataFrame({
'col1': ['A', 'A', 'B', np.nan, 'D', 'C'],
'col2': [2, 1, 9, 8, 7, 4],
'col3': [0, 1, 9, 4, 2, 3],
'col4': ['a', 'B', 'c', 'D', 'e', 'F']
})

>>> df
col1 col2 col3 col4
0 A 2 0 a
1 A 1 1 B
2 B 9 9 c
3 NaN 8 4 D
4 D 7 2 e
5 C 4 3 F

#Sort by col1
>>> df.sort_values(by=['col1'])
col1 col2 col3 col4
0 A 2 0 a
1 A 1 1 B
2 B 9 9 c
5 C 4 3 F
4 D 7 2 e
3 NaN 8 4 D

# Sort by multiple columns

>>> df.sort_values(by=['col1', 'col2'])
col1 col2 col3 col4
1 A 1 1 B
0 A 2 0 a
2 B 9 9 c
5 C 4 3 F
4 D 7 2 e
3 NaN 8 4 D

# Sort descending
df.sort_values(by='col1', ascending=False)
col1 col2 col3 col4
4 D 7 2 e
5 C 4 3 F
2 B 9 9 c
0 A 2 0 a
1 A 1 1 B
3 NaN 8 4 D

Q11: How to convert str to datetime format in Pandas? ☆☆

Topics: Pandas

Answer:

Usign to_datetime() function we can not only convert str but int , float , list and more objects to
datetime . For example,

>>> df = pd.DataFrame({'A' : ['foo', 'bar', 'foo', 'bar'],

'B' : ['one', 'one', 'two', 'three'],
'C' : np.random.randn(4),
'I_date' : ['28-03-2021 2:15:00 PM', '28-03-2021 2:17:28 PM', '28-03-2021 2:50:50
PM', '28-03-2021 2:50:50 PM']
})

>>> df['I_date'] = pd.to_datetime(df['I_date'])

>>> df.dtypes
A object

Page 6 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

B object
C float64
I_date datetime64[ns]
dtype: object

Q12: What does describe() percentiles values tell about our data?
☆☆

Topics: Pandas

Answer:

The percentiles describe the distribution of your data: 50 should be a value that describes the middle of the
data, also known as median. 25 , 75 is the border of the upper/lower quarter of the data. With this can get an
idea of how skew our data is.

If the mean is higher than the median, the data is right skewed.

Q13: Deﬁne the diﬀerent ways a DataFrame can be created in

Pandas ☆☆

Topics: Pandas

Answer:

We can create a DataFrame using the following ways:

Constructing DataFrame from a dictionary:

>>> d = {'col1': [1, 2], 'col2': [3, 4]}

>>> df = pd.DataFrame(data=d)
>>> df
col1 col2
0 1 3
1 2 4

Constructing a DataFrame from numpy ndarray:

>>> df2 = pd.DataFrame(np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]),

columns=['a', 'b', 'c'])
>>> df2

Page 7 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

a b c
0 1 2 3
1 4 5 6
2 7 8 9

Q14: Why do should make a copy of a DataFrame in Pandas? ☆☆

Topics: Pandas

Answer:

In general, it is safer to work on copies than on original DataFrames, except when you know that you won't be
needing the original anymore and want to proceed with the manipulated version.

This is because in Pandas, indexing a DataFrame returns a reference to the initial DataFrame. Thus, changing
the subset will change the initial DataFrame. Thus, you'd want to use the copy if you want to make
sure the initial DataFrame shouldn't change.

Normally, you would still have some use for the original data frame to compare with the manipulated version,
etc. Therefore, depending on the case it's a good practice to work on copies and merge at the end.

Q15: What does the in operator do in Pandas? ☆☆

Topics: Pandas

Answer:

The in operator in Python tests dictionary keys, not values. In Pandas, Series are dict-like, therefore, the
in operator on a Series tests for membership in the index, not membership among the values. If we want
to test for membership in the values, we use the method isin() .

For DataFrames , likewise, in applies to the column axis, testing for membership in the list of column
names.

Q16: How can you ﬁnd the row for which the value of a speciﬁc
column is max or min? ☆☆

Topics: Pandas

Problem:

>>> import pandas as pd

>>> df = pandas.DataFrame(np.random.randn(5,3),columns=['A','B','C'])
>>> df
A B C
0 -0.068471 -0.006429 1.453785
1 -0.655960 0.084291 0.344351
2 -0.058856 0.025537 1.303488
3 -0.300120 -0.207405 1.108704
4 2.027010 0.190007 -0.064194

Solution:

Use the pandas idxmax and idxmin function. It's straightforward:

Page 8 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

Maximal value:

>>> df['A'].idxmax()
4

Minimal value:

>>>df['A'].idxmin()
1

Q17: How does the groupby() method works in Pandas? ☆☆

Topics: Pandas

Answer:

In the ﬁrst stage of the process, data contained in a pandas object, whether a Series , DataFrame , or
otherwise, is split into groups based on one or more keys that we provide.

The splitting is performed on a particular axis of an object. For example, a DataFrame can be grouped on its
rows (axis=0) or its columns (axis=1) .

Once this is done, a function is applied to each group, producing a new value. Finally, the results of all those
function applications are combined into a result object. The form of the resulting object will usually depend
on what's being done to the data.

In the ﬁgure below, this process is illustrated for a simple group aggregation.

Q18: How to get a count of the number of observations for each

year in the example dataframe? ☆☆

Page 9 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

Topics: Pandas

Problem:

df = pd.DataFrame(['2017-12-05',
'2016-12-05',
'2017-12-05',
'2015-12-05',
'2017-12-06',
'2018-12-06',
'2019-12-05',
'2019-11-05',
'2020-12-05',
'2017-12-07'], columns=['date'])

Solution:

We ﬁrst convert the date column from string dtype to datetime dtype and then we use value_counts() on the
year attribute.

>>> df['date'] = pd.to_datetime(df['date'])

>>> pd.to_datetime(df['date']).dt.year.value_counts()
2017 4
2019 2
2016 1
2015 1
2018 1
2020 1
Name: date, dtype: int64

Q19: A column in a df has boolean True/False values, but for

further calculations, we need 1/0 representation. How would you
transform it? ☆☆

Topics: Pandas

Answer:

A succinct way to convert a single column of boolean values to a column of integers 1 or 0 is:

df["somecolumn"] = df["somecolumn"].astype(int)

Q20: Name some methods you know to replace NaN values of a

DataFrame in Pandas ☆☆

Topics: Pandas

Answer:

To replace missing values in a Pandas DataFrame we can use the fillna() function, In Pandas, some methods
available to use in this function are:

pad / ffill : propagate last valid observation forward to next valid back with df.fillna(method="pad") .

Page 10 of 11
MLStack.Cafe - Kill Your Data Science & ML Interview

fill / bfill : use next valid observation to ﬁll the missing value with df.fillna(method="fill") .

Replace NaN with a scalar value with df.fillna(n) , where n can be int , str , etc.
Replace NaN with a PandasObject : the use case of this is to ﬁll a DataFrame with the resulting operation of
apply a function to a column. For example replace NaN values with a mean of some column
( df.fillna(df.mean() ).

Page 11 of 11

Pandas Handbook
No ratings yet
Pandas Handbook
33 pages
Knowledge Pillars Code Questions
No ratings yet
Knowledge Pillars Code Questions
46 pages
Pandas
No ratings yet
Pandas
13 pages
Unit Iv
No ratings yet
Unit Iv
63 pages
Pandas Interview Questions
No ratings yet
Pandas Interview Questions
21 pages
7 Days Analytics Course 3feiz7 4
No ratings yet
7 Days Analytics Course 3feiz7 4
8 pages
Data Wrangling With Python and Pandas
No ratings yet
Data Wrangling With Python and Pandas
7 pages
Pandas 1705297450
No ratings yet
Pandas 1705297450
21 pages
Top Python Questions 1735201448
No ratings yet
Top Python Questions 1735201448
25 pages
Pandas
No ratings yet
Pandas
5 pages
Pandas Questions
No ratings yet
Pandas Questions
11 pages
PYTHON Pandas and Manipulation Data
No ratings yet
PYTHON Pandas and Manipulation Data
36 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Commands SQL, Python (BASICS)
No ratings yet
Commands SQL, Python (BASICS)
7 pages
04-Data Manipulation With Pandas
No ratings yet
04-Data Manipulation With Pandas
28 pages
DataFrame Ac Win Final
No ratings yet
DataFrame Ac Win Final
30 pages
Lab 1 ML Lab
No ratings yet
Lab 1 ML Lab
15 pages
10 Minutes To Pandas - Pandas 2.1.1 Documentation
No ratings yet
10 Minutes To Pandas - Pandas 2.1.1 Documentation
24 pages
Data Frame
No ratings yet
Data Frame
95 pages
Pandas
No ratings yet
Pandas
94 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
1 page
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
39 pages
Introduction To Pandas
No ratings yet
Introduction To Pandas
26 pages
Cheat Sheet
No ratings yet
Cheat Sheet
10 pages
Lecture 14
No ratings yet
Lecture 14
33 pages
Data Frames
No ratings yet
Data Frames
60 pages
Pandas (Ziad)
No ratings yet
Pandas (Ziad)
38 pages
Lab-3 Pandas Library
No ratings yet
Lab-3 Pandas Library
14 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
Pandas - Digitalocean
No ratings yet
Pandas - Digitalocean
15 pages
Python Unit 4&5 Que
No ratings yet
Python Unit 4&5 Que
33 pages
Lab 9
No ratings yet
Lab 9
9 pages
Python Unit Iv - Pandas
No ratings yet
Python Unit Iv - Pandas
36 pages
DevOps Session 3 Pandas
No ratings yet
DevOps Session 3 Pandas
33 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
60 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
10 pages
Pandas DataFrameObject
No ratings yet
Pandas DataFrameObject
4 pages
Pandas Notes
No ratings yet
Pandas Notes
4 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
12 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
Pandas Advance Quiz - Data Science Masters - PW Skills
100% (1)
Pandas Advance Quiz - Data Science Masters - PW Skills
5 pages
Pandas
No ratings yet
Pandas
27 pages
Data Aggregation and Group Operations
No ratings yet
Data Aggregation and Group Operations
34 pages
Chapter 1 - Part 2 - DataFrame
No ratings yet
Chapter 1 - Part 2 - DataFrame
48 pages
20 Pandas Functions For 80% of Your Data Science
No ratings yet
20 Pandas Functions For 80% of Your Data Science
22 pages
Pandas
No ratings yet
Pandas
44 pages
01-Numpy & Pandas
No ratings yet
01-Numpy & Pandas
69 pages
Introduction To Pandas in Data Analytics
No ratings yet
Introduction To Pandas in Data Analytics
12 pages
DAP 3 Module
No ratings yet
DAP 3 Module
62 pages
Pandas
No ratings yet
Pandas
25 pages
Pandas PDF
No ratings yet
Pandas PDF
25 pages
Day64 - Pandas Interview Questions
No ratings yet
Day64 - Pandas Interview Questions
5 pages
Murali Internship
No ratings yet
Murali Internship
34 pages
10 Minutes To Pandas - Pandas 1.2.4 Documentation
No ratings yet
10 Minutes To Pandas - Pandas 1.2.4 Documentation
18 pages
CH-6 Data Loading, Storage, and File Formats
No ratings yet
CH-6 Data Loading, Storage, and File Formats
163 pages
Python Pandas Demo PDF
100% (2)
Python Pandas Demo PDF
23 pages
Reference Guide - Pandas Tools For Structuring A Dataset
No ratings yet
Reference Guide - Pandas Tools For Structuring A Dataset
5 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
10 pages
Pandas
No ratings yet
Pandas
24 pages
C++ Date Type
No ratings yet
C++ Date Type
4 pages
Chapter 8
No ratings yet
Chapter 8
16 pages
GCSE - Python-All Tasks - With Helpsheets
No ratings yet
GCSE - Python-All Tasks - With Helpsheets
285 pages
CodeGuru - C# 4.0 Cheat Sheet
100% (6)
CodeGuru - C# 4.0 Cheat Sheet
2 pages
MCQ (Key Answer)
No ratings yet
MCQ (Key Answer)
5 pages
C# Concepts
No ratings yet
C# Concepts
6 pages
Embedded C and RTOS Unit 1 and 2
No ratings yet
Embedded C and RTOS Unit 1 and 2
59 pages
STP60 SHP75 SunSpec Modbus TI en 13
No ratings yet
STP60 SHP75 SunSpec Modbus TI en 13
57 pages
Chisel3 Cheat Sheet: Basic Data Types
No ratings yet
Chisel3 Cheat Sheet: Basic Data Types
2 pages
Assignment 7 - Arrays
No ratings yet
Assignment 7 - Arrays
8 pages
Python Lecture 2018
No ratings yet
Python Lecture 2018
174 pages
What Parameters Are Passed To WinMain
No ratings yet
What Parameters Are Passed To WinMain
1 page
Coding Questions
No ratings yet
Coding Questions
166 pages
GRAB Test
No ratings yet
GRAB Test
6 pages
Follow & Share: William - Wilson
No ratings yet
Follow & Share: William - Wilson
13 pages
Online Shopping Mall Management 1
80% (5)
Online Shopping Mall Management 1
97 pages
Pololu - Arduino Library For The Pololu QTR Reflectance Sensors
No ratings yet
Pololu - Arduino Library For The Pololu QTR Reflectance Sensors
8 pages
CS8251-Programming in C Notes
81% (21)
CS8251-Programming in C Notes
91 pages
Assignments and Lab Works
No ratings yet
Assignments and Lab Works
6 pages
JEDI Slides-Intro1-Chapter07-Java Arrays
No ratings yet
JEDI Slides-Intro1-Chapter07-Java Arrays
23 pages
M3-R4 Programming and Problem Solving Through C PDF
No ratings yet
M3-R4 Programming and Problem Solving Through C PDF
16 pages
IT Practical File 2024 25 Using Libre Office
No ratings yet
IT Practical File 2024 25 Using Libre Office
43 pages
NGK Mpi
No ratings yet
NGK Mpi
74 pages
Blackberry Java SDK Development Guide 1249411 0803110230 001 6.0 US
No ratings yet
Blackberry Java SDK Development Guide 1249411 0803110230 001 6.0 US
55 pages
Java Buzzwords
No ratings yet
Java Buzzwords
58 pages
C Programming: Summing Arithmetic Series
No ratings yet
C Programming: Summing Arithmetic Series
4 pages
Java Interview Specific Codes
No ratings yet
Java Interview Specific Codes
57 pages
CS 159 - Spring 2021 - Lab #9: Contact Prior
No ratings yet
CS 159 - Spring 2021 - Lab #9: Contact Prior
5 pages
42,0410,2012
No ratings yet
42,0410,2012
62 pages

MLStack Cafe 2

Uploaded by

MLStack Cafe 2

Uploaded by

MLStack.

Cafe - Kill Your Data Science & ML Interview

MLStack.Cafe - Kill Your Data Science & ML

Q1: How to create new columns derived from existing columns in

# Create example data

df["C"] = df["A"] * df["B"]

You are given the following dataframe:

You are required to count unique ID values in every domain .

We can use the nunique() function:

Q3: How are iloc() and loc() diﬀerent? ☆☆

df = pd.DataFrame(index=['a', 'b', 'c'], columns=['time', 'date', 'name'])

Splitting the data into groups based on some criteria.

Q5: Describe how you will get the names of columns of a

By Simply iterating over columns, and printing the values.

for col in data.columns:

Using the column.values() method to return an array of index.

# Code Sample for how to plot

Q7: How would you iterate over rows in a DataFrame in Pandas?

df = pd.DataFrame({'c1': [10, 11, 12], 'c2': [100, 110, 120]})

for index, row in df.iterrows():

Q8: How to check whether a Pandas DataFrame is empty? ☆☆

Topics: Pandas Dimensionality Reduction Feature Engineering

day of the week,

# convert date_column to datetime type

# extract month feature

# extract day of month feature

# extract hour feature

Q10: How can you sort the DataFrame? ☆☆

# Sort by multiple columns

Q11: How to convert str to datetime format in Pandas? ☆☆

>>> df = pd.DataFrame({'A' : ['foo', 'bar', 'foo', 'bar'],

>>> df['I_date'] = pd.to_datetime(df['I_date'])

Q13: Deﬁne the diﬀerent ways a DataFrame can be created in

We can create a DataFrame using the following ways:

Constructing DataFrame from a dictionary:

>>> d = {'col1': [1, 2], 'col2': [3, 4]}

Constructing a DataFrame from numpy ndarray:

>>> df2 = pd.DataFrame(np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]),

Q14: Why do should make a copy of a DataFrame in Pandas? ☆☆

Q15: What does the in operator do in Pandas? ☆☆

>>> import pandas as pd

Use the pandas idxmax and idxmin function. It's straightforward:

Q17: How does the groupby() method works in Pandas? ☆☆

Q18: How to get a count of the number of observations for each

>>> df['date'] = pd.to_datetime(df['date'])

Q19: A column in a df has boolean True/False values, but for

Q20: Name some methods you know to replace NaN values of a

You might also like