Assess Your Project Knowledge
Assess Your Project Knowledge
Question 1
Which of the following is a technique used in data preprocessing? (select all that apply)
Question 2
Question 3
Which method in PySpark DataFrames is used to print column data types in a DataFrame?
show()
printSchema()
describe()
head()
Question 4
What is the purpose of the Train-Test split method for training a machine learning model?
(select all that apply)
Question 5
Question 6
Question 7
Question 8
Question 9
What are the steps needed to be taken to prepare the Numerical Features for PySpark
machine learning model?
Question 10
How do you find the count of each unique value in a categorical column in a dataframe called
df?
df.groupby(column_name).count()
df.uniquecount(column_name)
df.uniques(column_name)
df.count(column_name)