Week 7
Use the “employment.csv” dataset to perform time series analysis and visualization by working
through the following questions.
1. Convert the datestamp column to a datetime object and set it as the index of
your DataFrame. Check whether each column has missing values.
2. Generate a boxplot showing the distribution of the unemployment rate for every industry.
3. Using a line chart, visualize the unemployment rate of workers by industry.
4. Plot the monthly and yearly trends.
5. Apply time series decomposition to your dataset to visualize the trend and seasonality.
6. Visualize the seasonality of the Agriculture, Health and Finance sectors.
7. Visualize the seasonality of multiple time series and the correlation between each time series
in the dataset.
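
As a starting point for questions 1, 4 and 7, the following is a minimal pandas sketch, not a reference solution. The sample rows are made up so the code runs without the file, and the industry column names (Agriculture, Finance, Health) are assumptions taken from question 6; the real columns in employment.csv may differ.

```python
import io

import pandas as pd

# Made-up rows standing in for employment.csv (assumed layout: one
# datestamp column plus one unemployment-rate column per industry).
SAMPLE_CSV = """datestamp,Agriculture,Finance,Health
2000-01-01,10.3,2.7,3.2
2000-02-01,11.5,2.8,
2000-03-01,10.4,2.6,3.1
2001-01-01,10.3,2.6,2.9
2001-02-01,15.0,2.8,2.5
2001-03-01,19.2,2.9,2.9
"""


def load_employment(source):
    """Read the CSV, parse datestamp as datetime and use it as the index."""
    df = pd.read_csv(source)
    df["datestamp"] = pd.to_datetime(df["datestamp"])
    return df.set_index("datestamp")


# Replace the StringIO object with the path "employment.csv" for the real data.
df = load_employment(io.StringIO(SAMPLE_CSV))

# Question 1: number of missing values in each column.
missing_per_column = df.isnull().sum()

# Question 4: average rate per calendar month and per year.
monthly_mean = df.groupby(df.index.month).mean()
yearly_mean = df.groupby(df.index.year).mean()

# Question 7: pairwise correlation between the industry series.
correlation = df.corr()
```

With matplotlib installed, `df.boxplot()`, `df.plot()` and `monthly_mean.plot()` would draw the charts asked for in questions 2 to 4. For question 5, one common option is `seasonal_decompose` from statsmodels, which is not shown here.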
3. Check whether any reviews are missing; if so, remove those rows.
4. Clean the data: convert uppercase characters to lowercase, replace contractions with
their expansions, and remove special characters and punctuation.
5. Add the polarity, review length, word count and average word length of
each review.
6. Visualize the distribution of the word count, review length, and polarity.
8. Visualize the number of reviews for each rating available in the dataset.
10. Visualize whether the review length changes with rating.
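
The cleaning and feature steps in the review questions could be sketched as below, using only the Python standard library. The sample reviews and the small `CONTRACTIONS` map are made up for illustration, and the helper names `clean_review` and `review_features` are hypothetical. Polarity is left out because it needs a sentiment library; TextBlob's `sentiment.polarity` is one commonly used choice.

```python
import re

# Small illustrative contraction map; a real one would be much longer.
CONTRACTIONS = {
    "can't": "cannot",
    "won't": "will not",
    "don't": "do not",
    "it's": "it is",
    "i'm": "i am",
}


def clean_review(text):
    """Lowercase, expand contractions, then drop punctuation and symbols."""
    text = text.lower().replace("\u2019", "'")  # normalise curly apostrophes
    for short, full in CONTRACTIONS.items():
        text = text.replace(short, full)
    text = re.sub(r"[^a-z0-9\s]", " ", text)  # keep letters, digits, spaces
    return re.sub(r"\s+", " ", text).strip()  # collapse repeated whitespace


def review_features(text):
    """Return review length, word count and average word length."""
    words = text.split()
    word_count = len(words)
    avg_word_len = sum(len(w) for w in words) / word_count if word_count else 0.0
    return {
        "review_len": len(text),
        "word_count": word_count,
        "avg_word_len": avg_word_len,
    }


# Made-up sample; None stands in for a missing review.
reviews = ["It's GREAT!!! Can't wait to wear it :)", None, "Don't like the fabric..."]

# Remove missing or empty reviews before cleaning.
present = [r for r in reviews if r is not None and r.strip()]
cleaned = [clean_review(r) for r in present]
features = [review_features(c) for c in cleaned]
```

Contractions are expanded before punctuation is stripped, since removing the apostrophe first would leave tokens such as "can t" that no longer match the map.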