0% found this document useful (0 votes)
5 views27 pages

Disha Data Science

The document outlines practical exercises for Excel and Python, focusing on data analysis techniques such as conditional formatting, pivot tables, VLOOKUP, and what-if analysis in Excel, as well as data frame manipulation, pre-processing, feature scaling, and hypothesis testing in Python. It includes step-by-step instructions for each task, emphasizing the use of statistical tests and data handling methods. Additionally, it covers ANOVA for comparing means across groups.

Uploaded by

bhoslesam177
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views27 pages

Disha Data Science

The document outlines practical exercises for Excel and Python, focusing on data analysis techniques such as conditional formatting, pivot tables, VLOOKUP, and what-if analysis in Excel, as well as data frame manipulation, pre-processing, feature scaling, and hypothesis testing in Python. It includes step-by-step instructions for each task, emphasizing the use of statistical tests and data handling methods. Additionally, it covers ANOVA for comparing means across groups.

Uploaded by

bhoslesam177
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 27

PRACTICAL 1

Introduction to Excel

A. Perform conditional formatting on a dataset using various criteria.

Steps :

Step 1 : Go to conditional formatting > Greater Than


Step 2: Enter the greater than filter value for example 2000.

Step 3: Go to Data Bars > Solid Fill in conditional formatting.


B. Create a pivot table to analyse and summarize data.
Steps :

Step 1: select the entire table and go to Insert tab PivotChart > Pivotchart.
Step 2: Select “New worksheet” in the create pivot chart window.

Step 3: Select and drag attributes in the below boxes.


C. Use VLOOKUP function to retrieve information from a different
worksheet or table.
Steps :

Step 1: click on an empty cell and type the following command.


=VLOOKUP(A9, A9:C9,1, TRUE)

B. Perform what-if analysis using Goal Seek to determine input values for desiredoutput.
Steps :

Step 1: In the Data tab go to the what if analysis>Goal seek.


Step 2: Fill the information in the window accordingly and click ok.
PRACTICAL 2 (Python)
Data Frames and Basic Data Pre-processing
A.Read data from CSV and JSON files into a data frame

1.Read data from a csv.

PRACTICAL 2 .A

2.Reading data from a JSON file:


PRACTICAL 2
B. Perform basic data pre-processing tasks such as handling missing values and outliers.
1. Replacing NA values using fillna()

OUTPUT :
PRACTICAL 2.B
2. Dropping NA values using dropna()

OUTPUT :
PRACTICAL 2
C. Manipulate and transform data using functions like filtering, sorting, and
Grouping

OUTPUT :
PRACTICAL 2 A.1(R-STUDIO)

OUTPUT :
PRACTICAL 2 A.2
PRACTICAL 2 B.1 (R-STUDIO)

Output :
PRACTICAL 2 B.2

OUTPUT :
PRACTICAL 2 C

OUTPUT :
PRACTICAL 3 (Python)
Feature Scaling and Dummification

A. Apply feature-scaling techniques like standardization and normalization to numerical


features.

OUTPUT :
PRACTICAL 3 (Python)

B. Perform feature Dummification to convert categorical variables into numerical


representations.

OUTPUT :
PRACTICAL 3 A (R-STUDIO)

OUTPUT :
PRACTICAL 3 B

OUTPUT ;
PRACTICAL 4
Hypothesis Testing
Conduct a hypothesis test using appropriate statistical tests (e.g., t-test, chi-square test)
# t-test
PRACTICAL 4
#chi-square test

OUTPUT :
Conclusion: There is sufficient evidence to reject the null hypothesis, indicating that there is a
significant association between 'horsepower_new' and 'modelyear_new' categories.
PRACTICAL 4 A (R-STUDIO)
PRACTICAL 4 B (R-STUDIO)
PRACTICAL 5 ( PYTHON)
ANOVA (Analysis of Variance)
Perform one-way ANOVA to compare means across multiple groups. Conduct post-hoc tests
to identify significant differences between group means.

OUTPUT :
PRACTICAL 5 ( R-STUDIO)

You might also like