IIT FDS Assignment2
IIT FDS Assignment2
In today’s data-driven world, businesses rely heavily on data analysis to make informed
decisions and gain competitive advantages. This assignment focuses on applying various
data analysis techniques to understand and interpret business data, leveraging statistical
tests to validate hypotheses and drive strategic decisions. The chosen dataset represents a
financial analytics scenario, providing a realistic context for students to apply their skills in
data visualization, univariate and multivariate analysis, and statistical testing.
2 Content
Students will work through a comprehensive case study that involves exploratory data
analysis, probability distributions, hypothesis testing, and A/B testing. The goal is to
provide a hands-on experience with real-world data, enhancing their analytical skills and
understanding of statistical concepts. This assignment will guide students through the
process of data profiling, visualization, and implementing various statistical tests to derive
meaningful insights and make data-driven decisions.
3 Data Description
The dataset used in this assignment is the "Credit Card Fraud Detection" dataset, available
on Kaggle. It contains transactions made by credit cards in September 2013 by European
cardholders. This dataset presents a real-world problem of identifying fraudulent
transactions, providing a perfect backdrop for financial analytics and hypothesis testing.
4 Objective
5 Tasks
3. Probability Distributions:
Implement and interpret discrete and continuous probability distributions within the
dataset.
Apply the Central Limit Theorem to demonstrate the distribution of sample means.
4. Hypothesis Testing:
Formulate and test hypotheses using the Null and Alternate Hypothesis framework.
Use the Critical Value Method and P-Value Method to make decisions.
Perform two-sample mean and proportion testing, ANOVA, and Chi-Square tests to validate
findings.
5. A/B Testing:
Implement A/B testing to compare two groups within the dataset and interpret the results.