Data Analysis Jury Document
Data Analysis Jury Document
Submitted by-
Sheetal (BFT/22/166)
Dated- 13.05.2025
Page | 1
ACKNOWLEDGEMENT
In preparation of our assignment, we had to take the help and
guidance of a few respected sources, who deserve our deepest
gratitude. As the completion of this assignment gave us much
pleasure, we would like to show our gratitude towards. Prof TVSN
Murthy, Data Analysis & R Instructor, National Institute of Fashion
Technology, Hyderabad, who, after numerous consultations, guided
us well on this opportunistic assignment. In addition, we would also
like to thank him for introducing us to the methodology of work,
and whose passion for the "underlying structures" had lasting effect.
We would also like to thank our parents for motivating us with the
assignment. Many people, especially our classmates have made
valuable comments on our assignment which inspired us to improve
the overall quality of it.
Page | 2
Contents
9 Advantages of R Programming 8
10 Disadvantages of R Programming 8
11 Applications of R Programming 9
12 Applications of R Programming in 9
manufacturing
14 Conclusion 22
15 References 23
Page | 3
What is Data Analysis?
A Data analysis involves several key steps that help us to get insights from the
raw data Now Let’s understand the process of Data Analysis.
Page | 4
Define Objectives : Clearly define the goals of the analysis and the
specific questions you aim to answer. Establish a clear understanding of
what insights or decisions the analyzed data should inform.
Data Collection: Gather relevant data from various sources. Ensure data
integrity, quality, and completeness. Organize the data in a format suitable
for analysis. There are two types of data: qualititative and quantitative data .
Data Cleaning and Preprocessing: Address missing values, handle
outliers, and transform the data into a usable format. Cleaning and
preprocessing steps are crucial for ensuring the accuracy and reliability of
the analysis.
Exploratory Data Analysis (EDA): Conduct exploratory analysis to
understand the characteristics of the data. Visualize distributions, identify
patterns, and calculate summary statistics. EDA helps in formulating
hypotheses and refining the analysis approach.
Statistical Analysis : Apply appropriate statistical methods or modeling
techniques to answer the defined questions. This step involves testing
hypotheses, building predictive models, or performing any analysis
required to derive meaningful insights from the data.
Visualization and Communication: Interpret the results in the context of
the original objectives. Communicate findings through reports,
visualizations, or presentations. Clearly articulate insights, conclusions, and
recommendations based on the analysis to support informed decision-
making.
Data Analysis are mainly divided into four types depending on the nature of
the data and the questions being
addressed.
1. Descriptive Analysis
Descriptive analysis helps us
understand what happened in the past.
Page | 5
It looks at historical data and summarizes it in a way that makes sense. For
example, a company might use descriptive analysis to see how much they sold
last year or to find out which product was most popular.
2. Diagnostic Analysis
Diagnostic analysis works hand in hand with Descriptive Analysis. As
descriptive Analysis finds out what happened in the past, diagnostic Analysis,
on the other hand, finds out why did that happen or what measures were taken
at that time, or how frequently it has happened. It helps businesses figure out
the reasons behind certain outcomes.
3. Predictive Analysis
By forecasting future trends based on historical data, Predictive
analysis predictive analysis enables organizations to prepare for upcoming
opportunities and challenges. For example, a store might use predictive
analysis to figure out what products will be popular in the upcoming season. It
helps businesses prepare for future events and make plans.
4. Prescriptive Analysis
Prescriptive Analysis is an advanced method that takes Predictive Analysis
insights and gives suggestions on the best actions to take. For example, if
predictive analysis shows that a certain product will be popular, prescriptive
analysis might suggest how much stock to buy or what marketing strategies to
use. It’s about giving businesses clear advice on how to act.
Several tools are available to facilitate effective data analysis. These tools can
range from simple spreadsheet applications to complex statistical software.
Some popular tools include:
Page | 6
What is R Programming Language?
The R Language is a powerful tool widely used for data analysis, statistical
computing and machine learning. Here are several reasons why professionals
across various fields prefer R:
R is free and open for everyone to use.
It was built for working with data, making it easy to clean and analyze
numbers.
R comes with many tools to create clear graphs and charts.
There are many add-on packages available.
It works well with other software and data formats, making it a flexible
choice.
Page | 7
Features of R Programming Language
The R Language is renowned for its extensive features that make it a powerful
tool for data analysis, statistical computing and visualization. Here are some of
the key features of R:
i. Cross-Platform Support: R works on Windows, Mac and Linux, so we
can use it no matter what computer we have.
ii. Easy-to-Learn Syntax: The language is designed with a simple style
that helps beginners pick it up quickly.
iii. Data Exploration: We can try out code and see results right away,
which makes it easy to test ideas.
iv. Community Resources: There are many forums, tutorials and help
guides available when we need assistance.
v. Regular Updates: New features and packages are added often, keeping
R up-to-date with the latest tools.
vi. Prototyping: It lets us quickly set up and test data models before
running them on larger datasets.
Advantages of R Programming
Page | 8
5. In R, everyone is welcome to provide new packages, bug fixes and code
enhancements.
Disadvantages of R Programming
Applications of R Programming
Page | 10
R Programming in Production (apparel industry)
1.Introduction
Production in the apparel industry involves strategic planning of what, how
much, and when to manufacture. Fashion demand changes rapidly due to
seasons, trends, and consumer preferences. Hence, integrating data analytics
into production decisions is not just helpful but it's essential. This project uses R
programming to process, analyze, and forecast apparel sales data from 2018–
2022 to support smarter production decisions.
2. Objective
Analyze product-level sales by category and season.
Forecast upcoming demand to avoid overproduction/stock-outs.
Visualize high-demand periods for production scaling.
Deliver actionable insights for production managers using R.
install.packages("tidyverse")
install.packages("lubridate")
install.packages("forecast")
install.packages("ggplot2")
install.packages("tseries")
library(tidyverse)
Page | 11
library(lubridate)
library(forecast)
library(ggplot2)
library(tseries)
Why these?
6. Dataset Overview
Page | 12
summary(fashion_data)
Observations:
7.Data Cleaning
fashion_data$date <-
as.Date(paste(fashion_data$year_of_sale,fashion_data$month_of_sale,
Page | 13
# Check for invalid dates or outliers
summary(fashion_data$date)
Justification:
8. Data Transformation
Reasoning: This helps plot sales trends over time by product category,
library(dplyr)
Page | 14
9. Exploratory Data Analysis (EDA)
group_by(category) %>%
arrange(desc(total_sales))
Page | 15
B. Seasonal Sales Trend
group_by(season) %>%
summarise(total_sales = sum(sales_count))
Page | 16
10.Category-Wise Insights for Production(2018–2022)
Objective
To identify the top product categories in the fashion industry from 2018 to
2022 using R programming, and derive actionable production insights based on
category-wise trends
R Programming Implementation
Inspect Dataset
Page | 17
colnames(fashion_data) <- tolower(colnames(fashion_data))
# Preview structure
str(fashion_data)
library(dplyr)
group_by(category) %>%
arrange(desc(total)) %>%
slice_head(n = 5)
Page | 18
# Summarize count by year and category
library(ggplot2)
category)) +
Page | 19
fill = "Category") +
theme_minimal()Plot bar graph
● The bar chart shows how production volumes for each category
changed annually.
● Shirts and dresses dominate most years, while jackets peak closer
to winter seasons.
● Pants show consistent year-round demand.
● Strategic Production Recommendation
● Focus production on these four categories ahead of peak seasons:
○ Shirts – Spring/Summer
○ Jackets – Autumn/Winter
○ Dresses – Spring and Festive
○ Pants – Year-round
● Use this insight to plan inventory, raw materials, and workforce
allocation in advance.
Page | 20
11.Seasonal Trends for Production Planning
Peak Seasons: Spring and Summer dominate for most fashion categories.
group_by(date) %>%
summarise(total_sales = sum(sales_count))
#output in enviroinment
12)
# ARIMA Model
# Plot forecast
autoplot(forecast_result) +
labs(title = "Forecasted Sales for 2026", y = "Predicted Sales")
Page | 21
Interpretation:
● Dataset lacks:
Page | 22
15. Justification for Code Use
Conclusion
Page | 23
References
https://www.geeksforgeeks.org/what-is-data-analysis/
https://www.geeksforgeeks.org/r-programming-language-introduction/
https://ruralhandmade.com/blog/advancement-in-fashion-using-data-analytics-and-
r#:~:text=Supply%20chain%20optimization%3A%20R%20programming,production%20planning%2C
%20and%20inventory%20management
https://www.proserveit.com/blog/introduction-to-data-analysis#what-is
https://olibr.com/blog/what-are-the-disadvantages-and-advantages-of-using-r/
https://www.r-project.org/about.html
https://www.sigmamagic.com/blogs/analytics-advantages-and-limitations/#:~:text=One%20of
%20the%20biggest%20limitations,whose%20services%20they%20are%20using
https://rsult.one/erp-per-industry/erp-for-garment-manufacturing-industry-specific-benefits-and-
features/
R Documentation: https://cran.r-project.org/
THANK YOU
Page | 24