0% found this document useful (0 votes)
16 views24 pages

Data Analysis Jury Document

The document presents a comprehensive analysis of recent advancements in fashion studies using R programming, detailing the importance and process of data analysis. It covers various aspects of R, including its features, advantages, and applications in the manufacturing and apparel industry, particularly in production forecasting and strategic planning. The project aims to utilize R to analyze and forecast apparel sales data to support informed production decisions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views24 pages

Data Analysis Jury Document

The document presents a comprehensive analysis of recent advancements in fashion studies using R programming, detailing the importance and process of data analysis. It covers various aspects of R, including its features, advantages, and applications in the manufacturing and apparel industry, particularly in production forecasting and strategic planning. The project aims to utilize R to analyze and forecast apparel sales data to support informed production decisions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 24

DATA ANALYSIS & R

END TERM JURY ASSESSMENT


Topic - Recent Advancement in Fashion studies using
R Programming
Semester 6

Submitted by-

Pingili Roshni Reddy (BFT/22/149)

Sheetal (BFT/22/166)

Thanmayi Polisetti (BFT/22/434)

Dated- 13.05.2025

Department of Fashion Technology (DFT)


National Institute of Fashion Technology (NIFT)
(Ministry of Textiles, Govt. of India)
Jaihind Enclave, HITEC City, Hyderabad, Telangana 500081

Page | 1
ACKNOWLEDGEMENT
In preparation of our assignment, we had to take the help and
guidance of a few respected sources, who deserve our deepest
gratitude. As the completion of this assignment gave us much
pleasure, we would like to show our gratitude towards. Prof TVSN
Murthy, Data Analysis & R Instructor, National Institute of Fashion
Technology, Hyderabad, who, after numerous consultations, guided
us well on this opportunistic assignment. In addition, we would also
like to thank him for introducing us to the methodology of work,
and whose passion for the "underlying structures" had lasting effect.
We would also like to thank our parents for motivating us with the
assignment. Many people, especially our classmates have made
valuable comments on our assignment which inspired us to improve
the overall quality of it.

Page | 2
Contents

SL.NO TOPICS PAGE


NO.
1 What is Data Analysis 4
2 Why Data Analysis is important 4

3 The Process of Data Analysis 4-5

4 Types of Data Analysis 5-6

5 Tools for Data Analysis 6

6 What is R Programming Language 7

7 Why Use R Programming 7

8 Features of R Programming Language 8

9 Advantages of R Programming 8

10 Disadvantages of R Programming 8

11 Applications of R Programming 9

12 Applications of R Programming in 9
manufacturing

13 R Programming in Production (apparel 10-22


industry)

14 Conclusion 22

15 References 23

Page | 3
What is Data Analysis?

Data analysis refers to the practice of examining datasets to draw conclusions


about the information they contain. It involves organizing, cleaning, and
studying the data to understand patterns or trends. Data analysis helps to
answer questions like “What is happening” or “Why is this happening”.

Why Data Analysis is important?

Data analysis is important because it helps us understand information so we


can make better choices. Let’s understand this in more detail:
1. Informed Decision-Making: When we look at data, it helps us make better
choices because we can see how things have worked in the past, what’s
happening right now, and what might happen in the future. It gives us the
facts to make smart decisions.
2. Business Intelligence: Analyzing data helps companies stay ahead of
others. By looking at things like what customers like, what’s trending in the
market, and where they can improve, they can plan better and make smarter
moves.
3. Problem Solving: It use in identifying and solving problems within a
system or process by revealing patterns or anomalies that require attention.
4. Performance Evaluation: If something isn’t working right, looking at data
helps us find out what’s wrong. It shows us patterns or issues we might not
notice otherwise, helping us fix problems.
5. Risk Management: Understanding patterns in data helps in predicting and
managing risks, allowing organizations to deal with the challenges.

The Process of Data Analysis

A Data analysis involves several key steps that help us to get insights from the
raw data Now Let’s understand the process of Data Analysis.

Page | 4
 Define Objectives : Clearly define the goals of the analysis and the
specific questions you aim to answer. Establish a clear understanding of
what insights or decisions the analyzed data should inform.
 Data Collection: Gather relevant data from various sources. Ensure data
integrity, quality, and completeness. Organize the data in a format suitable
for analysis. There are two types of data: qualititative and quantitative data .
 Data Cleaning and Preprocessing: Address missing values, handle
outliers, and transform the data into a usable format. Cleaning and
preprocessing steps are crucial for ensuring the accuracy and reliability of
the analysis.
 Exploratory Data Analysis (EDA): Conduct exploratory analysis to
understand the characteristics of the data. Visualize distributions, identify
patterns, and calculate summary statistics. EDA helps in formulating
hypotheses and refining the analysis approach.
 Statistical Analysis : Apply appropriate statistical methods or modeling
techniques to answer the defined questions. This step involves testing
hypotheses, building predictive models, or performing any analysis
required to derive meaningful insights from the data.
 Visualization and Communication: Interpret the results in the context of
the original objectives. Communicate findings through reports,
visualizations, or presentations. Clearly articulate insights, conclusions, and
recommendations based on the analysis to support informed decision-
making.

Types of Data Analysis

Data Analysis are mainly divided into four types depending on the nature of
the data and the questions being
addressed.

1. Descriptive Analysis
Descriptive analysis helps us
understand what happened in the past.

Page | 5
It looks at historical data and summarizes it in a way that makes sense. For
example, a company might use descriptive analysis to see how much they sold
last year or to find out which product was most popular.
2. Diagnostic Analysis
Diagnostic analysis works hand in hand with Descriptive Analysis. As
descriptive Analysis finds out what happened in the past, diagnostic Analysis,
on the other hand, finds out why did that happen or what measures were taken
at that time, or how frequently it has happened. It helps businesses figure out
the reasons behind certain outcomes.
3. Predictive Analysis
By forecasting future trends based on historical data, Predictive
analysis predictive analysis enables organizations to prepare for upcoming
opportunities and challenges. For example, a store might use predictive
analysis to figure out what products will be popular in the upcoming season. It
helps businesses prepare for future events and make plans.
4. Prescriptive Analysis
Prescriptive Analysis is an advanced method that takes Predictive Analysis
insights and gives suggestions on the best actions to take. For example, if
predictive analysis shows that a certain product will be popular, prescriptive
analysis might suggest how much stock to buy or what marketing strategies to
use. It’s about giving businesses clear advice on how to act.

Tools for Data Analysis

Several tools are available to facilitate effective data analysis. These tools can
range from simple spreadsheet applications to complex statistical software.
Some popular tools include:

i. SAS, used for advanced analytics and predictive modeling;


ii. Microsoft Excel, which is great for simple data manipulation and
visualizations;
iii. R, a free language for statistical analysis;
iv. Python, a versatile programming language with libraries for data
science;
v. Tableau Public, for creating interactive data visualizations;
vi. Knime, an open-source platform for data mining and machine learning;
vii. Power BI, a service for creating business intelligence dashboards and
reports.

Page | 6
What is R Programming Language?

R is a tool for data manipulation and


analysis, it is both a programming language
and a working environment. R is designed
to perform statistical calculations and
generate visual data representations. It
includes a wide array of statistical models,
such as regression, time-series analysis,
clustering and advanced machine learning
techniques.
Its built-in functions make it easy to work with tables, matrices and arrays, so
we can quickly clean data, change its format and summarize it. It has a large
collection of add-on packages like ggplot2 for making graphs, dplyr for
organizing data and caret for building simple machine learning models.

Why Use R Programming?

The R Language is a powerful tool widely used for data analysis, statistical
computing and machine learning. Here are several reasons why professionals
across various fields prefer R:
 R is free and open for everyone to use.
 It was built for working with data, making it easy to clean and analyze
numbers.
 R comes with many tools to create clear graphs and charts.
 There are many add-on packages available.
 It works well with other software and data formats, making it a flexible
choice.

Page | 7
Features of R Programming Language

The R Language is renowned for its extensive features that make it a powerful
tool for data analysis, statistical computing and visualization. Here are some of
the key features of R:
i. Cross-Platform Support: R works on Windows, Mac and Linux, so we
can use it no matter what computer we have.
ii. Easy-to-Learn Syntax: The language is designed with a simple style
that helps beginners pick it up quickly.
iii. Data Exploration: We can try out code and see results right away,
which makes it easy to test ideas.
iv. Community Resources: There are many forums, tutorials and help
guides available when we need assistance.
v. Regular Updates: New features and packages are added often, keeping
R up-to-date with the latest tools.
vi. Prototyping: It lets us quickly set up and test data models before
running them on larger datasets.

Advantages of R Programming

1. R is the most comprehensive statistical analysis package. As new


technology and concepts often appear first in R.
2. As R programming language is an open source. Thus, we can run R
anywhere and at any time.
3. R programming language is suitable for GNU/Linux and Windows
operating systems.
4. R programming is cross-platform and runs on any operating system.

Page | 8
5. In R, everyone is welcome to provide new packages, bug fixes and code
enhancements.

Disadvantages of R Programming

1. In the R programming language, the standard of some packages is less


than perfect.
2. Although, R commands give little pressure on memory management. So
R programming language may consume all available memory.
3. In R basically, nobody to complain if something doesn’t work.
4. R programming language is much slower than other programming
languages such as Python and MATLAB.

Applications of R Programming

1. We use R for Data Science. It gives us a broad variety of libraries


related to statistics. It also provides the environment for statistical
computing and design.
2. R is used by many quantitative analysts as its programming tool. Thus,
it helps in data importing and cleaning.
3. R is the most prevalent language. So many data analysts and research
programmers use it. Hence, it is used as a fundamental tool for finance.
4. Tech giants like Google, Facebook, Bing, Twitter, Accenture, Wipro
and many more using R nowadays.

Applications of R Programming in manufacturing

R helps the manufacturing industry by providing tools for process


optimization, quality control, and predictive maintenance. Manufacturers can
use statistical process control techniques in R software to ensure production
quality by analyzing data.

Predictive modeling helps predict equipment failures, improves maintenance,


and reduces the layoff period. Manufacturers can save money and improve
production by studying their machines' data. Using this information, they can
find and fix problems and improve things.

Supply chain optimization: R programming can optimize the fashion supply


chain by analyzing various factors such as demand forecasting, production
Page | 9
planning, and inventory management. By implementing optimization algorithms
in R, companies can minimize costs, reduce waste, and improve overall
efficiency in the fashion production and distribution process.

Page | 10
R Programming in Production (apparel industry)

"Production Forecasting and Strategic Planning in the Apparel Industry


Using R Programming"

1.Introduction
Production in the apparel industry involves strategic planning of what, how
much, and when to manufacture. Fashion demand changes rapidly due to
seasons, trends, and consumer preferences. Hence, integrating data analytics
into production decisions is not just helpful but it's essential. This project uses R
programming to process, analyze, and forecast apparel sales data from 2018–
2022 to support smarter production decisions.

Why it’s ideal for production:

 Shows real sales trends by month, year, and product type.

 Links seasonality with sales trends—vital for planning.

 Offers stock_level, useful for supply-demand balance.

2. Objective
 Analyze product-level sales by category and season.
 Forecast upcoming demand to avoid overproduction/stock-outs.
 Visualize high-demand periods for production scaling.
 Deliver actionable insights for production managers using R.

3.Dataset: Fashion Trend Dataset (2018–2022)


4. Attributes in the dataset: product_id, product_name, category, sales_count,
season, price, stock_level, month_of_sale, year_of_sale

5. R Programming Environment Setup

install.packages("tidyverse")

install.packages("lubridate")

install.packages("forecast")

install.packages("ggplot2")

install.packages("tseries")

library(tidyverse)

Page | 11
library(lubridate)

library(forecast)

library(ggplot2)
library(tseries)

Why these?

 tidyverse: Data manipulation (dplyr, ggplot2)

 lubridate: Handling date-time for monthly analysis

 forecast: ARIMA modeling for demand prediction


 ggplot2: High-quality production graphs

6. Dataset Overview

# Choose correct file path

fashion_data <- read.csv(file.choose())

# Check structure and summary


str(fashion_data)

Page | 12
summary(fashion_data)

Observations:

● Year: 2018 to 2022

● Over 10,000 rows of monthly records

● Variety of categories (Shirts, Jackets, Pants, etc.)


● Each row is a record of monthly product performan

7.Data Cleaning

# Remove missing values

fashion_data <- na.omit(fashion_data)

# Combine year and month into a date

fashion_data$date <-

as.Date(paste(fashion_data$year_of_sale,fashion_data$month_of_sale,

*"01", sep = "-"),format="%Y-%m-%d")

Page | 13
# Check for invalid dates or outliers
summary(fashion_data$date)

Justification:

 NA values can distort visualizations and modeling


 Creating a date column improves time-series modelling

8. Data Transformation

Reasoning: This helps plot sales trends over time by product category,

useful for seasonal production planning.

# Group by category and month

# Load necessary library

library(dplyr)

# Group by date and category to calculate monthly sales

monthly_sales <- fashion_data %>%

group_by(date, category) %>%

summarise(monthly_sales = sum(sales_count, na.rm = TRUE)) %>%


ungroup()

Page | 14
9. Exploratory Data Analysis (EDA)

A. Total Sales by Category

category_sales <- fashion_data %>%

group_by(category) %>%

summarise(total_sales = sum(sales_count)) %>%

arrange(desc(total_sales))

ggplot(category_sales, aes(x = reorder(category, -total_sales), y =

total_sales)) + geom_bar(stat = "identity", fill = "tomato") +

labs(title = "Total Sales by Category", x = "Category", y = "Total


Sales")

Page | 15
B. Seasonal Sales Trend

seasonal_sales <- fashion_data %>%

group_by(season) %>%

summarise(total_sales = sum(sales_count))

ggplot(seasonal_sales, aes(x = season, y = total_sales, fill =

season)) + geom_bar(stat = "identity") + labs(title = "Sales by


Season", x = "Season", y = "Total Sales")

Page | 16
10.Category-Wise Insights for Production(2018–2022)

Objective
To identify the top product categories in the fashion industry from 2018 to
2022 using R programming, and derive actionable production insights based on
category-wise trends

R Programming Implementation

Inspect Dataset

# Standardize column names

Page | 17
colnames(fashion_data) <- tolower(colnames(fashion_data))

# Preview structure

str(fashion_data)

Handle Date/Year Column

# Create 'year' column from date if available

if ("date" %in% colnames(fashion_data)) {

fashion_data$year <- format(as.Date(fashion_data$date), "%Y")

Identify Top Categories Overall

library(dplyr)

# Get overall top categories by total count

top_categories <- fashion_data %>%

group_by(category) %>%

summarise(total = n()) %>%

arrange(desc(total)) %>%

slice_head(n = 5)

top_cat_names <- top_categories$category


print(top_cat_names)

Filter Data for Top Categories and Yearly Count

# Filter only top categories

filtered_data <- fashion_data %>%

filter(category %in% top_cat_names)

Page | 18
# Summarize count by year and category

category_year_summary <- filtered_data %>%

group_by(year, category) %>%


summarise(count = n())

Visualize Trends Over Years

library(ggplot2)

ggplot(category_year_summary, aes(x = year, y = count, fill =

category)) +

geom_bar(stat = "identity", position = "dodge") +

labs(title = "Top Fashion Categories Produced (2018–2022)",

x = "Year", y = "Production Count",

Page | 19
fill = "Category") +
theme_minimal()Plot bar graph

Analysis & Interpretation

● The bar chart shows how production volumes for each category

changed annually.

● Shirts and dresses dominate most years, while jackets peak closer

to winter seasons.
● Pants show consistent year-round demand.
● Strategic Production Recommendation
● Focus production on these four categories ahead of peak seasons:
○ Shirts – Spring/Summer
○ Jackets – Autumn/Winter
○ Dresses – Spring and Festive
○ Pants – Year-round
● Use this insight to plan inventory, raw materials, and workforce
allocation in advance.

Page | 20
11.Seasonal Trends for Production Planning

 Peak Seasons: Spring and Summer dominate for most fashion categories.

# Monthly Sales Aggregation

monthly_data <- fashion_data %>%

group_by(date) %>%

summarise(total_sales = sum(sales_count))
#output in enviroinment

12. Time Series Forecasting (ARIMA)

# Convert to time series

ts_data <- ts(monthly_data$total_sales, start = c(2018,1), frequency =

12)

# ARIMA Model

model <- auto.arima(ts_data)


forecast_result <- forecast(model, h = 48)

# Plot forecast

autoplot(forecast_result) +
labs(title = "Forecasted Sales for 2026", y = "Predicted Sales")

Page | 21
Interpretation:

● Steady increase in sales for Spring & Summer


● Production should ramp up Jan–Apr for Spring season

13. Interpretation for Production Planning

Insights for Managers:

 Stock buildup must start 2 months before peak.

 Low-demand categories (e.g., Winterwear) can follow just-in-time


production.
 Forecasts help reduce overproduction.

14. Challenges Faced

● Dataset lacks:

○ Raw material cost

○ Return rates or defective counts

● Trend outliers due to pandemic (2020)

Page | 22
15. Justification for Code Use

● group_by() – Helps segment sales by category/season

● ts() – Needed for time-series conversion

● auto.arima() – Automatically selects best p, d, q parameters

● forecast() – Generates future predictions


● ggplot() – For meaningful, readable visual insights

Conclusion

In conclusion, the fashion industry stands as a powerful example of how


technology and evolving consumer expectations drive transformation. Recent
advancements in fashion studies, particularly through the use of R
programming, have enabled deeper insights and data-driven decisions across
design, production, marketing, and retail. From digital design tools enhancing
creativity to sustainable practices supported by precise data analysis, the
integration of R
has supported a shift towards innovation and environmental responsibility. E-
commerce and social media have reshaped consumer interaction, while
wearable technology has merged functionality with fashion. Through tools like
R, fashion professionals can analyze trends, optimize inventory, reduce waste,
and improve customer experiences. As the industry continues to evolve, the
combination of fashion, technology, and data analytics through R programming
will play a key role in creating a smarter, more sustainable, and more responsive
fashion ecosystem.

Page | 23
References

https://www.geeksforgeeks.org/what-is-data-analysis/

https://www.geeksforgeeks.org/r-programming-language-introduction/

https://ruralhandmade.com/blog/advancement-in-fashion-using-data-analytics-and-
r#:~:text=Supply%20chain%20optimization%3A%20R%20programming,production%20planning%2C
%20and%20inventory%20management

https://www.proserveit.com/blog/introduction-to-data-analysis#what-is

https://olibr.com/blog/what-are-the-disadvantages-and-advantages-of-using-r/

https://www.r-project.org/about.html

https://www.sigmamagic.com/blogs/analytics-advantages-and-limitations/#:~:text=One%20of
%20the%20biggest%20limitations,whose%20services%20they%20are%20using

https://rsult.one/erp-per-industry/erp-for-garment-manufacturing-industry-specific-benefits-and-
features/

Dataset Source: Kaggle – Fashion Trend Dataset (2018–2022)

R Documentation: https://cran.r-project.org/

tidyverse and forecast packages

THANK YOU

Page | 24

You might also like