Assignment Question
Assignment Question
For the assignment, you are asked to explore the application of data analytics techniques to
the dataset that is provided. You must study data problems related to the dataset, giving
special consideration to the unique properties of the problem domain, and testing one or
more techniques on it.
Your analysis needs to be thorough and go beyond the scope of what has been covered in
this course. You should incorporate data exploration, manipulation, transformation, and
visualization concepts with data analysis techniques in your solution. It is crucial to provide
explanations and justifications for the chosen techniques.
You also may need to pre-process your data to get it into an appropriate format. The
assignment should involve several techniques by categorizing it into different criteria and a
detailed exploration of the commands used in each criterion. Outline the findings, analyze
them, and justify them correctly with an appropriate graph. Also, a supporting document is
needed to reflect the graph and code using R programming concepts.
This assignment will help you to explore and analyze a set of data and reconstruct it into
meaningful representations for decision-making.
3.0 TYPE
Group Assignment (2–4 members)
the given dataset to identify the factors that measure customer satisfaction, represented in
the form of product ratings by consumers and provide recommendations to stakeholders.
Techniques
The dataset provided for this assignment consists of customer personal information (i.e.,
Name, Email, Phone, City, …), demographics (Age, Gender, Income, …) along with their
purchasing behaviour (Total Purchases, Total Amount, Product Brand, Product Type,
Feedback, Payment Method, …). In addition to the techniques (data exploration,
manipulation, transformation, and visualization techniques) covered in the course to
conduct analysis, you might consider exploring and implementing more advanced concepts
to enhance the effectiveness of data retrieval, especially if it fits your requirements.
DELIVERABLES:
The complete RScript (source code) and report must be submitted to the APU Learning
Management System (Moodle).
5.1 RScript (Program Code):
• Name the file under your group number.
• Start the first few lines in your program by typing all member's names and TP
numbers. For example:
# Name1, TP000001
# Name2, TP000002
# Name3, TP000003
# Name4, TP000004
o For each objective example, provide student id and explain what you want to
discover. For example:
o Hypothesis 1: Customer segments with higher purchasing power (e.g., premium
or frequent buyers) tend to give higher ratings compared to segments with lower
purchasing power.
Objective 1: To evaluate the relationship between customer segments and level
of ratings. NAME, TPXXXXXX
Analysis 1-1: Is there any correlation between different customer segments
and level of ratings?
Analysis 1-2: Is customer segment a key predictor for ratings?
Analysis 1-3: What are the external factors, if any, that share a causal
relationship with customer segment to influence purchase ratings?
o For each additional example, provide an ID and explanation.
# Extra feature 1
# comments about the extra feature
A) Cover Page:
All reports must be prepared with a front cover. A protective transparent plastic
sheet can be placed in front of the report to protect the front cover. The front cover
should be presented with the following details:
Ä Module
Ä Coursework Title
Ä Intake
Ä Students name and id
Ä Date Assigned (the date the report was handed out).
Ä Date Completed (the date the report is due to be handed in).
B) Contents:
o Introduction
ü Data Description
ü Assumptions (if any)
ü Hypothesis and Objectives
o Data Preparation
ü Data import
ü Cleaning / pre-processing (if necessary)
ü Data Validation (if necessary)
o Data Analysis
ü Each objective (along with student name) must start in a separate page
and contain:
§ Analytical technique(s) – e.g. descriptive using statistics
§ Justification of technique(s)
§ Screenshot of source code with output/plot.
§ Outline the findings based on the results obtained.
ü The extra feature explanation must be in a separate page and contains:
§ Screenshot of source code with output/plot.
§ Explain how adding this extra feature can improve the results.
ü Interpret the results from each analysis
o Conclusion
ü Overall discussion on the findings from all objectives
ü Recommendation
ü Limitation and future direction
ü State the word count (at the end of page)
C) Workload Matrix
D) References
Ä You may source algorithms and information from the Internet or books.
Proper referencing of the resources should be evident in the document.
Ä All references must be made using the APA (American Psychological
Association) referencing style as shown below: