Explore and Transform Data Based On Rows - Transcript
The document provides an overview of a guided project on applying data analysis in business using R programming. It introduces the instructor and describes the goals of the project, which are to analyze trends in consumer purchasing behavior using a department store sales dataset from 2020. The document outlines the key steps and skills that will be covered in the project, including data transformation, visualization, and correlation analysis.
Welcome to this guided project, Application of Data Analysis in Business with R Programming.
I'm Nilosree Sengupta, and I will be your instructor for this project. I am passionate about artificial intelligence, data science and business management, and I hold 18 specializations and 2 professional certificates in these fields. I am also a Curriculum Architect Program Tester at DeepLearning.AI, driven by my passion for artificial intelligence. This project is for people who want to build a career in data science, for example as a data analyst. By the end of this project you will be able to analyze trends and patterns in data with data analysis techniques that support strategic decision making.
I have already done the initial steps of the data analysis: preparing the questions, validating the problem statement and building a cleaned data set for you to work with. That lets us focus directly on the core of data analysis: first, data transformation and statistical interpretation of data; second, data visualization; and third, finding correlations between the quantitative variables. The skills you gain from this project will help you find insights in data, build reports and make recommendations in your future job.
Data analysis plays a major role in optimizing business decisions, such as optimizing the cost of production, risk management, supply chain management, understanding consumer purchasing behavior and trends, competitive analysis and many more. In this guided project, you will carry out an analysis and research study of departmental store data from 2020, a period affected by COVID-19. I built this data set after thorough market research, and you will use it to analyze consumer purchasing behavior and trends. To get the most out of this course, you will need prior knowledge of R programming and statistics, and familiarity with RStudio.
Now let's explore our cloud workspace and the files. All right! This is your folder for Application of Data Analysis in Business with R Programming. Open it. Inside is your departmental store project, which holds the entire R programming project where you will be working. These are your presentations, and these are your supplementary materials for correlation, data manipulation and data visualization, which will be covered over the course of the project.
Now let's look at what your final project will look like. Okay. So, Application of Data Analysis in Business with R Programming, final glimpse, by Nilosree Sengupta, Coursera Project Network. These are your glimpses. You will have built all of these by the end of this guided project. I want you to feel confident applying data analysis skills on your own to a real-world scenario. To help you do that, I have included an activity at the end of this project for you to complete on your own. To do this activity you will have to use everything you have learned in this project: data transformation, statistical interpretation of data, graphical interpretation of data and, finally, finding correlations to analyze the trends and patterns of the business. You can also use this end activity to check whether you have mastered the skills from this guided project. All the best for this project.
Welcome to the guided project. Welcome to data manipulation using dplyr. Under data manipulation, you will be doing data transformation and statistical interpretation of data. The important functions of dplyr are filter, slice, select, arrange, mutate, summarise and group_by. We will also cover some more important functions for descriptive statistics. After that, here is your problem statement.
You can go through it. You are provided with a departmental store data set from 2020, and you need to analyze the trends and patterns in consumer purchasing behavior for different products. Next, you can see a warning message here. You can ignore it and clear your console with Ctrl + L.
In this task you will explore and transform data based on rows. So let's get started with our coding. I have already installed the dplyr package, but in case it is not installed, you can run this line of code to install it. After that, let's load the dplyr package, then load our data set. Now let's view it. So this is your data set. Next, let's glimpse it to get an idea of what it contains. It has 550 rows and 8 columns.
Let's get started with our first function, filter. It extracts the rows that satisfy a condition on one or more columns. Now let's check the first question. It asks for the information on the products where the product type is snacks. Here we create a new data frame, store1, then write filter. Within the brackets, write store, then a comma, then product_type == "snacks". Okay, now run it. Now view it. Here you get the information on the products where the product type is snacks.
Now let's look at an example of using this function. After every function you will see another example, which I have written for you just so you can observe the output. The basic code remains the same, so you only need to run it to check the output. Okay, so run it and view store1. The question asked for the information on the products that belong to company S and M, so here it is.
After that, let's go to the next one, the slice function. It is used to select particular rows based on their position.
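The setup and filter steps narrated above can be sketched roughly as follows. This is a minimal sketch, not the course's actual script: the course data set is not reproduced here, so a small hypothetical `store` data frame stands in for it. The `product_name`, `company` and `selling_price` column names, the sample values, the name `store1_sm`, and the reading of "company S and M" as two companies are all assumptions made for illustration.

```r
# install.packages("dplyr")  # run once if dplyr is not installed yet
library(dplyr)

# Hypothetical stand-in for the course data set
# (the real one is described as having 550 rows and 8 columns).
store <- data.frame(
  product_name      = c("Chips", "Cola", "Biscuits", "Soap", "Juice", "Nachos"),
  product_type      = c("snacks", "beverage", "snacks", "hygiene", "beverage", "snacks"),
  company           = c("S", "M", "S", "H", "M", "M"),  # assumed column name
  quantity_demanded = c(40, 25, 12, 2, 18, 30),
  selling_price     = c(1.5, 2.0, 0.9, 3.2, 2.5, 1.8)   # assumed column name
)

# Quick structural overview: row count, column count and column types
glimpse(store)

# filter() keeps only the rows for which the condition is TRUE
store1 <- filter(store, product_type == "snacks")

# Assumed reading of the second question: rows from company "S" or company "M"
store1_sm <- filter(store, company %in% c("S", "M"))
```

In RStudio, `View(store1)` opens the result in the data viewer, which is the "view it" step in the narration.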
For the first question, which asks for the information in the first 10 rows, let's create a new data frame, store2. Write the name of the initial data frame, store, then the pipe operator, then slice, and within the brackets 1:10. Okay. Now run it. Now view it. Here is your new data frame with the first 10 rows. After that, let's go to the next example of using this function, which gets the first six rows using the slice_head function. Run it, view it. So here are your first six rows. If you are lagging behind, pause the video now and complete your work, then continue to the next part.
After that, let's go to the next one, the arrange function. It sorts your data set in ascending or descending order. First, you are asked to arrange the data set in ascending order of quantity demanded. Take a new data frame, store1 for example, then write arrange. Within the brackets, write store, then a comma, then quantity_demanded. Okay, now run it, view it. So here it is. The lowest quantity demanded, two, is right here. Now let's take the next example of this function, which is descending order of selling price. Now run it, view it. Here is your new data frame in descending order of selling price.
Okay, so that's all for this task. In the next task, you are going to transform data based on columns. Keep practicing. See you in the next task.
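The slice and arrange steps from this task can be sketched in the same way. Again this is only an illustration under stated assumptions: the 12-row `store` data frame below is hypothetical, `product_id` and `selling_price` are assumed column names, the result names `store2_head` and `store1_desc` are made up here, and using `desc()` for the descending sort is one common approach rather than a confirmed detail of the course code.

```r
library(dplyr)

# Hypothetical stand-in for the course data set (the real one has 550 rows).
store <- data.frame(
  product_id        = 1:12,
  quantity_demanded = c(40, 25, 12, 2, 18, 30, 7, 55, 21, 9, 33, 14),
  selling_price     = c(1.5, 2.0, 0.9, 3.2, 2.5, 1.8, 4.1, 0.7, 2.2, 5.0, 1.1, 2.9)
)

# slice() selects rows by position: here rows 1 through 10
store2 <- store %>% slice(1:10)

# slice_head() takes the first n rows
store2_head <- store %>% slice_head(n = 6)

# arrange() sorts in ascending order by default
store1 <- arrange(store, quantity_demanded)

# Wrapping a column in desc() sorts it in descending order
store1_desc <- arrange(store, desc(selling_price))
```

Note that `arrange()` returns a new, sorted data frame and leaves `store` itself unchanged, which is why each result is assigned to a new name.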