0% found this document useful (0 votes)
17 views37 pages

Portfolio

Uploaded by

priyankabhoutkar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views37 pages

Portfolio

Uploaded by

priyankabhoutkar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 37

PORTFOLIO

Professional Background:

Results-driven Senior Process Specialist with extensive experience in leading and directing
team activities, coordinating projects, and achieving challenging targets. Skilled in fostering an
enterprising mindset to meet and exceed goals. Bringing [Number] years of experience and
seeking a challenging position with opportunities for advancement. Detail-oriented team player
with strong organizational skills and the ability to handle multiple projects simultaneously with
high accuracy.

Objective: To secure a full-time position that offers professional challenges, utilizing my


interpersonal skills, excellent time management, and problem-solving abilities. Hardworking and
passionate job seeker with a strong organizational background, eager to contribute to team
success. Organized and dependable, with a positive attitude and a willingness to take on
additional responsibilities to achieve team goals.

Data Analytics Expertise: Completed 8 projects showcasing expertise in data analysis and
interpretation. Proficient in Excel, Statistics, MySQL, Tableau, Power-BI & Python bringing
valuable insights to the field of data analytics.

1
Table Of Contents

Sr. No. Contents Page No.

1 Professional Background 1
2 Table Of Contents 2-3
3 Instagram User Analytics 4-7
Description
SQL Queries and Results
Conclusion
4 Operation Analytics & Investigating Metric Spike 8 – 13
Description
SQL Queries and Results
Conclusion
5 Hiring Process Analytics 14 - 16
Description
Problem Solution
Insights
Results
Conclusions
6 IMBD Movie Analysis 17 - 20
Description
Problem Solution
Insights
Results
Conclusions
7 Bank Loan Case Study 21 - 24
Description
Problem Solution
Insights
Results
Conclusions
8 Analysing the Impact of Car Features on Price and Profitability 25 - 31

Description

Problem Solution
Insights

2
Results
Dashboard
Conclusions
9 ABC Call Volume Trend Analysis 32 - 35
Description
Problem Solution
Insights
Results
Conclusions
10 Appendix 37

3
Instagram User Analytics

Description:

User analysis is the process by which we track how users engage and interact with our digital product (software or
mobile application) in an attempt to derive business insights for marketing, product & development teams.

These insights are then used by teams across the business to launch a new marketing campaign, decide on features to
build for an app, track the success of the app by measuring user engagement and improve the experience altogether
while helping the business grow.

SQL Queries and Results:

Marketing:

A. Loyal User Reward: The marketing team wants to reward the most loyal users, i.e., those who have been
using the platform for the longest time.

Identify the five oldest users on Instagram from the provided database.

Query: Result:

B. Inactive User Engagement:The team wants to encourage inactive users to start posting by sending them
promotional emails.
Identify users who have never posted a single photo on Instagram

4
C. Contest Winner Declaration: The team has organized a contest where the user with the most likes on a
single photo wins.
Determine the winner of the contest and provide their details to the team

Query: Result:

D. Hashtag Research: A partner brand wants to know the most popular hashtags to use in their posts to reach
the most people.
Identify and suggest the top five most commonly used hashtags on the platform.

5
E. Ad Campaign Launch: The team wants to know the best day of the week to launch ads.
Determine the day of the week when most users register on Instagram. Provide insights on when to
schedule an ad campaign.

Query:

F. User Engagement: Investors want to know if users are still active and posting on Instagram or if they are
making fewer posts.
Calculate the average number of posts per user on Instagram. Also, provide the total number of photos on
Instagram divided by the total number of users.

Query:

G. Bots & Fake Accounts: Investors want to know if the platform is crowded with fake and dummy accounts.
Identify users (potential bots) who have liked every single photo on the site, as this is not typically possible
for a normal user.

Query:

6
Conclusion:
Company need to remove the bots and fake accounts from the Instagram platform to enhance the user experience.
User engagement activity can be very useful for the growth & success of the company. Company can send
Promotional E-mail to inactive users. Use popular hashtags for promotions. Also we can reword most promising &
loyal users.

7
Operation Analytics and Investigating Metric Spike
Description:
Operational Analytics is a crucial process that involves analyzing a company's end-to-end operations. This
analysis helps identify areas for improvement within the company. As a Data Analyst, we'll work closely with
various teams, such as operations, support, and marketing, helping them derive valuable insights from the data
they collect.

One of the key aspects of Operational Analytics is investigating metric spikes. This involves understanding and
explaining sudden changes in key metrics, such as a dip in daily user engagement or a drop in sales. As a Data
Analyst, we'll need to answer these questions daily, making it crucial to understand how to investigate these
metric spikes.

Case Study 1: Job Data Analysis


Jobs Reviewed Over Time:
Write an SQL query to calculate the number of jobs reviewed per hour for each day in November 2020.

Query:

Result:

8
Throughput Analysis:
Write an SQL query to calculate the 7-day rolling average of throughput. Additionally, explain whether you
prefer using the daily metric or the 7-day rolling average for throughput, and why.

Query:

Result:

Language Share Analysis:


Write an SQL query to calculate the percentage share of each language over the last 30 days.

Query:

Result:

9
Duplicate Rows Detection:

Write an SQL query to display duplicate rows from the job_data table.
Query: Result:

Case Study 2: Investigating Metric Spike


Weekly User Engagement:
Write an SQL query to calculate the weekly user engagement.

Query: Result:

Insight: Highest user week 28

Minium user week 17

User Growth Analysis:


Write an SQL query to calculate the user growth for the product.

Query:

10
Result:

Insight: The 12th & 33 week of 2014 saw the greatest number of users.

The lowest saw in 35th week 2014.

Weekly Retention Analysis:


Write an SQL query to calculate the weekly retention of users based on their sign-up cohort.

Query: Result:

Insight: The user 11816 was retained for the longest duration that is 17 weeks.

Weekly Engagement Per Device:

Write an SQL query to calculate the


weekly engagement per device.

Query:

11
Result:

Insight: Weeks 31 & 32 of the year 2014 had the highest user engagement of 317 users each
week for the product and the device being used was "MacBook Pro" for both the weeks
Email Engagement Analysis:
Write an SQL query to calculate the email engagement metrics.

Query: Result:

Insight: Out of the total Emails sent , Around 35-73% of the are opened & 15-74% of those emails were
clicked.

Conclusions:
The project's key results
included the identification
of reviewed jobs
distribution across languages calculation of retention Rates

12
SQL is one of the most crucial skills for anyone in a data driven position. Analysts can
effectively contribute to improving daily operations, optimizing user engagement, and boosting
sales. By delivering actionable insights, the Lead Data Analyst plays a pivotal role in driving the
company's success and ensuring it remains agile and responsive in a dynamic business landscape.

13
Hiring Process Analytics
Description:
The hiring process is a crucial function of any company, and understanding trends such as the number of
rejections, interviews, job types, and vacancies can provide valuable insights for the hiring department.

As a data analyst at a multinational company my task is to analyze the company's hiring process data and draw
meaningful insights from it & to analyze this data and answer certain questions that can help the company
improve its hiring process.

Insights:
Through the data analytics process, several key insights were uncovered:

Analysis of gender distribution provided insight.

Salary analysis revealed.

Examination of departmental composition highlighted.

Analysis of position tiers uncovered.

Results:
Hiring Analysis:
Determine the gender distribution of hires. How many males and females have been hired by the company?

Use Formula:
=COUNTIFS(Table_1[event_name],"Male",Table_1[Status],"Hired")
=COUNTIFS(Table_1[event_name],"Female",Table_1[Status],"Hired")

14
Salary Analysis:
What is the average salary offered by this company? Use Excel functions to calculate this.

Use Formula:
=AVERAGE(G2:G7169)

Salary Distribution:

Create class intervals for the salaries in the company. This will help you understand the salary distribution.

15
Departmental Analysis:

Use a pie chart, bar graph, or any other suitable visualization to show the proportion of people working in
different departments.

Position Tier Analysis:

Use a chart or graph to represent the different position tiers within the company. This will help you understand
the distribution of positions across different tiers.

Conclusion:
Hiring process is little bit complex process It involved lost of steps. I learned lot of new things
about Statistics & Excel during preparing project.

16
IMDB Movie Analysis
Descriptions:

The dataset provided is related to IMDB Movies.

IMBD is well known movie & series rating side for users and Critics Wordwise.

Here, success can be defined by high IMDB ratings. The impact of this problem is significant for movie
producers, directors, and investors who want to understand what makes a movie successful to make informed
decisions in their future projects.

Task: Determine the most common genres of movies in the dataset. Then, for genre, calculate
descriptive statistics (mean, median, mode, range, variance, standard deviation) of the IMDB scores.

Genres of movies in the dataset Descriptive statistics

ghts:

Insights:

Average is 22.4117647

The Median is 38

Standard deviation is 335.5605495

17
Task: Analyze the distribution of movie durations and identify the relationship between movie duration
and IMDB score.

Descriptive statistics:

Insights:
The are no outliers in the data Because there are no big difference between Mean & Median.
STDEV is at 22.67 which is high.

Task: Determine the most common languages used in movies and analyze their impact on the
IMDB score using descriptive statistics.

18
Insights:
English is the common & popular language to used in movie.

19
Task: Identify the top directors based on their average IMDB score and analyze their
contribution to the success of movies using percentile calculations.

Top 10 Directors with their IMDB movie score

Insights:
9 refers as 75% of the average IMDB scores are equal to or below 9

Task: Analyze the correlation between movie budgets and gross earnings, and identify the movies with
the highest profit margin.

Insights:

Avatar earned 523505847 profit margin This is the highest profit earning movie..

Conclusions:
Avatar earned 523505847 profit margin This is the highest profit earning movie..
English is the common & popular language to used in movie.
9 refers as 75% of the average IMDB scores are equal to or below 9

The are no outliers in the data Because there are no big difference between Mean & Median.

STDEV is at 22.67 which is high.

20
Bank Loan Case Study

As a data analyst for a financial business that specializes in urban lending, our organization has to deal with a
serious problem: certain clients take advantage of their short credit history, which leads to loan defaults. The
goal of the project is to use Exploratory Data Analysis (EDA) to methodically examine data patterns. Our goals
are twofold: first, we want to avoid turning away eligible candidates, and second, we want to reduce the default
risks brought on by a lack of credit history.

Task: Identify the missing data in the dataset and decide on an appropriate method to deal with it using Excel
built-
in

functions and features.

21
22
We found that Average 12198.46 data is blank

Task: Detect and identify outliers in the dataset using Excel statistical functions and features, focusing on
numerical variables.

23
Task: Determine if there is data imbalance in the loan application dataset
and calculate the ratio of data imbalance using Excel functions.

Task: Perform univariate analysis to understand the


distribution of individual variables, segmented univariate analysis to compare variable distributions for
different scenarios, and bivariate analysis to explore relationships between variables and the target variable
using Excel functions and features.

24
Task: Segment the dataset based on different scenarios (e.g., clients with payment difficulties and all other
cases) and identify the top correlations for each segmented data using Excel functions.

Conclusions:
This study offered insightful knowledge on data analysis in datasets containing loan application
information. Using Excel's functions and capabilities, I carefully examined how to handle missing data,
spot outliers, and interpret data imbalances. I developed a sophisticated grasp of the variables influencing
loan default by analysing the dataset and closely examining the connections between different features
and loan default. This knowledge is essential for risk assessment and lending sector decision-making.
This initiative also shown the value of data-driven approaches for reducing default risks and streamlining
the loan approval procedure. With my enhanced ability to analyse data using Excel, I can now better
traverse complicated situations and derive useful insights to improve decision-making in a
variety of sectors.

25
Analyzing the Impact of Car Features on Price and
Profitability

Description:
The automotive industry has been rapidly evolving over the past few decades, with a growing focus on fuel
efficiency, environmental sustainability, and technological innovation. With increasing competition among
manufacturers and a changing consumer landscape, it has become more important than ever to understand the
factors that drive consumer demand for cars.
In recent years, there has been a growing trend towards electric and hybrid vehicles and increased interest in
alternative fuel sources such as hydrogen and natural gas. At the same time, traditional gasoline-powered cars
remain dominant in the market, with varying fuel types and grades available to consumers.

As a Data Analyst, the client has asked How can a car manufacturer optimize pricing and product development
decisions to maximize profitability while meeting consumer demand?
This problem could be approached by analyzing the relationship between a car's features, market category, and
pricing, and identifying which features and categories are most popular among consumers and most profitable
for the manufacturer. By using data analysis techniques such as regression analysis and market segmentation,
the manufacturer could develop a pricing strategy that balances consumer demand with profitability, and
identify which product features to focus on in future product development efforts. This could help the
manufacturer improve its competitiveness in the market and increase its profitability over time.

Project Problem:
Investigating the relationship between a car's features and its popularity. By examining the popularity variable
in the dataset, a data analyst could identify which features are most popular among consumers and how they
affect a car's popularity. This could help manufacturers make informed decisions about product development
and marketing.

Predicting the price of a car based on its features and market category: By using the various features and
market category variables in the dataset, a data analyst could develop a model to predict the price of a car. This
could help manufacturers and consumers understand how different features affect the price of a car and make
informed decisions about pricing and purchasing.

Overall, this dataset could be a valuable resource for data analysts interested in exploring various aspects of the
automotive industry and could provide insights that could inform decisions related to product development,
marketing, and pricing.

Insights:
 People in US like engine with higher horse power. As the engine power increases, so does the price.
 Engine Horse Power and Number of Cylinder a car has are the major contributors deciding the price
followed by efficiency of the car.
 Exotic cars hold up their value and aspiration over the years.
 Electric cars tend to loose their value over the years.
 The efficiency of the car is inversely related to the Engine Horse Power and the Number of Cylinders.
As the Power and No. of Cylinders increase, efficiency of the car decreases.

26
Task 1.A:
Create a pivot table that shows the number of car models in each market category and their
corresponding popularity scores.

Task 1.B:
Create a combo chart that visualizes the relationship between market category and popularity.

Task 2:
Create a scatter chart that plots engine power on the x-axis and price on the y-axis. Add a
trendline to the chart to visualize the relationship between these variables.
Task 3:

27
Use regression analysis to identify the variables that have the strongest relationship with a car's price. Then
create a bar chart that shows the coefficient values for each variable to visualize their relative importance.

Task 4.A:

Create a bar chart or a horizontal stacked bar chart that visualizes the relationship between manufacturer
and average price.

Task 4.B:

28
Create a pivot table that shows the average price of cars for each manufacturer.

Task 5.A:

Calculate the correlation coefficient between the number of cylinders and highway MPG to quantify the
strength and direction of the relationship.

Task 5.B:

Create a scatter plot with the number of cylinders on the x-axis and highway MPG on the y-axis. Then create a
trendline on the scatter plot to visually estimate the slope of the relationship and assess its significance.

29
Building the Dashboard:
Now for the Next portion of the Project, need to create the Interactive Dashboard. Use filters and slicers to
make the chart interactive. The client has requested these questions given below:

Task 1:

How does the distribution of car prices vary by brand and body style?

Task 2:

Which car brands have the highest and lowest average MSRPs, and how does this vary by body style?

Result:

Bugatti has the highest MSRP and Plymouth has the lowest Average MSRP..

Task 3:

How do the different features sucfl as transmission type affect the MSRP, and how does this
vary by body style?

30
Result:

Automated_manual is most expensive category and most popular also.

Task 4:

How does the fuel efficiency of cars vary across different body styles and model years?

Result:

Over the year fuel efficiency is increasing at a slow speed.

31
Conclusions:
This project involved extensive use of Excel. Pivot tables were extensively used to accomplish the tasks. Major
challenge was to understand the data to fill the NULL/Missing values and the data irregularities rather than just
dropping the NULLs as they would have dropped particular car brands all together and the pricing for those
would be unknown by the dealers. Creating dashboard and Regression Analysis helped me understand the
factors affecting the car prices better. Over all the project was challenging and intuitive at the same time.

32
ABC Call Volume Trend Analysis

Project Description:
The attached dataset is of Inbound calls of an ABC company from the insurance category consists of a
Customer Experience (CX) Inbound calling team for 23 days. Data includes Agent_Name, Agent_ID,
Queue_Time [duration for which customer have to wait before they get connected to an agent), Time
[time at which call was made by customer in a day], Time_Bucket [for easiness we have also provided
you with the time bucket), Duration [duration for which a customer and executives are on call,
Call_Seconds [for simplicity we have also converted those time into seconds), call status (Abandon,
answered, transferred).
A customer experience (CX) team consists of professionals who analyze customer feedback and data, and
share insights with the rest of the organization. Typically, these teams fulfil various roles and
responsibilities such as: Customer experience programs (CX programs), Digital customer experience,
Design and processes, Internal communications, Voice of the customer (VoC), User experiences,
Customer experience management, Journey mapping, Nurturing customer interactions, Customer success,
Customer support, Handling customer data, Learning about the customer journey.
In a Customer Experience team there is a huge employment opportunities for Customer service
representatives A.k.a. call center agents, customer service agents. Some of the roles for them include
Email support, Inbound support, Outbound support, social media support.
Inbound customer support is defined as the call center which is responsible for handling inbound calls of
customers. Inbound calls are the incoming voice calls of the existing customers or prospective customers
for our business which are attended by customer care representatives. Inbound customer service is the
methodology of attracting, engaging, and delighting our customers to turn them into our business' loyal
advocates. By solving our customers' problems and helping them achieve success using our product or
service, we can delight our customers and turn them into a growth engine for our business.

Task 1:
What is the average duration of calls for each time bucket?

33
Task 2:
Can you create a chart or graph that shows the number of calls received in each time
bucket?

Task 3:
What is the minimum number of agents required in each time bucket to reduce the
abandon rate to 10%?

34
Insights:
To determine the total number of agents required, we can use the formula:
Total agents = (Average calls/Time per person).
Given the following information:
Average calls on a single day: 139.53
Total time spent by one person in a single day: 5 hours
To achieve a 90% call connection rate (instead of the current 60%), we calculate the number of additional
agents needed. Applying the unitary method, we find that approximately 57 agents would be required.
Therefore, the total number of agents needed to achieve a 90% call connection rate is approximately 57.

1. We began by creating a pivot table where we placed Date & Time in the Rows section and Call
Status in the Columns section. We then calculated the average of abandoned, answered, and
transferred calls using the average Excel formula.
2. The analysis revealed that 29% of the calls were abandoned, 1% were transferred, and 70%
were answered during the daytime.
3. To ensure that 90% of the calls are answered each day, a total of 57 agents are required.
4. The minimum number of agents required for each time bucket can be calculated by
multiplying 57 by the count of time, which was calculated in a previous question.
5. This information provides insights into call handling efficiency, the distribution of call
statuses, and the required staffing levels for effective customer service.

35
Task 4:
Propose a manpower plan for each time bucket throughout the day, keeping the maximum
abandon rate at 10%.

Results:
1. Throughout this project, I have gained valuable insights into the impact of an analyst in the customer
service department. It is evident that a company strives to ensure maximum customer satisfaction through
effective customer handling strategies.
2. One of the notable tools used is the Interactive Voice Response (IVR) system, which employs Al
technology to address customer queries by identifying their specific concerns and routing the calls to the
appropriate agents for resolution.
3. The analysis of the provided data was made easier by the pre-calculated time buckets and call duration
converted into seconds, saving time and effort in calculations.
4. Additionally, I have delved into the realm of behavioral analytics, which involves studying customer
behavior patterns to identify trends, preferences, and opportunities for enhancing the overall customer
experience.
5. Overall, this project has provided me with valuable knowledge and insights into the dynamics of
customer service and the role of an analyst in optimizing customer satisfaction.

Conclusion:
This project has provided me with valuable knowledge and insights into the dynamics of
customer service and the role of an analyst in optimizing customer satisfaction. I have gained
valuable insights into the impact of an analyst in the customer service department. It is evident
that a company strives to ensure maximum customer satisfaction through effective customer
handling strategies. This project involved extensive use of Excel. Pivot tables were extensively
used to accomplish the tasks

Appendix
36
1. Instagram User Analytics:
https://drive.google.com/file/d/1-DCNH7hFVe6Oca8FqT8fDNC6lQXDSn3c/view?usp=drivesdk

2. Operation Analytics & Investigating Metric Spike:


https://drive.google.com/file/d/1XSH68Sc01mqWOw-eYACRP988PZR50rvO/view?
usp=drivesdk

3. Hiring Process Analytics:


https://drive.google.com/file/d/1-Bgn5cEdPR05FfY1bTfL76rd2xyLBWgT/view?usp=drivesdk

4. IMBD Movie Analysis:


https://drive.google.com/file/d/1-tuqES_J4wuokIdqnhT31UHMIJ4cVbmJ/view?usp=drivesdk

5. Bank Loan Case Study:


https://drive.google.com/file/d/109Qm5LUbV7rmSSgBs4E7atKr1YSwdvfN/view?usp=drivesdk

6. Analysing the Impact of Car Features on Price and Profitability:


https://drive.google.com/file/d/10JHmHAV5MqFw2Zd3nhNz0SdtZT6Mgu7f/view?usp=drivesdk

7. ABC Call Volume Trend Analysis:


https://drive.google.com/file/d/10hFt5cUwR2yOyESzUdXBYI4UPeymFG5c/view?usp=drivesdk

37

You might also like