Project 3 - Online Course Platform Analysis

The document outlines a project for analyzing an online course platform using a dataset that includes user, course, category, and enrollment information. Key tasks involve creating a database, loading data, executing SQL queries for insights, and visualizing data in Power BI. The final deliverables include SQL code, a Power BI file, a summary report, and screenshots of visualizations.

Uploaded by

tanishmittal999
Project 3: Online Course Platform Analysis

Dataset Description
The dataset represents data from an online course platform, tracking user enrollments,
course details, and user progress. It includes information about users, courses,
categories, and enrollment activities. The dataset is provided in CSV format with the
following tables:
1. Users:
◦ user_id (INT, Primary Key)
◦ first_name (VARCHAR)
◦ last_name (VARCHAR)
◦ email (VARCHAR)
◦ country (VARCHAR)
◦ signup_date (DATE)
2. Courses:
◦ course_id (INT, Primary Key)
◦ course_name (VARCHAR)
◦ category (VARCHAR, e.g., Programming, Data Science, Business, Design)
◦ duration_hours (DECIMAL)
◦ price (DECIMAL)
3. Categories:
◦ category_id (INT, Primary Key)
◦ category_name (VARCHAR, matches category in Courses)
◦ description (VARCHAR)
4. Enrollments:
◦ enrollment_id (INT, Primary Key)
◦ user_id (INT, Foreign Key)
◦ course_id (INT, Foreign Key)
◦ enrollment_date (DATE)
◦ progress_percentage (DECIMAL, 0 to 100)
◦ completed (BOOLEAN, 1 for completed, 0 for not completed)
Tasks:
1. Database and Table Creation:
• Create a database named CoursePlatformDB.
• Create the four tables (Users, Courses, Categories, Enrollments) with
appropriate data types, primary keys, and foreign key constraints.
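The table definitions described above could be sketched as follows. This is only an illustration under stated assumptions: it uses Python's built-in sqlite3 module with an in-memory database as a stand-in for the target server (so the CoursePlatformDB creation step is omitted), the VARCHAR and DECIMAL sizes are guesses, and the foreign key from Courses.category to Categories.category_name is an assumption drawn from the "matches category in Courses" note rather than something the brief requires.

```python
import sqlite3

# Schema sketch for the four tables. Column sizes and the CHECK constraints
# are assumptions; the brief only names the columns and their base types.
SCHEMA = """
CREATE TABLE Users (
    user_id     INTEGER PRIMARY KEY,
    first_name  VARCHAR(50),
    last_name   VARCHAR(50),
    email       VARCHAR(100),
    country     VARCHAR(50),
    signup_date DATE
);
CREATE TABLE Categories (
    category_id   INTEGER PRIMARY KEY,
    category_name VARCHAR(50) UNIQUE,
    description   VARCHAR(255)
);
CREATE TABLE Courses (
    course_id      INTEGER PRIMARY KEY,
    course_name    VARCHAR(100),
    -- Assumed link: the brief says category_name matches this column.
    category       VARCHAR(50) REFERENCES Categories(category_name),
    duration_hours DECIMAL(5, 2),
    price          DECIMAL(10, 2)
);
CREATE TABLE Enrollments (
    enrollment_id       INTEGER PRIMARY KEY,
    user_id             INTEGER NOT NULL REFERENCES Users(user_id),
    course_id           INTEGER NOT NULL REFERENCES Courses(course_id),
    enrollment_date     DATE,
    progress_percentage DECIMAL(5, 2)
        CHECK (progress_percentage BETWEEN 0 AND 100),
    completed           BOOLEAN CHECK (completed IN (0, 1))
);
"""


def create_schema(conn):
    """Create the four tables and turn on foreign key enforcement."""
    # SQLite ignores foreign keys unless this pragma is enabled per connection.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)


conn = sqlite3.connect(":memory:")
create_schema(conn)

# List the tables that now exist, as a quick sanity check.
tables = sorted(
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
)
```

On a server database the same statements would follow a CREATE DATABASE CoursePlatformDB step, and the type names may need adjusting to that system's dialect.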
2. Data Loading:
• Load the provided CSV data into the respective tables. (CSV files will be
generated by the Python script below.)
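One possible way to load a CSV file into its table is sketched below with Python's csv and sqlite3 modules. The helper name load_csv, the file name Categories.csv mentioned in the comment, and the two sample rows are all hypothetical; the sketch also assumes each CSV file starts with a header row whose names match the table's columns.

```python
import csv
import io
import sqlite3

# Only the four tables from the brief may be targeted by the loader.
ALLOWED_TABLES = {"Users", "Courses", "Categories", "Enrollments"}


def load_csv(conn, table, csv_file):
    """Insert every data row of an open CSV file into `table`.

    Assumes the first row is a header matching the table's column names.
    Returns the number of rows inserted.
    """
    if table not in ALLOWED_TABLES:
        raise ValueError(f"unexpected table: {table}")
    reader = csv.reader(csv_file)
    header = next(reader)
    columns = ", ".join(header)
    placeholders = ", ".join("?" for _ in header)
    rows = [row for row in reader if row]  # skip blank lines
    with conn:  # commits on success, rolls back if an insert fails
        conn.executemany(
            f"INSERT INTO {table} ({columns}) VALUES ({placeholders})", rows
        )
    return len(rows)


# Demo with a made-up two-row file held in memory. Real use would pass
# something like open("Categories.csv", newline="") instead (assumed name).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Categories ("
    "category_id INTEGER PRIMARY KEY, category_name TEXT, description TEXT)"
)
sample = io.StringIO(
    "category_id,category_name,description\n"
    "1,Programming,Sample row\n"
    "2,Design,Sample row\n"
)
loaded = load_csv(conn, "Categories", sample)
```

Loading Categories and Users before Courses and Enrollments keeps foreign key checks from failing on rows whose parents have not been inserted yet.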
3. SQL Queries:
Write queries to answer the following questions:

• Summarize total revenue by category and highlight the most profitable
category.
• Discuss courses with high progress percentages and their potential appeal to
users.
• Analyze user distribution across countries and suggest targeted marketing
strategies.
• Interpret the monthly enrollment trend and identify peak periods.
• Highlight the top 5 users by completed courses and propose rewards for
active learners.
• Discuss course rankings within categories and identify top-performing
courses.
• Explain cumulative enrollment trends for categories and their growth patterns.
• List courses with no enrollments and recommend promotional strategies.
• Analyze users enrolling in high-priced courses and their engagement
behavior.
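Three of the analyses listed above might be approached as in the sketch below. It runs on SQLite through Python so that it is self-contained; the handful of rows are made up purely to make the queries executable. Two assumptions are worth stating plainly: revenue is taken to be the course price summed over enrollments (the brief does not define revenue), and LIMIT 5 is SQLite syntax that would need a dialect-specific equivalent elsewhere.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Users (user_id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT);
CREATE TABLE Courses (course_id INTEGER PRIMARY KEY, course_name TEXT,
                      category TEXT, price REAL);
CREATE TABLE Enrollments (enrollment_id INTEGER PRIMARY KEY, user_id INTEGER,
                          course_id INTEGER, completed INTEGER);
-- Made-up rows, present only so the queries have something to run on.
INSERT INTO Users VALUES (1, 'Ana', 'Lee'), (2, 'Ben', 'Roy');
INSERT INTO Courses VALUES (1, 'SQL Basics', 'Programming', 50.0),
                           (2, 'Logo Design', 'Design', 30.0),
                           (3, 'Pricing 101', 'Business', 40.0);
INSERT INTO Enrollments VALUES (1, 1, 1, 1), (2, 1, 2, 1), (3, 2, 1, 0);
""")

# Total revenue by category, highest first.
# Assumption: revenue = course price summed over all enrollments.
REVENUE_BY_CATEGORY = """
SELECT c.category, SUM(c.price) AS total_revenue
FROM Enrollments e
JOIN Courses c ON c.course_id = e.course_id
GROUP BY c.category
ORDER BY total_revenue DESC;
"""

# Top 5 users by number of completed courses, using a CTE.
TOP_USERS = """
WITH completed_counts AS (
    SELECT user_id, COUNT(*) AS completed_courses
    FROM Enrollments
    WHERE completed = 1
    GROUP BY user_id
)
SELECT u.user_id, u.first_name, u.last_name, cc.completed_courses
FROM completed_counts cc
JOIN Users u ON u.user_id = cc.user_id
ORDER BY cc.completed_courses DESC, u.user_id
LIMIT 5;
"""

# Courses that have no enrollment rows at all.
NO_ENROLLMENTS = """
SELECT c.course_id, c.course_name
FROM Courses c
LEFT JOIN Enrollments e ON e.course_id = c.course_id
WHERE e.enrollment_id IS NULL;
"""

revenue = conn.execute(REVENUE_BY_CATEGORY).fetchall()
top_users = conn.execute(TOP_USERS).fetchall()
unenrolled = conn.execute(NO_ENROLLMENTS).fetchall()
```

The first row of revenue is the most profitable category under the stated revenue assumption. Users with no completed courses never appear in the CTE, so top_users can hold fewer than five rows on small data.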
4. Power BI Visualization
1. Data Import:
Import the CoursePlatformDB tables into Power BI using a SQL Server or CSV
connector.
2. Data Model:
Create relationships between the tables based on user_id, course_id, and category (link
Courses.category to Categories.category_name).
3. Visualizations:
Create the following visuals:
▪ Bar Chart: Total revenue by category.
▪ Column Chart: Average progress percentage by course.
▪ Map: Number of users by country.
▪ Line Chart: Monthly enrollment trend for the past year.
▪ Table: Top 5 users by completed courses (from the CTE query).
▪ Clustered Bar Chart: Course rankings by revenue within each
category.
▪ Area Chart: Cumulative enrollments by category over time.
4. Dashboard:
Combine all visuals into a single Power BI dashboard with filters for category, country,
and enrollment_date.
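The line chart and area chart both depend on enrollments aggregated by month, which can be prepared in SQL before the data reaches Power BI. The sketch below is one way to do that, again using SQLite through Python with made-up rows. The strftime call is SQLite-specific and the window function needs SQLite 3.25 or later; another database would need its own date-formatting function.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Courses (course_id INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE Enrollments (enrollment_id INTEGER PRIMARY KEY,
                          course_id INTEGER, enrollment_date TEXT);
-- Made-up rows, present only so the queries have something to run on.
INSERT INTO Courses VALUES (1, 'Programming'), (2, 'Design');
INSERT INTO Enrollments VALUES
    (1, 1, '2024-01-05'), (2, 1, '2024-01-20'),
    (3, 2, '2024-01-09'), (4, 1, '2024-02-11');
""")

# Enrollments per calendar month (feeds the monthly trend line chart).
MONTHLY_TREND = """
SELECT strftime('%Y-%m', enrollment_date) AS month, COUNT(*) AS enrollments
FROM Enrollments
GROUP BY month
ORDER BY month;
"""

# Running total of enrollments per category (feeds the area chart).
# The CTE counts per category and month; the window SUM accumulates them.
CUMULATIVE_BY_CATEGORY = """
WITH monthly AS (
    SELECT c.category,
           strftime('%Y-%m', e.enrollment_date) AS month,
           COUNT(*) AS enrollments
    FROM Enrollments e
    JOIN Courses c ON c.course_id = e.course_id
    GROUP BY c.category, month
)
SELECT category, month,
       SUM(enrollments) OVER (PARTITION BY category ORDER BY month)
           AS cumulative_enrollments
FROM monthly
ORDER BY category, month;
"""

monthly = conn.execute(MONTHLY_TREND).fetchall()
cumulative = conn.execute(CUMULATIVE_BY_CATEGORY).fetchall()
```

The same running totals could instead be computed inside Power BI; doing it in SQL simply keeps the logic alongside the other query deliverables.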
Insights to Share in Power BI Report:
• Highlight the top revenue-generating category in the bar chart and discuss its
success factors.
• Use the column chart to discuss courses with high engagement and their quality.
• Analyze the map to identify user concentration and suggest regional expansion
strategies.
• Interpret the line chart to discuss enrollment trends and predict future growth.
• Use the table to showcase top users and propose engagement initiatives.
• Discuss course performance within categories using the clustered bar chart and
suggest improvements for low performers.
• Explain the area chart to show category growth trends and their strategic importance.

Report Deliverables:
• SQL code files with comments
• Power BI file (.pbix)
• A 1-page summary answering all 10 insight questions using both SQL and visual
evidence
• Screenshots of dashboards and key visuals in the report
