Intern Final Out
Submitted by
BACHELOR OF TECHNOLOGY
IN
2022
CERTIFICATE
ACKNOWLEDGEMENT
First, I would like to thank Mr. K. VIGNESH KUMAR for giving me the
opportunity to do an internship at WE & DATA. Although quite short, it
was a great experience that I can learn from. It helped me to explore my skills and
increased my interest in Artificial Intelligence.
Without the help and support of all the people mentioned, I would not have been able to
complete my internship and learn so many different things. I would like to thank them all for
their valuable contributions and proper guidance throughout my internship period.
Finally, I must acknowledge with due respect the constant support and patience
of my parents.
TABLE OF CONTENTS
Acknowledgement
Abstract
List of figures
List of Abbreviations
Weekly Report
1 Introduction
1.7 Visualization
2 Power BI Dashboard
4 Conclusion
References
ABSTRACT
Power BI offers interactive visualization tools for sharing insights with a wide audience.
Whereas traditional BI reporting is often limited to historical analysis, Power BI can also
work with real-time data, so trends and issues can be identified early and performance
improved. Power BI provides rich, personalized dashboards that help extract business
intelligence rapidly and accurately, and custom visuals let you present data in almost any
way you can imagine. Management often uses Power BI to draw insights and inferences about a
company’s forecasts, customer behavior, and so on. It can also be used to track an
organization’s internal employee performance. Analysis that might otherwise take much longer
can be produced quickly: load the data into Power BI, reshape it, create your reports, and
the analysis is ready to present to decision makers. Power BI is a collection of software
services, apps, and connectors that work together to turn unrelated sources of data into
coherent, visually immersive, and interactive insights. The data may be an Excel spreadsheet
or a collection of cloud-based and on-premises hybrid data warehouses. Using Power BI, I have
analysed a heart disease dataset. The goal in choosing this dataset is to educate people on
the need for a healthy routine to stay fit and free from heart disease. The visuals highlight
the importance of maintaining one's physical health. They help us to analyse whether a person
has heart disease or is likely to develop it in future, and they relate a healthy heart to
exercise habits, proper diet, blood pressure, and similar factors.
LIST OF FIGURES
Fig:2.2 Dashboard
Fig:2.3 Cards
Fig:2.10 Tables
Fig:2.11 Treemap
Fig:2.15 Configuration
Fig:2.17 R Script
LIST OF ABBREVIATIONS
BI Business Intelligence
WEEKLY OVERVIEW OF INTERNSHIP ACTIVITIES
CHAPTER 1
INTRODUCTION
In today's world, technology is growing very fast, and we come into contact
with new technologies day by day. One of the booming areas of
computer science is Business Intelligence, which is changing how organizations
work by turning raw data into insights that support decisions.
1.4 History of Power BI
Power BI was designed by Ron George in 2010 and named Project
Crescent. In 2011, it was bundled with the SQL Server release codenamed Denali. In
September 2014, Microsoft revealed the first preview of Power BI.
The first version of Power BI was released on 24 July 2015.
● Power BI is faster and performs better when there is a smaller volume of data
● Power BI provides an interface based on Microsoft Office 365 that is user-friendly,
intuitive, and easy to understand
● Using Power BI, you can work with several data sources, such as Excel, Text/CSV, JSON,
SQL Server databases, IBM DB2, MySQL, etc.
● Power BI can connect with the R programming language, and it also supports various Data
Analysis Expressions (DAX) functions and measures
● Power BI has functional integration with the Microsoft Azure cloud platform. It helps to
analyze insights and patterns in datasets
Data modeling is one of the features used to connect multiple data sources in a BI tool
using relationships. A relationship defines how data sources are connected with each other,
so you can create data visualizations that draw on multiple data sources.
With the modeling feature, you can build custom calculations, such as calculated columns,
on the existing tables, and these columns can be used directly in Power BI visualizations.
This allows businesses to define new metrics and to perform custom calculations for those
metrics.
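The idea of a relationship combined with a custom calculation can be illustrated outside Power BI. The following is a minimal Python sketch, not Power BI's own mechanism: the two tables, their column names (customer_id, income, amount, pct_of_income), and the relate helper are hypothetical and invented for illustration. It links an orders table to a customers table through a shared key and derives a new column from both.

```python
# Hypothetical tables; names and values are invented for illustration only.
customers = [
    {"customer_id": 1, "education": "Graduation", "income": 58000},
    {"customer_id": 2, "education": "Basic", "income": 21000},
]
orders = [
    {"order_id": 10, "customer_id": 1, "amount": 580},
    {"order_id": 11, "customer_id": 2, "amount": 105},
    {"order_id": 12, "customer_id": 1, "amount": 290},
]


def relate(orders, customers, key="customer_id"):
    """Join each order to its customer (many-to-one) and add a derived column."""
    by_key = {c[key]: c for c in customers}  # lookup on the "one" side
    related = []
    for order in orders:
        customer = by_key[order[key]]
        row = {**order, "education": customer["education"]}
        # Calculated column: order amount as a percentage of customer income.
        row["pct_of_income"] = round(100 * order["amount"] / customer["income"], 2)
        related.append(row)
    return related


rows = relate(orders, customers)
for row in rows:
    print(row["order_id"], row["education"], row["pct_of_income"])
```

Once the rows are related in this way, any visual can group the derived column by a field from either table, which is what a relationship makes possible in a report.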
1.7 Creating Simple Visualizations
Visualizations are used to present your data effectively and are the basic building
blocks of any Business Intelligence tool. Power BI contains various default data visualization
components, ranging from simple bar charts, pie charts, and maps to more complex visuals
such as waterfalls, funnels, gauges, and many others.
In Power BI, you can create a visualization in two ways. The first is to add a visual from the
right side pane to the Report Canvas. By default, the table type visualization is selected in
Power BI. The other way is to drag fields from the right sidebar to the axis and the values
under Visualization. You can add multiple fields to each axis as required.
In Power BI, it is also possible to move your visualization on the reporting canvas by clicking
and then dragging it. You can also switch between different types of charts and visualizations
from the Visualization pane. Power BI attempts to convert your selected fields to the new
visual type as closely as possible.
1. Area charts
2. Line charts
3. Bar charts
4. Column charts
5. Pie charts
6. Doughnut charts
7. Gauge charts
8. Funnel charts
9. Scatter charts
10. Bubble charts
CHAPTER 2
POWER BI DASHBOARD
A single page visualization with multiple charts and graphs to tell a story is called a
Power BI dashboard. This one-page visualization in a dashboard is also known as a Canvas.
The Power BI dashboard is a feature only available in Power BI Service. Since a Power BI
dashboard is limited to one page, it only contains the highlights of a story. You cannot create a
dashboard on Power BI Desktop.
In a dashboard, visualizations are generated from reports, and each report is based on
one dataset. The visualizations present on the dashboard are called tiles, and report designers
pin these tiles to the dashboard.
2.2 Advantages of a Power BI dashboard
● A Power BI dashboard enables users to analyze reports and view all important metrics at a
glance
● Using a Power BI dashboard, users can create visualizations from multiple datasets or
multiple reports
● You can customize dashboards to meet the requirements of any enterprise
● Power BI dashboards can be embedded into applications to provide a unified user
experience
● You can instantly share a dashboard with other colleagues in your organization
For our demo, we’ll be looking at a marketing campaign dataset that contains
personal data about customers. The dataset contains various customer-related details,
including customer ID, education and income, product names, product categories and
subcategories, profit made, and so on. This dataset is commonly known as a Sample
Superstore dataset. We’ll analyze this dataset to compare education and income across
segments, years, and quarters.
The following is the dataset that we’ll be using to create the dashboard:
CARDS:
First, we will create three separate cards to calculate the total IDs, income, and quantity of
different products. From the visualization panel on the right, click on “card,” which will create
an empty card.
Fig:2.2 Dashboard
Drag the “ID” column on to “Fields”, which will give you the total IDs of all customers. You
can edit this card by changing the font size and the color of the value. Also, you can add
a title to this card. Similarly, you can create more cards for total responses, total products,
and total income by dragging the respective columns on to “Fields”.
After that, we’ll add a slicer to our visualization to slice the data based on the “Year_Birth”
column. This will help us filter the data and view it as needed.
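What a card and a slicer compute can be sketched in a few lines of Python. This is only an illustration of the logic, not how Power BI implements it: the sample rows and the helper names card_values and slice_by_year are hypothetical, and only the column names come from the text.

```python
# Hypothetical sample rows; the column names follow the text, the values are invented.
rows = [
    {"ID": 1, "Year_Birth": 1970, "Income": 50000},
    {"ID": 2, "Year_Birth": 1985, "Income": 62000},
    {"ID": 3, "Year_Birth": 1970, "Income": 48000},
]


def card_values(rows):
    """Return the single numbers that the cards would display."""
    return {
        "total_ids": len({r["ID"] for r in rows}),       # distinct customer count
        "total_income": sum(r["Income"] for r in rows),  # sum of a numeric column
    }


def slice_by_year(rows, years):
    """Mimic a slicer: keep only rows whose Year_Birth is selected."""
    selected = set(years)
    return [r for r in rows if r["Year_Birth"] in selected]


print(card_values(rows))
print(card_values(slice_by_year(rows, [1970])))
```

The second call shows the effect of the slicer: the same card definitions are re-evaluated over the filtered rows, so every card on the page changes together.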
Fig:2.3 Cards
LINE CHART:
Suppose you want to visualize and analyze how income varies with education and marital
status. To do this, you’ll create a line and stacked column chart.
Select the line and stacked column chart from the visualization panel and drag the
“Year_Birth” column on to “Shared Axis.” Put “Education” under “Column” values, and
insert the “Income” column into “Line” values. That will generate the graph. You can edit the
color of the bars and the line under the format tab.
After that, we’ll add a slicer to our visualization to slice the data based on the “Marital
Status” column. This will help us filter the data and view it as needed.
In the resulting chart, the line represents income. The graph shows that income is highest
for the Widow marital status. Power BI Desktop also provides a feature to
drill down into the data and view the graph by quarter, month, and day. This
option is available at the top of a chart when you select it.
Fig:2.5 Stacked column chart
From the visualization above, we can see that November had the most sales. You can also add
labels to the graph by clicking the “Format” tab and switching on the data labels.
Now, let’s create a clustered column chart to analyze annual and quarterly sales. From the
Visualizations menu, select the “Clustered Column Chart.” Add the “Year” column on to the
axis. Put the “Quarter” column under “Legend,” and add the “Sales” column to “Value.”
Fig:2.7 The clustered bar chart
The graph above shows that the first quarter of every year had the lowest sales
numbers, while the fourth quarter was consistently the highest. You also have the option to
change the colors on your graph to make it more visually appealing.
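The grouping behind a clustered column chart can be sketched as follows. This is a minimal Python illustration under stated assumptions: the sales records are invented, and cluster_totals is a hypothetical helper, not a Power BI function. It sums “Sales” for every (“Year”, “Quarter”) pair, which corresponds to one column per quarter inside each year cluster.

```python
from collections import defaultdict

# Hypothetical sales records; values are invented for illustration.
sales = [
    {"Year": 2020, "Quarter": "Q1", "Sales": 100.0},
    {"Year": 2020, "Quarter": "Q4", "Sales": 300.0},
    {"Year": 2021, "Quarter": "Q1", "Sales": 120.0},
    {"Year": 2021, "Quarter": "Q4", "Sales": 350.0},
    {"Year": 2021, "Quarter": "Q4", "Sales": 50.0},
]


def cluster_totals(records):
    """Sum Sales per (Year, Quarter): one column per quarter inside each year cluster."""
    totals = defaultdict(float)
    for rec in records:
        totals[(rec["Year"], rec["Quarter"])] += rec["Sales"]
    return dict(totals)


totals = cluster_totals(sales)
for (year, quarter), value in sorted(totals.items()):
    print(year, quarter, value)
```

Comparing the totals within one year is then the same comparison the reader makes visually between neighbouring columns in the chart.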
DONUT CHART
Next, let’s look at how to create a donut chart to analyze products by segment. Under
visualizations, click on the donut chart. Under legend, add the segment column. Drag the
“NumCatalogPurchases” column on to “Values” and “Revenue” on to “Tooltips.”
Fig:2.8 Donut chart for different products
The donut chart above indicates that the NumStorePurchases segment had the highest amount
of sales and profit, while the NumDealsPurchases segment had the lowest. You can format
the visualization as per your requirements to make it more visually attractive.
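A donut or pie chart draws each segment in proportion to its share of the total. The sketch below, in Python, shows that calculation under stated assumptions: the totals per purchase column are invented numbers, and donut_shares is a hypothetical helper used only for illustration.

```python
# Hypothetical totals per purchase column; the numbers are invented.
channel_totals = {
    "NumStorePurchases": 500,
    "NumCatalogPurchases": 300,
    "NumDealsPurchases": 200,
}


def donut_shares(totals):
    """Return each segment's percentage of the whole, as a donut or pie chart draws it."""
    grand_total = sum(totals.values())
    return {name: round(100 * value / grand_total, 1) for name, value in totals.items()}


shares = donut_shares(channel_totals)
largest = max(shares, key=shares.get)
smallest = min(shares, key=shares.get)
print(shares, largest, smallest)
```

Reading off the largest and smallest share is the numerical counterpart of spotting the widest and narrowest segments in the chart.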
PIE CHART
Now, we’ll move on to creating a pie chart, which will enable us to visualize the income for
different education profiles. Move the “Education” column on to “Legend” and add the
“Income” column under “Values.” Drag the “Sales” column on to “Tooltips.”
The Graduation category generated the highest amount of income, while Basic made the
lowest. The “Format” tab is used to change the colors and make any necessary edits.
Let us now visualize our data at a more granular level by analyzing sales, profit, and units
sold at regional and state levels. First, we’ll create three slicers to filter our data. Click on the
slicer from the visualizations panel and add the “Region” column on to “Field” to create a
regional slicer. Similarly, you can create state and year slicers. You also have the option to
edit the slicers to change the font size and add a background
color.
Fig:2.10 Tables
We’ll now create a table to check the quantity and the profit made in each city. Under the
“Visualizations” tab, select “Table”. Add the “City” column under “Values,” followed by
quantity and the “Profit” column. Add a background color to the column headers and increase
the size of the font. You can filter the data by selecting the items from each of the slicers.
Below, we have selected the central region and selected the state of Minnesota. In the table,
you can see all the cities in the state of Minnesota, the quantity sold, and the profit they made.
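The combination of slicers and a table visual amounts to filtering rows and then totalling them per city. The following Python sketch illustrates this under stated assumptions: the order lines, city names, and figures are invented, and city_table is a hypothetical helper rather than anything Power BI exposes.

```python
from collections import defaultdict

# Hypothetical order lines; cities and figures are invented for illustration.
orders = [
    {"Region": "Central", "State": "Minnesota", "City": "Minneapolis", "Quantity": 5, "Profit": 40.0},
    {"Region": "Central", "State": "Minnesota", "City": "Rochester", "Quantity": 2, "Profit": 12.5},
    {"Region": "Central", "State": "Minnesota", "City": "Minneapolis", "Quantity": 3, "Profit": 10.0},
    {"Region": "West", "State": "California", "City": "Los Angeles", "Quantity": 7, "Profit": 90.0},
]


def city_table(orders, region, state):
    """Apply the Region and State slicers, then total Quantity and Profit per City."""
    table = defaultdict(lambda: {"Quantity": 0, "Profit": 0.0})
    for o in orders:
        if o["Region"] == region and o["State"] == state:
            table[o["City"]]["Quantity"] += o["Quantity"]
            table[o["City"]]["Profit"] += o["Profit"]
    return dict(table)


result = city_table(orders, "Central", "Minnesota")
for city, totals in sorted(result.items()):
    print(city, totals["Quantity"], totals["Profit"])
```

Rows outside the selected region and state never reach the table, which is why changing a slicer changes both which cities appear and what their totals are.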
TREEMAP
Next, let’s use a Treemap chart to find which of the “AcceptedCmps” columns has the
highest value. Select the Treemap chart from visualizations and drag all the “AcceptedCmps”
columns on to the values.
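A treemap sizes one rectangle per value, so finding the maximum comes down to totalling each column and comparing. The Python sketch below assumes, purely for illustration, three 0/1 flag columns named AcceptedCmp1 to AcceptedCmp3 with invented values; treemap_areas is a hypothetical helper.

```python
# Hypothetical rows: one 0/1 flag per campaign, with assumed column names AcceptedCmp1..3.
rows = [
    {"AcceptedCmp1": 1, "AcceptedCmp2": 0, "AcceptedCmp3": 1},
    {"AcceptedCmp1": 0, "AcceptedCmp2": 0, "AcceptedCmp3": 1},
    {"AcceptedCmp1": 1, "AcceptedCmp2": 1, "AcceptedCmp3": 1},
]


def treemap_areas(rows):
    """Total each campaign column; a treemap sizes one rectangle per total."""
    columns = rows[0].keys()
    return {col: sum(r[col] for r in rows) for col in columns}


areas = treemap_areas(rows)
best = max(areas, key=areas.get)
print(areas, best)
```

The column with the largest total would occupy the largest rectangle in the treemap.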
AREA CHART
Next, let’s verify the annual profit by category using an area chart. Select the area chart
from visualizations and drag the “Year” column on to the axis, the “Category” column on to the
legend, and “Profit” on to “Values.”
Fig:2.12 Area chart using Power BI
You can analyze the profit and quantity sold for each product category and subcategory. You
can also create a funnel. Users can explore their many options through the dashboard. If you
want to go into detail and create a more granular report, you can do that by creating a
report based on the products’ categories and subcategories, as shown in the report below. You
can use category and subcategory as your slicers and create different line charts, bubble
charts, treemaps, and pie charts to visualize the data.
We’ll now publish our overall sales report on Power BI Service and create a dashboard. Save
your reports on Power BI Desktop. Then, click on the home menu and select the “Publish”
button.
You can also create a new workspace or publish the report to an existing workspace in Power
BI Service.
Fig:2.14 Publishing to Power BI successfully
R INSTALLATION AND CONFIGURATION:
When you open Power BI Desktop, it tries to locate an R installation on your
system. You can verify whether the correct installation has been detected by opening the
Options dialog box, available by going to File > Options and settings > Options. The
following figure shows the Options dialog box with the R scripting section selected, as it
appears on my system.
Fig:2.15 Configuration
When using an R script to import data, the script must return at least one data frame
that serves as the basis for your imported table. If the script returns multiple data frames, you
can choose which ones to include as part of the import process. Power BI will create a table
for each imported data frame.
To use an R script to import data into Power BI Desktop, click the “Get Data” button on the
Home ribbon. When the “Get Data” dialog box appears, navigate to the “R script” option in
the Other category, as shown in the following figure.
When you click the Connect button, Power BI Desktop launches the R script dialog
box, where you can type or paste your R script. For the first example, use an R
script that retrieves data from the gapminder dataset (available from the gapminder package
on CRAN) and assigns it to the gm variable:
Fig:2.17 R Script
Before entering your script into the R script dialog box, you should test it in your IDE to
ensure that it runs properly and returns the results you expect. If your script generates an error
within Power BI Desktop, it can be very difficult to make sense of it. Plus, you have to start
over with the import process.
When the above R script is executed, a pop-up window shows the gapminder dataset.
Since the gapminder dataset doesn’t contain any missing data, no preprocessing is required.
The Navigator dialog box lists the data frames in the Display Options section, under the R[1]
folder. You must select the checkbox associated with the “gm” data frame you want to import.
Once you’ve made your selection, click Load.
Fig:2.18 Viewing the dataset
After Power BI Desktop loads the data, you can view the dataset in Data view, as
shown in the following figure.
Creating visualizations using an R script :
Another great feature in Power BI Desktop is the ability to use R to create visualizations
within Report view.
To add an R-based visualization, go to Report view and click the R button on the
Visualizations pane. The first time you do this, you’ll be prompted to enable script
visualizations, as shown in the following figure. Simply click the Enable button.
The script uses the ggplot function in the ggplot2 package (be sure to install the ggplot2
package if you haven’t already) to create a visualization with the specified colors and labels.
When you add the columns, Power BI Desktop inserts several comments in the R script editor
pane. The first two comments indicate that a data frame named dataset has been created,
based on the columns you added to the Values section of the Visualizations pane. You must
use the name dataset to reference the source data in your R script.
Fig:2.21 R script for visualization
After you define the R script, click the Run script button at the top of the R script
editor pane. Power BI Desktop processes the script and displays the visualization in the
placeholder added earlier, when you first clicked the R button. The following figure shows
how the visualization appears on my system.
CHAPTER 3
Output of Dashboard - 1 :
Fig:2.23 Dashboard - 1
Inference:
● The Graduation category generated the highest amount of income, while Basic made the
lowest.
● The slicer helps to filter the visualization according to the year.
● The bar chart between Year_Birth and Marital_Status represents the relationship between the
columns.
Output of Dashboard - 2
Fig:2.24 Dashboard - 2
Inference:
● The donut chart above indicates that the NumStorePurchases segment had the highest amount
of sales and profit, while the NumDealsPurchases segment had the lowest.
Output of Dashboard - 3
Fig:2.25 Dashboard - 3
Inference:
CHAPTER 4
CONCLUSION
4.1 The following are the reasons why Power BI is so popular and needed in the BI domain:
1. Easy access to large volumes of data
Power BI can access vast volumes of data from multiple sources. It allows you to view,
analyze, and visualize vast quantities of data that cannot be opened in Excel. Some of
the important data sources available for Power BI are Excel, CSV, XML, JSON, PDF,
etc. Power BI uses powerful compression algorithms to import and cache the data
within the .PBIX file.
Using Power BI with Azure allows you to analyze and share massive volumes of data.
An Azure data lake can reduce the time it takes to get insights and increase
collaboration between business analysts, data engineers, and data scientists.
REFERENCES
◆ https://www.simplilearn.com/tutorials/power-bi-tutorial/what-is-power-bi
◆ https://app.powerbi.com/embed?dashboardId=
◆ https://www.edureka.co/blog/what-is-power-bi/
◆ https://www.microsoft.com/en-in/p/power-bi-desktop/9ntxr16hnw1t
◆ https://www.tutorialspoint.com/power_bi/power_bi_introduction.htm
◆ https://docs.microsoft.com/en-us/dax/dax-function-reference
◆ https://www.tutorialspoint.com/power_bi/dax_basics_in_power_bi.htm
◆ https://learn.radacad.com/course/basics-of-power-bi-modeling/
◆ https://mindmajix.com/power-bi-visualization-types
◆ https://www.tutorialspoint.com/power_bi/power_bi_visualization_options.htm
◆ https://radacad.com/basics-of-m-power-query-formula-language
◆ https://www.red-gate.com/simple-talk/databases/sql-server/bi-sql-server/power-bi-introduction-working-with-r-scripts-in-power-bi-desktop-part-3/