using-microsoft-power-bi-with-aws
using-microsoft-power-bi-with-aws
© 2020 Amazon Web Services, Inc. or its affiliates. All rights reserved.
Contents
Introduction ..........................................................................................................................1
The Microsoft Power BI Suite..............................................................................................2
Power BI Desktop.............................................................................................................2
Power BI service ..............................................................................................................3
Power BI Report Server ...................................................................................................3
Microsoft On-premises data gateway ..............................................................................3
Connecting Power BI Desktop to AWS data sources ........................................................3
Using Power BI Desktop on premises .............................................................................4
Using Power BI Desktop in the AWS Cloud ..................................................................13
Summary of Power BI Desktop connectivity options ....................................................17
Connecting the Power BI service to AWS data sources ..................................................17
Recommended configuration .........................................................................................18
Additional considerations ...............................................................................................19
Using Power BI Report Server in AWS.............................................................................21
Using Microsoft Power BI with Amazon QuickSight .........................................................26
Conclusion .........................................................................................................................27
Contributors .......................................................................................................................28
Further reading ..................................................................................................................28
Document Revisions..........................................................................................................28
Appendix: Microsoft Power BI supported AWS data sources ..........................................28
Abstract
This whitepaper discusses how to integrate and use Microsoft Power BI (Desktop,
Report Server, Service, and On-premises data gateway) with the AWS Cloud. It
presents options for customers looking to connect Microsoft Power BI products to AWS
services such as Amazon Redshift, Amazon Athena, and Amazon RDS, with a focus on
connectivity, security, performance, and cost optimization.
This whitepaper is for IT decision makers and architects looking to quickly understand
Microsoft Power BI concepts and what options exist to make use of those technologies
when using AWS services as data sources.
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Introduction
Customers of all sizes are using AWS products and services to store their data reliably,
cost effectively, and securely. This is due in part to the broad ecosystem of mature data
storage and analytics offerings that are available. Some of these offerings include the
following services:
AWS gives customers full flexibility in mixing the technologies they prefer for their data
needs. While many customers choose Amazon QuickSight for their business
intelligence (BI) needs, other customers choose vendors such as Microsoft Power BI,
Tableau, and Qlik.
This document focuses on the Microsoft Power BI suite of products and services, and
how to use them in combination with AWS services.
To better understand how services relate to one another, we often label data services
as either being data sources or data consumers. A data source allows customers and
applications to store and retrieve data from the service. Frequently, data sources also
have built-in compute and can provide computational analysis and filtering. But,
ultimately, data is loaded into these data sources and eventually data is retrieved from
them by data consumers. Amazon S3, Amazon Athena, and Amazon Redshift are good
examples of data sources.
1
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Data consumers, on the other hand, access the data from data sources and, typically,
process it. They might optionally display it too. Amazon QuickSight and the Microsoft
Power BI suite are good examples of data consumers. They read from data sources,
and then assist in the analysis, visualization, and publication of information.
It’s easy to see how a data consumer can quickly become a data source for another
upstream service. However, for the purposes of this paper, we consider Microsoft
Power BI a data consumer, and the AWS services such as Amazon RDS, Amazon
Redshift, and Amazon Athena, as data sources.
Power BI Desktop
Power BI Desktop is a free application you install on your local computer. It lets you
connect to, transform, and visualize your data. With Power BI Desktop, you can connect
to multiple different sources of data and combine them (often called modeling) into a
data model. This data model lets you build visuals and collections of visuals you can
share as reports with other people inside your organization.
Power BI Desktop can connect to any supported data source that is available locally or
over the network. For supported data sources, see Microsoft Power BI supported AWS
data sources.
Most users who work on business intelligence projects use Power BI Desktop to create
reports. Then they push content to either Power BI Report Server or the Power BI
service in order to share their reports with others. The act of pushing content from
Power BI Desktop to the Power BI Report Server or the Power BI service is known as
publishing. For more information, see What is Power BI Desktop?
2
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Power BI service
Microsoft Power BI is a collection of software services, apps, and connectors that work
together to help you create, share, and consume business insights in a way that serves
you and your business most effectively. The Power BI service, sometimes referred to as
Power BI online, is the software as a service (SaaS) part of Microsoft Power BI.1
The Power BI service is a cloud-based service. It supports light report editing and
collaboration for teams and organizations. You can connect to data sources in the
Power BI service too, but modeling is limited.
Most report designers who work on business intelligence projects use Power BI
Desktop to create reports, and then use the Power BI service to distribute their reports
with others.2
3
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
desktop application is typically running on their local Windows laptop and physical and
logical connectivity to AWS data sources are the biggest perceived barriers to entry.
However, another option exists, which is to run Power BI Desktop in the AWS Cloud.
This option significantly reduces connectivity barriers to AWS data sources, but also
requires some additional considerations. Both models are discussed in this chapter. We
examine the implications of each in relation to connectivity, security, performance, and
costs so that you can decide which option is best for you.
4
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Figure 1 – Microsoft Power BI connectivity to AWS data sources over the internet
Criteria Considerations for accessing AWS data sources using the internet
Network Data sources must be accessible over the internet. Access must be either
connectivity through a public IP address (Amazon RDS, Amazon Redshift, Amazon EC2
based data sources), or by a regional service endpoint (Amazon Athena).
5
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Criteria Considerations for accessing AWS data sources using the internet
A security group acts as a virtual firewall for your instance to control inbound and
outbound traffic. In order to limit access to trusted entities, configure security
groups to only allow inbound IP ranges associated with known CIDR ranges.
Encryption in transit
We recommend that you configure encryption for any data sources that use
public IP addresses, such as Amazon RDS, Amazon Redshift, or any Amazon
EC2-based data sources. This ensures that the risk of data or credentials being
compromised while in transit, is reduced. Failure to configure encryption
represents a significant risk. Do not overlook this aspect.
Performance Some factors that might impact the overall Power BI Desktop performance when
accessing AWS data sources over the internet include:
• The size of the dataset being accessed. Larger datasets take longer to
retrieve. We recommend limiting queries and using filters to reduce the
amount of data retrieved over the internet.
6
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Criteria Considerations for accessing AWS data sources using the internet
Cost Data sources that reside in a VPC and are queried using public IP address over
the internet incur standard AWS VPC data egress charges. In order to reduce
costs, we recommend limiting queries and using filters to reduce the amount of
data retrieved over the internet.
Site-to-Site VPN creates encrypted tunnels between your network and your Amazon
Virtual Private Cloud (Amazon VPN) or AWS Transit Gateway. Client VPN connects
your users to AWS or on-premises resources using a free VPN software client.
VPN traffic from both Site-to-Site VPN and Client VPN connections terminates in your
VPC. As such, it can route to private IP addresses so your instances no longer need
public-facing IP addresses. For services with a data path accessible from a publicly
facing service endpoint, such as Athena, these service requests can either be routed
over the internet, or over the VPN connection and through a VPC endpoint.
7
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Figure 2 – Connecting Power BI Desktop to AWS data sources over Site-to-Site VPN
and Client VPN
Site-to-Site VPN can also connect to AWS Transit Gateway, facilitating access to data
sources spread across multiple VPCs.
Using AWS VPN provides the benefit of employing encryption when accessing data
sources stored in AWS, without requiring that each data source to be explicitly
configured. Once configured, VPN technology is largely seamless to end users.
Criteria Considerations for accessing AWS data sources using AWS VPN
8
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Criteria Considerations for accessing AWS data sources using AWS VPN
You can use a combination of routing and security groups to control access
to data sources stored in the AWS Cloud.
Encryption in transit
Both types of AWS VPN use IPsec encryption, meaning that data transferred
is encrypted as it travels between AWS and on premises. This ensures that
even if data sources are not configured to use encrypted communications,
that data is protected while traversing the internet.
Authentication
On the other hand, Client VPN does require authentication by the end users
in order to establish the connection. Client VPN authentication can take
place via Active Directory (user-based), mutual authentication (certificate-
based), or SAML SSO (user-based). Once authenticated, the connection is
seamless to the end user. AWS data sources added to Power BI Desktop
require authentication.
9
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Criteria Considerations for accessing AWS data sources using AWS VPN
Performance The use of AWS VPN occurs over the internet. As such, its performance
envelope is similar to the first scenario presented. Some factors can impact
the overall Power BI Desktop performance when accessing AWS data
sources over the internet. They include:
• The size of the dataset being accessed. Larger datasets take longer
to retrieve. We recommend limiting queries and using filters to reduce
the amount of data retrieved over the internet.
Cost Data sources that reside in a VPC and are queried using AWS VPN incur
standard AWS VPN data transfer charges. To reduce costs, we recommend
limiting queries and using filters to reduce the amount of data retrieved over
the internet.
In this model, the customer’s on-premises network is connected through AWS Direct
Connect directly to the AWS network. While there are multiple methods for configuring
AWS Direct Connect, in its simplest mode, you are given access to IP ranges within a
10
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
VPC using constructs known as a Private Virtual Interfaces (Private VIF). You access to
the internet/public IP ranges using a Public Virtual Interface (Public VIF).
Figure 3 – Connecting Power BI Desktop to AWS data sources over AWS Direct Connect
When adding data sources in Microsoft Power BI, you select the private IP address if it’s
located in a VPC, or the public endpoint if it’s a regional service.
Network Once you configure Direct Connect, it can access data sources by connecting
connectivity to private IPs in a VPC, or by using a regional service endpoint.
11
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
You can use a combination of routing and security groups to control access to
data sources stored in the AWS Cloud.
Encryption in transit
Authentication
Performance With Direct Connect, 1 Gbps and 10 Gbps ports are available. You can order
speeds of 50 Mbps, 100 Mbps, 200 Mbps, 300 Mbps, 400 Mbps, and 500
Mbps from any APN Partners that support Direct Connect.
Be mindful of loading large datasets, and note that Power BI Desktop has a
10-GB dataset limit.
12
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Cost In addition to standard Direct Connect port charges, data sources that are
accessed using a Direct Connect connection incur standard Direct Connect
data transfer charges, as explained in AWS Direct Connect Pricing. Data sent
to the AWS Cloud over Direct Connect does not incur any charges.
Although the diagram depicts user connectivity to the desktop occurring through the
internet, AWS VPN and Direct Connect are both valid connection types too. Also,
13
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
In this model, Power BI Desktop is hosted within the Amazon VPC and has direct
network connectivity to data sources with private IP addresses, such as Amazon RDS
and Amazon Redshift. You can connect to Amazon Athena and other regional services
by using a VPC endpoint connection as the destination (pictured in the diagram), or by
using the regional public service endpoint.
There are a number of options for hosting Power BI Desktop in the AWS Cloud:
In this option, you provision Microsoft Windows Desktops with Power BI Desktop
installed for your users. Users can access the Windows desktop using a light-weight
client application that is available for Android, iOS, Fire, Mac, PC, Chromebook, and
Linux devices.
With Amazon WorkSpaces, Power BI Desktop users have dedicated resources, and
desktops can be automatically stopped to save costs. This option also has the
advantage of allowing non-Windows devices to access Amazon WorkSpaces, as well
as providing users a desktop environment for performing other tasks, such as accessing
Microsoft Office or other Windows applications.
14
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
In this option, you provide your on-premises users just the Power BI Desktop
application, streamed to an HTML5-compliant web browser, with no plugins required.
Like the Amazon WorkSpaces option, non-Windows devices can connect and users
have access to dedicated resources.
Network Network connectivity to data sources is straight forward because both the data
connectivity consumer and the data sources reside in the AWS Cloud. Data sources that live
in an Amazon VPC, such as Amazon RDS and Amazon Redshift, can be
accessed directly. Data sources that use regional endpoints can be accessed
through the Amazon VPC internet gateway, or an Amazon VPC endpoint.
Connectivity to Power BI Desktop occurs over the internet, AWS VPN, or AWS
Direct Connect. All three options have modest requirements that most internet
connections can meet.
15
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
For the first option (using Amazon EC2), you can additionally use a combination
of routing and security groups to allow only specific on-premises CIDR ranges.
Encryption in transit
We recommend that data sources within an Amazon VPC are configured to use
encryption for transmission of data. Regional services already make use of TLS
encryption.
Authentication
We recommend that you authenticate with AWS data sources using an identity
that has read-only access only to the datasets required.
All three options require the use of an authentication mechanism in order for the
remote user to prove their identity before accessing the environment that hosts
Power BI Desktop. This mechanism is typically a user name and password, but
MFA options are available as well.
Performance When running in the AWS Cloud, Power BI Desktop performance is typically
better than when run on premises. Both networking and compute instances are
likely to be modern and have higher specifications, and there is likely to be low
latency between the application and the data sources.
16
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Cost By placing Power BI Desktop in the AWS Cloud, data transfer charges can be
significantly reduced or removed entirely. Use care when accessing data
sources across Availability Zones, in different VPC’s, or in different Regions,
because data transfer charges can apply.
For each option, there are additional cost considerations that need to be
factored in. Amazon EC2, Amazon WorkSpaces, and Amazon AppStream 2.0
are priced based on usage. For more information, see the pricing page for each
service.
As the number of users increase, we recommend that you consider connectivity through
AWS Direct Connect. Direct Connect provides a better user experience when loading
larger datasets. Make sure that users are aware of the cost implications of transferring
large datasets.
We recommend that you evaluate running Power BI Desktop in the AWS Cloud. This is
likely to provide both the best performance experience for the end user and the best
management experience for cloud administrators. Solutions using Amazon WorkSpaces
and Amazon AppStream 2.0 in particular can scale from a small number of users to
thousands of users. These services also provide significant security and management
benefits.
17
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
the VPC and configured with Microsoft Power BI credentials. The gateway establishes
an outbound connection to the Azure Service Bus over the internet, and is configured in
Microsoft Power BI to connect to data sources that it can access.
The Microsoft On-premises data gateway does not accept inbound connections from
the Azure cloud, and only initiates outbound connections to the Azure Service Bus.3
Recommended configuration
We recommend that you install the Microsoft On-premises data gateway on an Amazon
EC2 instance in the private subnet that contains your data sources. This subnet is
configured to route requests to the internet via an Amazon VPC NAT gateway installed
in a public subnet. You can use a network address translation (NAT) gateway to enable
instances in a private subnet to connect to the internet or to other AWS services, but
prevent the internet from connecting to those instances. If you require a highly available
data gateway implementation, we recommend using a cluster of on-premise data
gateways installed across multiple EC2 instances that span different AWS Availability
Zones. For information, see Add another gateway to create a cluster.
18
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Additional considerations
Considerations for the Power BI service with data sources in the AWS
Criteria Cloud
19
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Considerations for the Power BI service with data sources in the AWS
Criteria Cloud
You can use a combination of routing and security groups to control access to
data sources stored within the AWS Cloud.
Encryption in transit
Authentication
Make sure that Microsoft Power BI credentials are securely controlled. Access
to the service permits access to AWS data sources and potentially sensitive
information they might contain.
Performance Microsoft On-premises data gateway in the AWS Cloud typically performs well
due to the ability to size and scale up the Amazon EC2 instance. It also
performs fast in Region networking and connectivity to the internet.
20
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Considerations for the Power BI service with data sources in the AWS
Criteria Cloud
Cost Three factors need to be considered: Amazon EC2 instance charges, data
transfer charges, and Amazon NAT gateway charges.
Data transferred from the Microsoft On-premises data gateway to the Power
BI service incurs VPC egress charges. We recommend that you limit queries
and use filters to ensure that only relevant data is transferred.
Power BI Report Server can also connect to data sources that reside within the AWS
Cloud. For a list of supported data sources, see Power BI report data sources in Power
BI Report Server.
There are two licensing options available for Power BI Report Server: Microsoft Power
BI Premium and SQL Server Enterprise Edition with Software Assurance. If you use
SQL Server Enterprise Edition with Software Assurance, you must bring your eligible
license to AWS using the Microsoft License Mobility benefit, as part of your active
Microsoft Software Assurance contract. This type of license is not included as part of
21
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
any Amazon Machine Images (AMIs). For more information about licensing, see
Licensing Power BI Report Server.
You can deploy Power BI Report Server in an Amazon EC2 instance that resides within
a VPC subnet. The following diagram shows Power BI Report Server deployed in a
public subnet so that users can connect to it over the internet. If you have AWS Site-to-
Site VPN, or AWS Direct Connect, you can place Power BI Report Server in a private
subnet instead.
The following table outlines some common considerations when running Power BI
Report Server in the AWS Cloud.
22
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Network Before deploying the instance, you must choose how Power BI Desktop
connectivity users and the report audience will connect to the instance.
23
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
In order to limit access to trusted entities, you must configure the security
group associated with the instance hosting Power BI Report Server so that it
only allows inbound IP ranges associated with trusted CIDR ranges.
Encryption in transit
We recommend that you configure Power BI Report Server Web Service and
web portal to use HTTPS. This ensures that connections from Power BI
Desktop and report viewers are encrypted in transit. For more information,
see Configure TLS connections on a native mode report server.
24
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Performance Before deploying Power BI Report Server in AWS, consider the following:
Cost Consider the following cost factors when deploying Power BI Report Server
in AWS:
25
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
• With QuickSight, there’s no need to download and install a client application. All
functionality, including authoring and reporting, can be accessed from any
platform (Windows, Mac, Linux, etc.) by a web browser.
• It’s easy to get started in small or large settings, with the ability to add users from
a point-and-click interface within QuickSight. No external administrator
intervention needed.
26
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
Conclusion
If you’re looking to use Power BI Desktop, we generally find that customers start
experimenting with the software on premises, connecting to data sources over the
internet. While private connectivity options exist for using AWS VPN and Amazon Direct
Connect, many customers have concluded that running Power BI Desktop in the AWS
Cloud provides a better performing experience. You can experiment between using
Amazon EC2 instances, Amazon WorkSpaces, and Amazon AppStream 2.0 – and a
mix of options is often employed within the same customer.
Customers interested in using Microsoft Power BI with AWS have numerous options
available to them. This paper has outlined strategies for using Power BI Desktop, the
Power BI service with on-premises data gateway connector, and Power BI Report
Server.
If you want to use Power BI Desktop for a small number of users, connectivity over the
internet is possible. However, as organizations scale, they typically rely on either Direct
Connect, or moving Power BI Desktop into the AWS Cloud. You can experiment with
using Amazon EC2 instances, Amazon WorkSpaces, and Amazon AppStream 2.0. A
mix of these options is often employed in the same organization.
If you want to connect data sources in AWS to the Power BI service, you should feel
comfortable knowing that this is an established architectural pattern. You can install the
Microsoft On-premises data gateway within an Amazon VPC and connect data sources
such as Amazon RDS, Amazon Redshift, and Amazon Athena seamlessly to the
service.
If you plan on using Power BI Report Server in AWS, there is also an established path
forward. You can install the Power BI Report Server within an Amazon VPC, close to
AWS data sources, and there are connectivity options for both authors and report users.
27
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
If you want a solution that provides the same business outcomes, without the added
complexity of installing, configuring, patching, and scaling self-managed BI solutions,
we recommend Amazon QuickSight. This fully managed service combines all the
required functionality in a simple web browser experience with pay-per-user pricing.
There is nothing to install and no additional components are required.
Hopefully, this is just the start of your business intelligence journey with AWS. For
additional resources to help you get started, see Appendix: Microsoft Power BI
supported AWS data sources. If you need hands-on help or have additional questions,
contact us.
Contributors
Contributors to this document include:
Further reading
• Integrate Power BI with Amazon Redshift for insights and analytics
Document Revisions
Date Description
November 2020 First publication
28
Amazon Web Services Using Microsoft Power BI with the AWS Cloud
• Amazon Athena uses standard SQL and can be queried using an ODBC or
JDBC connection, so it is not specifically listed. Drivers are provided by AWS for
common operating systems. To get started, see Connecting to Amazon Athena
with ODBC and JDBC Drivers.
For a full list of supported data sources see Power BI data sources.
Notes
1 https://docs.microsoft.com/en-us/power-bi/fundamentals/power-bi-service-overview
2 https://docs.microsoft.com/en-us/power-bi/fundamentals/service-service-vs-desktop
3 https://docs.microsoft.com/en-us/data-integration/gateway/service-gateway-onprem-
faq
4 The credentials that you enter for a data source are encrypted and stored in the
gateway cloud service. The credentials are decrypted at the gateway on premises.
29