0% found this document useful (0 votes)
244 views19 pages

Modern Data Analytics in Excel - George Mount-301-319

Uploaded by

Nacho ordoñez
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
244 views19 pages

Modern Data Analytics in Excel - George Mount-301-319

Uploaded by

Nacho ordoñez
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 19

calculating profit margin, Example: Calculating Profit

Margin-Example: Calculating Profit Margin


creating, Creating Columns in Power Pivot-Recoding
Column Values with SWITCH()
formatting, Loading the Results to Excel
recoding column values, Recoding Column Values with
SWITCH()

Data Model, Power Pivot and the Data Model-Power Pivot and
the Data Model
DAX measures, Creating DAX Measures-Creating Explicit
Measures, Power Pivot and DAX
explicit measures, Creating Explicit Measures-Creating
Explicit Measures
implicit measures, Creating Implicit Measures-Creating
Implicit Measures

defined, What Is Power Pivot?


interface, A Brief Tour of the Power Pivot Add-In-Settings
Calculations group, Calculations
Data Model group, Data Model
Relationships group, Relationships
settings management, Settings
Tables group, Tables

KPIs, Creating KPIs-Adding the KPI to the PivotTable


adding to PivotTables, Adding the KPI to the PivotTable
adjusting icon styles, Adjusting Icon Styles
creating, Creating KPIs-Creating KPIs

loading, Loading the Power Pivot Add-in-Loading the Power


Pivot Add-in
Power Query versus, Why Power Pivot?
reasons for using, Why Power Pivot?-Why Power Pivot?
relational models, Creating Relational Models in Power Pivot-
Exercises
cardinality, Understanding Cardinality-Why Does Cardinality
Matter?
connecting to data sources, Connecting Data to Power
Pivot-Connecting Data to Power Pivot
creating columns, Creating Columns in Power Pivot-
Recoding Column Values with SWITCH()
creating relationships, Creating Relationships-Creating
Relationships
fact and dimension tables, Identifying Fact and Dimension
Tables-Editing the Relationships
filter direction, Understanding Filter Direction-Filter Direction
and Cardinality
hierarchies, Creating and Managing Hierarchies-Using
Hierarchies in the PivotTable
loading Data Model to Power BI, Loading the Data Model to
Power BI-Viewing the Data in Power BI
loading results to Excel, Loading the Results to Excel-
Loading the Results to Excel
time intelligence functions, Time Intelligence Functions-Creating
Basic Time Intelligence Measures
adding calendar tables, Adding a Calendar Table-Adding a
Calendar Table
creating, Creating Basic Time Intelligence Measures-
Creating Basic Time Intelligence Measures

Power Query, First Steps in Excel Power Query-Exercises


appending data from multiple sources, Merging and Appending
Data in Power Query-Appending the Queries
appending queries, Appending the Queries-Appending the
Queries
connecting to external workbooks, Connecting to External
Excel Workbooks-Connecting to External Excel Workbooks
pros and cons of using Power Query for, Why Power Pivot?

channeling data through before connecting to Power Pivot,


Connecting Data to Power Pivot
columns in, Transforming Columns in Power Query-Exercises
calculating in Power Pivot versus, Calculating in Power
Query Versus Power Pivot
changing case, Changing Column Case-Changing Column
Case
changing data types, Changing Data Types
custom, Creating Custom Columns-Calculated Columns
Versus Measures
dates, Working with Dates-Working with Dates
deleting, The Imported Data, Deleting Columns
delimiting by, Delimiting by Column
replacing headers, Replacing Column Headers
reshaping data, Reshaping Data-Reshaping Data

data profiling, Data Profiling in Power Query-Closing Out of Data


Profiling
closing out of, Closing Out of Data Profiling
defined, What Is Data Profiling?
Power Query Editor, Exploring the Data Preview Options-
Closing Out of Data Profiling

defined, What Is Power Query?


ETL process, Power Query as Excel’s ETL Tool-Load
loading options, Exiting the Power Query Editor
M and, Power Query and M
myths debunked by, Power Query as Excel Myth Buster-“Excel
Can’t Process More Than 1,048,576 Rows”
big data limitations, “Excel Can’t Process More Than
1,048,576 Rows”
null values, “Excel Does Not Have a True null”
reproducibility, “Excel Is Not Reproducible”

Power Pivot versus, Why Power Pivot?


queries, Managing Your Queries-Viewing Query Dependencies
appending, Appending the Queries-Appending the Queries
grouping, Grouping Your Queries
loading from Power Query to Power Pivot, Connecting Data
to Power Pivot
refreshing, Refreshing the Query-Refreshing the Query
viewing dependencies, Viewing Query Dependencies

refreshing results in Excel, Refreshing the Query


relational joins, Understanding Relational Joins-Inner Join: Only
the Matches
inner join, Inner Join: Only the Matches-Inner Join: Only the
Matches
left outer join, Left Outer Join: Think VLOOKUP()-Left Outer
Join: Think VLOOKUP()

replacing values in, Changing Column Case


rows in, Transforming Rows in Power Query-Exercises
filling down blank, Filling Down Blank Rows
fixing typos, Removing the Missing Values
refreshing queries, Refreshing the Query-Refreshing the
Query
removing duplicates, Removing the Missing Values
removing missing values, Removing the Missing Values-
Removing the Missing Values
sorting, Removing the Missing Values
splitting data into, Splitting Data into Rows-Splitting Data
into Rows

Power Query Editor, A Tour of the Power Query Editor-Returning to


the Power Query Editor
Column profiling based on entire data set option, Overriding the
Thousand-Row Limit
Data Preview options, Exploring the Data Preview Options-
Column profile
Column distribution checkbox, The “Column quality” and
“Column distribution”-Cell errors
Column profile checkbox, Column profile
Column quality checkbox, The “Column quality” and
“Column distribution”-Cell errors
Monospaced checkbox, “Monospaced” and “Show
whitespace”
Show whitespace checkbox, “Monospaced” and “Show
whitespace”

exiting, Exiting the Power Query Editor-Exiting the Power Query


Editor
imported data, The Imported Data-The Imported Data
queries list, Queries
returning to, Returning to the Power Query Editor
ribbon menu, The Ribbon Menu-The Ribbon Menu
Add Column tab, The Ribbon Menu
Home tab, The Ribbon Menu
Transform tab, The Ribbon Menu
View tab, The Ribbon Menu

Power View, Power BI as the Third Piece of “Modern Excel”


(see also Power BI)

prompt engineering, Large Language Models and Prompt


Engineering
Python, Python with Excel-Auto-fitting column widths
books about, Reader Prerequisites
downloading, Reader Prerequisites
Excel versus Python charts, Excel versus Python charts
learning about, Reader Prerequisites
openpyxl
overview of, What openpyxl contributes-How to use
openpyxl with pandas
summarizing findings, Summarizing Findings with openpyxl-
Auto-fitting column widths

pandas
data cleaning, Cleaning Up the Data in pandas-Creating a
percentile
overview of, Using Python and Excel Together with pandas
and openpyxl

programming, Python Programming


role of in modern Excel, The Role of Python in Modern Excel-
Package development and distribution
as “glue” language, A Growing Stack Requires Glue
network effect, Network Effects Mean Faster Development
Time
package development and distribution, Package
development and distribution
unit testing, Unit testing
version control, Version control

Python for Excel (Zumstein), Reader Prerequisites


PyXLL package, Other Python Packages for Excel

qualitative variables, Column profile


quantitative variables, Column profile
queries, Managing Your Queries-Viewing Query Dependencies
appending, Appending the Queries-Appending the Queries
grouping, Grouping Your Queries
management of, Power Query and M
natural language querying, Using Analyze Data for AI Powered
Insights
optimization of, Power Query and M
refreshing, Refreshing the Query-Refreshing the Query
viewing dependencies, Viewing Query Dependencies

Queries & Connections options, in Power Query, Returning to the


Power Query Editor, Appending the Queries

RANDARRAY() function, Other Dynamic Array Functions


range parameter, of UNIQUE() function, Finding Distinct and Unique
Values with UNIQUE()
ranges, versus arrays, What Is an Array in Excel?
real-time data, The Growing Complexity of Data and Analytics
refresh process, in Power Query
automating, Power Query and M
refreshing results in Excel, Refreshing the Query

regular expressions, Pattern matching/regular expressions


relational joins, Understanding Relational Joins-Inner Join: Only the
Matches
inner join, Inner Join: Only the Matches-Inner Join: Only the
Matches
left outer join, Left Outer Join: Think VLOOKUP()-Left Outer
Join: Think VLOOKUP()

relational models, Creating Relational Models in Power Pivot-


Exercises
cardinality, Understanding Cardinality-Why Does Cardinality
Matter?
columns
calculating in Power Query versus in Power Pivot,
Calculating in Power Query Versus Power Pivot
calculating profit margin, Example: Calculating Profit
Margin-Example: Calculating Profit Margin
creating, Creating Columns in Power Pivot-Recoding
Column Values with SWITCH()
recoding column values, Recoding Column Values with
SWITCH()

connecting to data sources, Connecting Data to Power Pivot-


Connecting Data to Power Pivot
creating relationships, Creating Relationships-Creating
Relationships
fact and dimension tables, Identifying Fact and Dimension
Tables-Editing the Relationships
filter direction, Understanding Filter Direction-Filter Direction and
Cardinality
hierarchies, Creating and Managing Hierarchies-Using
Hierarchies in the PivotTable
loading Data Model to Power BI, Loading the Data Model to
Power BI
loading results to Excel, Loading the Results to Excel-Loading
the Results to Excel

Relationships group, in Power Pivot, Relationships


report generation, automating, Power Automate, Office Scripts, and
Excel Online
reproducibility
Excel, “Excel Is Not Reproducible”
VBA, “Excel Is Not Reproducible”

return_array parameter, of XLOOKUP() function, XLOOKUP()


Versus VLOOKUP()
rows, in Power Query, Transforming Rows in Power Query-
Exercises
filling down blank, Filling Down Blank Rows
fixing typos, Removing the Missing Values
refreshing queries, Refreshing the Query-Refreshing the Query
removing duplicates, Removing the Missing Values
removing missing values, Removing the Missing Values-
Removing the Missing Values
sorting, Removing the Missing Values
splitting data into, Splitting Data into Rows-Splitting Data into
Rows

Russo, Marco, CALCULATE() with ALL()

SAMEPERIODLASTYEAR() function, Creating Basic Time


Intelligence Measures
schema, for sentiment analysis, Sentiment Analysis with Azure
Machine Learning
scripting, Power Automate, Office Scripts, and Excel Online
seaborn data visualization package, Option B: Insert a Python image
[search_mode] parameter, of XLOOKUP() function, XLOOKUP()
Versus VLOOKUP()
sentiment analysis, Sentiment Analysis with Azure Machine
Learning-Sentiment Analysis with Azure Machine Learning
SEQUENCE() function, Other Dynamic Array Functions
sorting
rows, in Power Query, Removing the Missing Values
SORTBY() function, Sorting Records with SORTBY()-Sorting by
Another Column Without Printing It
sorting by another column without printing, Sorting by
Another Column Without Printing It
sorting by multiple criteria, Sorting by Multiple Criteria

[sort_order1] parameter, of SORTBY() function, Sorting Records with


SORTBY()
[sort_order2] parameter, of SORTBY() function, Sorting Records with
SORTBY()
spaces in data, “Monospaced” and “Show whitespace”
#SPILL errors, Dynamic array functions
spill operator (#), Using the Spill Operator
star schema, Arranging the Diagram View
static array formulas, Static array formulas
static array references, Static array references
streaming data, The Growing Complexity of Data and Analytics
structured references, Updating Table Ranges
Supercharge Excel (Allington), Power Pivot and DAX
Sweigart, Al, Python Programming
SWITCH() function
conditional columns versus, Recoding Column Values with
SWITCH()
recoding column values, Recoding Column Values with
SWITCH()
T

Table View, in Power BI, Viewing the Data in Power BI


Tableau, Adjusting Icon Styles
tables, Tables: The Portal to Modern Excel-Exercises
cardinality and, Understanding Cardinality-Why Does Cardinality
Matter?
fact and dimension tables, Identifying Fact and Dimension
Tables-Editing the Relationships
footers, Viewing the Table Footers-Viewing the Table Footers
formatting, Formatting Excel Tables
headers, Creating and Referring to Table Headers-Creating and
Referring to Table Headers
naming, Naming Excel Tables
organizing data for analytics, Organizing Data for Analytics
updating ranges, Updating Table Ranges

Tables group, in Power Pivot, Tables


text
changing case of, Changing Column Case-Changing Column
Case
changing from Whole Number data type to, Changing Data
Types
converting images to, Reading Data from an Image-Reading
Data from an Image

Text to Columns feature, in Excel, Splitting Data into Rows


TEXTSPLIT() function, Other Dynamic Array Functions
tidy data rules, Organizing Data for Analytics, Reshaping Data,
Using Analyze Data for AI Powered Insights
time intelligence functions, Time Intelligence Functions-Creating
Basic Time Intelligence Measures
adding calendar tables, Adding a Calendar Table-Adding a
Calendar Table
creating, Creating Basic Time Intelligence Measures-Creating
Basic Time Intelligence Measures

total row, in tables, Viewing the Table Footers


TOTALYTD() formula, Creating Basic Time Intelligence Measures
trailing spaces, “Monospaced” and “Show whitespace”
Transform Data button, in Power Query, Connecting to External
Excel Workbooks
transformations
columns, Transforming Columns in Power Query-Exercises
ETL process, Transform
rows, Transforming Rows in Power Query-Exercises

Treacy, Mynda, Option B: Insert a Python image


Tufte, Edward, Time Intelligence Functions
typos, fixing in Power Query, Removing the Missing Values

unique values
distinct values versus, Finding Unique Versus Distinct Values
finding, Finding Distinct and Unique Values with UNIQUE()

UNIQUE() function, Dynamic array functions-Finding Unique Versus


Distinct Values
finding distinct and unique values with, Finding Distinct and
Unique Values with UNIQUE()
finding unique versus distinct values, Finding Unique Versus
Distinct Values

unit testing, Unit testing


unpivoting columns, in Power Query, Reshaping Data-Reshaping
Data
unstructured data, The Growing Complexity of Data and Analytics
updates, regular, Power Query and M
UPPER() function, Creating and Referring to Table Headers
USERELATIONSHIP() function, Adding a Calendar Table

variables
assigning values to, LET() and LAMBDA()
optimizing code efficiency and readability using, Power Pivot
and DAX
qualitative, Column profile
quantitative, Column profile

VBA (Visual Basic for Applications), “Excel Is Not Reproducible”


version control, Version control
VLOOKUP() function, Understanding Relational Joins, Why Power
Pivot?, XLOOKUP() Versus VLOOKUP()
VSTACK() function, Other Dynamic Array Functions

whitespace, “Monospaced” and “Show whitespace”, Splitting Data


into Rows
Whole Number data type, changing to text, Changing Data Types
Wickham, Hadley, Organizing Data for Analytics
width property, Creating a summary worksheet
workbooks, Excel
connecting to with Power Query, Connecting to External Excel
Workbooks-Connecting to External Excel Workbooks
importing to Power BI, Importing the Data Model to Power BI

XLMiner add-in, Building Statistical Models with XLMiner-Building


Statistical Models with XLMiner
XLOOKUP() function, Understanding Relational Joins, Creating
Modern Lookups with XLOOKUP()-XLOOKUP() and Looking Up to
the Left
basic lookups, A Basic XLOOKUP()
error handling and, XLOOKUP() and Error Handling
looking up to the left, XLOOKUP() and Looking Up to the Left
pros and cons of combining data sources using, Why Power
Pivot?
VLOOKUP() versus, XLOOKUP() Versus VLOOKUP()

XlsxWriter package, Other Python Packages for Excel


xlwings package, Other Python Packages for Excel

Zumstein, Felix, Reader Prerequisites


Zwingmann, Tobias, Azure and Cloud Computing
About the Author
George Mount is the founder and CEO of Stringfest Analytics, a
consulting firm specializing in analytics training. George speaks
regularly on this topic and maintains a blog at
stringfestanalytics.com.
In addition to being the author of Advancing into Analytics: From
Excel to Python and R (O’Reilly, 2021), he has been recognized as a
Microsoft Most Valuable Professional (MVP) for his contributions to
the community and technical expertise in Excel.
George holds a bachelor’s degree in economics from Hillsdale
College, as well as master’s degrees in finance and information
systems from Case Western Reserve University. He currently
resides in Cleveland, Ohio.
Colophon
The animal on the cover of Modern Data Analytics in Excel is a
Hercules beetle (Dynastes hercules), a species of rhinoceros beetle
found in the rainforests of Central and South America, as well as
some Caribbean islands.
These titans of the insect world are the longest beetles on Earth,
with males reaching up to 7 inches in length, including their horns,
used for grappling with other males for dominance (females lack
horns entirely). Hercules beetles come in shades of olive green or
brownish yellow, sometimes with an iridescent sheen, and have
small black spots scattered across their bodies. These colors can
shift based on humidity.
Despite their impressive size, Hercules beetles are generally
harmless insects. They feed on rotting fruit and tree sap, and the
impressive strength of their horns, which allows them to lift objects
hundreds of times their own weight, comes in handy when searching
for food or burrowing in the forest floor.
Hercules beetles are not currently considered an endangered
species. However, habitat loss due to deforestation is a threat to
their populations. Many of the animals on O’Reilly covers are
endangered; all of them are important to the world.
The cover illustration is by Karen Montgomery, based on an antique
line engraving from Goldsmith’s Natural History. The series design is
by Edie Freedman, Ellie Volckhausen, and Karen Montgomery. The
cover fonts are Gilroy Semibold and Guardian Sans. The text font is
Adobe Minion Pro; the heading font is Adobe Myriad Condensed;
and the code font is Dalton Maag’s Ubuntu Mono.

You might also like