0% found this document useful (0 votes)
2 views3 pages

PowerBI Detailed Data Preparation Notes

The document provides detailed notes on data preparation and transformation in Power BI, covering editing data after load, transforming data before loading, and various dataset shaping operations. It includes techniques for sorting, filtering, data cleansing, grouping, and extending data, as well as creating calculated and custom columns. Additionally, it explains join types and methods for expanding and aggregating data within Power BI.

Uploaded by

RN
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views3 pages

PowerBI Detailed Data Preparation Notes

The document provides detailed notes on data preparation and transformation in Power BI, covering editing data after load, transforming data before loading, and various dataset shaping operations. It includes techniques for sorting, filtering, data cleansing, grouping, and extending data, as well as creating calculated and custom columns. Additionally, it explains join types and methods for expanding and aggregating data within Power BI.

Uploaded by

RN
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Power BI: Detailed Notes on Data Preparation and Transformation

1. Editing Data After Load:

- Once data is loaded into Power BI Desktop, users can use the Data View to create calculated

columns and measures using DAX.

- Existing data values can also be changed, and column names can be renamed directly in the

model.

2. Transforming Data Before Loading:

- Power Query Editor allows users to clean and shape the data before it's loaded into the data

model.

- This helps improve performance and maintain a clean data model.

3. Dataset Shaping Operations:

- Renaming Columns: Double-click column headers to give meaningful names.

- Reordering Columns: Drag and place columns in desired sequence.

- Removing Columns: Delete unnecessary data to reduce model size.

- Merging Columns: Combine first and last names into one full name column.

- Duplicating Columns: Create a duplicate to preserve the original before transformation.

- Splitting Columns: Separate full name into first and last name using space as a delimiter.

- Removing Records: Delete rows that aren't needed like test entries.

- Removing Duplicates: Keep only unique rows based on one or more columns.

4. Sorting and Filtering:

- Sorting Data: Organize values alphabetically or by numeric order.

- Reversing Row Order: Flip the data order from bottom to top.

- Filtering Data: Keep only specific rows using dropdowns or range filters.
5. Data Cleansing:

- Viewing Full Record: Examine complete row content for validation.

- Changing Data Type: Ensure numbers are not stored as text.

- Detecting Data Types: Auto-recognize correct format for each column.

- Replacing Values: Change all instances of "N/A" to "Not Available".

- Transforming Column Contents: Convert all text to lowercase for consistency.

- Fill Down: Copy the value from the cell above into empty cells in a column.

- Using First Row as Headers: Promote row one as the header row.

6. Grouping and Extending Data:

- Grouping Records: Aggregate values by category (e.g., sales by region).

- Appending Data: Combine two tables with similar columns vertically.

- Merging Data: Join tables horizontally based on a matching column.

- Adding Data During Merge: Include new columns like region info when merging.

- Aggregating During Merge: Sum up sales during the merge by customer.

7. Calculated and Custom Columns:

- Calculated Columns: Use formulas like "Total = Quantity * Price".

- Custom Columns: Write your own formula logic using Power Query M language.

- Index Columns: Add a running number to each row for reference.

8. Joins and Relationships:

- Join Types:

- Left Join: All from left + matching right.

- Right Join: All from right + matching left.

- Inner Join: Only matching from both.


- Full Join: All rows from both.

- Join on Multiple Columns: Match records using combined keys (e.g., FirstName + LastName).

- Preparing Datasets: Ensure columns are cleaned and data types match.

- Correct Joins: Verify matched results to avoid mismatches or data loss.

- Examining Joined Data: Inspect join output to ensure expected results.

9. Expand and Aggregate:

- Expand: Pull in fields from related tables post-merge.

- Aggregate: Apply functions like sum, count, average when expanding related data.

You might also like