Minor Unit 3-5 13 Marks Merged

Unit 3 13 marks

5. How would you use table calculations in Tableau to compare year-over-year growth
rates across multiple product categories? Explain the process and the considerations
involved in ensuring accurate calculations and meaningful visualizations.
Steps to Compare YoY Growth Rates in Tableau
1. Data Preparation:
o Ensure your data includes Date (Year), Product Category, and Sales metrics.
o Confirm the date field is recognized as a date type.
2. Create YoY Growth Calculation:
o Create a calculated field named YoY Growth with the formula below (a guarded version is sketched after these steps):
(SUM([Sales]) - LOOKUP(SUM([Sales]), -1)) / LOOKUP(SUM([Sales]), -1)
3. Build the Visualization:
o Drag Year to Columns and Product Category to Rows.
o Add the YoY Growth field to Text or Rows.
4. Apply Table Calculations:
o Right-click on YoY Growth and select Quick Table Calculation → Year Over
Year Growth.
o Edit the calculation to compute across the table as needed.
5. Ensure Accurate Calculations:
o Check for data integrity and missing values.
o Be mindful of any filters affecting the context of calculations.
6. Format for Clarity:
o Use color coding to differentiate growth rates (positive/negative).
o Add labels and tooltips for better clarity.
7. Create a Dashboard:
o Combine worksheets into a dashboard for a holistic view.
o Add filters for interactivity to focus on specific categories or time periods.
8. Review and Iterate:
o Review visualizations for clarity and effectiveness.
o Gather feedback for improvements.
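For reference, a guarded version of the YoY Growth field from step 2 avoids dividing by a missing prior year; a minimal sketch using the same Sales field (the IF form without an ELSE simply returns Null for the first year in the partition):

// YoY Growth with a guard for the first year in the partition
IF NOT ISNULL(LOOKUP(SUM([Sales]), -1)) AND LOOKUP(SUM([Sales]), -1) <> 0 THEN
    (SUM([Sales]) - LOOKUP(SUM([Sales]), -1)) / LOOKUP(SUM([Sales]), -1)
END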
Considerations
 Be aware of seasonality in your data.
 Tailor visualizations to your audience's understanding.
This process enables effective analysis and visualization of YoY growth rates across product
categories in Tableau.

6. Discuss how you would customize data in Tableau to create a calculated field that
segments customers into different loyalty tiers based on their purchase history. What
steps would you take, and how would you ensure the segmentation is both accurate
and actionable?
Steps to Segment Customers into Loyalty Tiers in Tableau
1. Data Preparation:
o Ensure your dataset includes fields like Customer ID, Purchase Amount, and
Purchase Date.
o Aggregate total purchases per customer.
2. Create Calculated Fields:
o Total Spend:
SUM([Purchase Amount])
o Purchase Frequency (if needed):
COUNT([Purchase Date])
3. Define Loyalty Tiers:
o Establish criteria for tiers (e.g., Bronze: < $500, Silver: $500-$1,000, Gold:
$1,000-$2,500, Platinum: > $2,500).
4. Create Loyalty Tier Field:
o Create a calculated field Loyalty Tier (a customer-level LOD variant is sketched after these steps):
IF [Total Spend] < 500 THEN 'Bronze'
ELSEIF [Total Spend] < 1000 THEN 'Silver'
ELSEIF [Total Spend] < 2500 THEN 'Gold'
ELSE 'Platinum' END
5. Visualize Segments:
o Drag Loyalty Tier to Rows and measures like Count of Customers to visualize
distribution.
6. Analyze and Validate:
o Create cross-tabs to check customer counts in each tier and ensure no outliers skew results.
7. Make Segmentation Actionable:
o Identify target groups for marketing campaigns based on segments.
o Monitor trends over time to refine criteria.
8. Create a Dashboard:
o Combine visualizations of loyalty segments and trends, adding filters for
interactivity.
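As referenced in step 4, the tier logic is more robust if the per-customer total is fixed with an LOD expression, so it does not change with the layout of the view; a minimal sketch, assuming the first calculation is saved as Customer Total Spend:

// Customer Total Spend (fixed per customer, independent of the view)
{ FIXED [Customer ID] : SUM([Purchase Amount]) }

// Loyalty Tier, based on the fixed total
IF [Customer Total Spend] < 500 THEN 'Bronze'
ELSEIF [Customer Total Spend] < 1000 THEN 'Silver'
ELSEIF [Customer Total Spend] < 2500 THEN 'Gold'
ELSE 'Platinum' END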

Considerations
 Data Quality: Ensure clean and accurate data.
 Regular Updates: Adjust criteria based on changing customer behavior.
 Feedback and Testing: Gather stakeholder feedback and continuously test the
effectiveness of segmentation.
By following these steps, you can effectively segment customers into loyalty tiers in Tableau,
ensuring the segmentation is accurate and actionable.
7. When working with data of varying granularity (e.g., daily sales data versus monthly
targets), how would you structure your Tableau dashboard to accurately represent the
data without misleading the audience? Discuss your strategies for harmonizing
different levels of granularity and ensuring clear communication.
When working with data of varying granularity in Tableau, such as daily sales data versus
monthly targets, structuring your dashboard effectively is crucial to accurately represent the
information and avoid misleading the audience. Here are strategies for harmonizing different
levels of granularity and ensuring clear communication:
1. Understand Data Relationships
 Define Granularities: Clearly identify the granularity of each dataset (e.g., daily
sales vs. monthly targets).
 Aggregation: Determine how to aggregate the daily sales data to align with the
monthly targets. For instance, you might sum daily sales to get a monthly total.
2. Create Calculated Fields for Aggregation
 Monthly Date Field: Create a calculated field that truncates each sale date to its month, so daily sales can be rolled up to monthly totals:
DATETRUNC('month', [Sale Date])
 Use Level of Detail (LOD) Expressions: Use LOD expressions to control the granularity of calculations. For example, to compute monthly sales totals regardless of other dimensions in the view:
{ FIXED DATETRUNC('month', [Sale Date]) : SUM([Sales]) }
3. Design the Dashboard Layout
 Use Consistent Timeframes: Ensure that all time-based visuals (e.g., line charts for
sales) share the same granularity. For example, use monthly views for both daily
sales aggregated to the month and monthly targets.
 Separate Visuals for Different Granularities: If you need to display both daily and
monthly data, use separate visuals. For instance, a line chart for daily sales and a
bar chart for monthly targets can help clarify differences.
4. Incorporate Clear Labels and Annotations
 Clear Titles and Legends: Clearly label each visual to indicate the data granularity it
represents (e.g., "Monthly Sales" vs. "Daily Sales").
 Annotations: Use annotations to provide context or explanations for specific data
points, especially where discrepancies may arise between daily and monthly data.

5. Use Tooltips for Additional Information


 Detailed Tooltips: Implement tooltips that provide detailed information when users
hover over data points. For instance, hovering over a monthly total could show the
breakdown of daily sales contributing to that total.
6. Include Contextual Metrics
 KPIs and Ratios: Include key performance indicators (KPIs) or ratios (e.g., % of
monthly target achieved) to give a quick overview of performance against goals,
aiding in interpretation.
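For example, a minimal Tableau calculation for the percentage of the monthly target achieved could look like the following ([Monthly Target] is an assumed field name, not one defined elsewhere in these notes):

// % of Monthly Target Achieved (format the field as a percentage)
SUM([Sales]) / SUM([Monthly Target])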
7. Visual Encoding
 Consistent Color Schemes: Use consistent colors to represent the same concepts
across different visuals (e.g., sales in blue and targets in orange).
 Chart Types: Choose appropriate chart types that convey the granularity effectively
(e.g., line charts for trends, bar charts for comparisons).
8. Allow for Interactive Exploration
 Filters and Parameters: Implement filters or parameters that allow users to choose
the granularity they want to view (e.g., daily, weekly, monthly). This can enhance user
engagement and understanding.
 Drill-down Options: Allow users to drill down from monthly views to daily details if
needed, providing flexibility in data exploration.
9. Test and Iterate
 User Feedback: Share the dashboard with stakeholders and gather feedback on
clarity and usability. Make adjustments based on their input.
 Monitor Usage: Observe how users interact with the dashboard to identify areas for
improvement.
Conclusion
By applying these strategies, you can create a Tableau dashboard that effectively
harmonizes data of varying granularity. This approach ensures accurate representation,
enhances clarity, and fosters meaningful communication with the audience, enabling them to
make informed decisions based on the insights presented.

8. Imagine you are tasked with designing a Tableau dashboard for a company’s
executive team. The dashboard needs to be both comprehensive and user-friendly.
Discuss your approach to selecting the key metrics, visualizations, and interactivity
features that would best serve the needs of this audience.
Designing a Tableau dashboard for a company's executive team requires a strategic
approach to ensure that it is comprehensive, user-friendly, and tailored to the decision-
makers' needs. Here’s how to approach this task:
1. Understand Executive Needs

 Identify Objectives: Understand the specific goals and questions of the executive
team.
 Focus on Key Metrics: Select 5-10 high-impact KPIs relevant to decision-making,
such as revenue growth, profit margins, customer retention rates, and operational
efficiency.
2. Select Key Metrics
 Prioritize Metrics: Choose metrics that allow for trend analysis and segment data by
departments or regions for deeper insights.
3. Choose Effective Visualizations
 Appropriate Chart Types: Use:
o Bar/Column Charts: For category comparisons (e.g., revenue by region).
o Line Charts: For trends over time (e.g., sales growth).
o Pie/Donut Charts: For proportions (e.g., market share).
o Heat Maps: For identifying patterns (e.g., sales performance).
 Dashboard Layout: Organize related metrics logically, with summaries at the top
and detailed visuals below.
4. Enhance User Interactivity
 Filters and Parameters: Allow customization (e.g., by time period or department).
 Drill-down Capabilities: Enable users to click for detailed views.
 Tooltips: Provide additional context for data points on hover.
5. Emphasize Clarity and Usability
 Clean Design: Use a minimalistic layout to avoid distractions.
 Clear Labels and Legends: Ensure all visuals are labeled for easy interpretation.
 Dynamic Titles: Reflect any applied filters in titles for context.
6. Test and Iterate
 Gather Feedback: Share a prototype with executives to get their input on usability.
 Iterate: Make adjustments based on feedback to enhance the dashboard.
7. Provide Contextual Insights
 Annotations: Highlight significant trends or anomalies in the data.
 Summary Insights: Include a section with key takeaways to aid strategic
discussions.
Conclusion
By implementing these strategies, you can create a Tableau dashboard that is
comprehensive, user-friendly, and tailored to the executive team's needs, enabling informed
decision-making.
Unit 4 13&15 marks

1. Discuss the difference between assigning values using <-, =, and assign() in R.
Provide examples for each method and explain when each method would be
most appropriate in practice.
In R, there are multiple ways to assign values to variables: <-, =, and the assign() function.
Each has its own use cases and slight differences. Let's break down each one with
examples and discuss when to use each.
1. Using <- for Assignment
The <- operator is the conventional assignment operator in R. It assigns a value to a
variable, allowing you to assign data to a name that can be referenced later.
Example:
x <- 10
y <- c(1, 2, 3, 4, 5)
Explanation:
 Here, x is assigned the value 10, and y is assigned a vector of numbers.
 <- is the most common way to assign values and is considered idiomatic in R
programming.
When to use <-:
 Use <- for most variable assignments, as it improves readability and follows R's
typical style.
2. Using = for Assignment
The = operator can also assign values to variables and is functionally similar to <- in most
contexts. However, it is more commonly used within function arguments.
Example:
x = 20
z = c("apple", "banana", "cherry")
Explanation:
 x is assigned the value 20, and z is assigned a character vector.
 In this example, = works the same as <-.
When to use =:
 = is typically used within function arguments rather than for general variable
assignment. For example, in plot(x = 1:10, y = 10:1), x and y are arguments within
the function call.

3. Using assign() for Assignment



The assign() function allows you to assign a value to a variable name given as a string. This
can be useful for programmatically generating variable names or dynamically assigning
values in loops.
Example:
assign("a", 30)
assign("data_frame", data.frame(col1 = 1:5, col2 = 6:10))
Explanation:
 Here, a is assigned the value 30, and data_frame is assigned a data frame.
 Since assign() takes a string as its first argument, it is particularly helpful for
assigning values to variable names created dynamically.
When to use assign():
 assign() is useful when you need to generate variable names dynamically, such as
within loops or in programmatically created functions.
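A small sketch of that dynamic-naming pattern (the result_1, result_2, ... names are purely illustrative); get() retrieves a variable by its name given as a string:

# Create result_1, result_2, result_3 programmatically, then read one back with get()
for (i in 1:3) {
  assign(paste0("result_", i), i^2)
}
print(get("result_2")) # 4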
Summary Table

Method     Usage Example        When to Use
<-         x <- 10              For most assignments, as it follows R's typical style.
=          x = 10               Preferably in function arguments, but also works for general assignment.
assign()   assign("x", 10)      When variable names need to be generated dynamically, especially in loops.
In general, sticking to <- for regular assignments and = for function arguments will make your
code more readable and consistent with R's style.
2. Given a dataset of customer survey ratings from 1 to 5, create a factor in R to
categorize the ratings as "Poor", "Average", "Good", "Very Good", and
"Excellent". Explain how factors improve data analysis and why they are used
instead of character vectors in such cases.
To categorize customer survey ratings from 1 to 5 as "Poor", "Average", "Good", "Very
Good", and "Excellent" in R, you can use factors. Factors are categorical variables that allow
us to handle, order, and analyze categorical data effectively. Let's go through the steps to
create a factor and discuss its benefits.
Step 1: Creating the Factor in R
Assume we have a vector ratings representing customer survey ratings ranging from 1 to 5.
# Sample dataset of customer survey ratings
ratings <- c(1, 3, 2, 5, 4, 3, 1, 2, 5, 4)
# Define labels for each rating category
rating_labels <- c("Poor", "Average", "Good", "Very Good", "Excellent")

# Convert ratings into a factor with ordered levels
rating_factor <- factor(ratings, levels = 1:5, labels = rating_labels, ordered = TRUE)

Explanation of Code
 factor() function: This function converts ratings into a factor. By specifying levels =
1:5, we ensure each rating corresponds to one of the labels.
 labels argument: This maps the numerical ratings (1 to 5) to descriptive categories
("Poor" to "Excellent").
 ordered = TRUE: Setting this to TRUE makes it an ordered factor, meaning R
recognizes the sequence from "Poor" to "Excellent" as meaningful in terms of
increasing value.
Output
Now, rating_factor will look like this:
[1] Poor Good Average Excellent Very Good Good Poor Average Excellent
Very Good
Levels: Poor < Average < Good < Very Good < Excellent

Why Factors Improve Data Analysis


1. Memory Efficiency: Factors are stored as integer vectors with levels, making them
more memory-efficient than character vectors, especially with large datasets.
2. Descriptive Levels: Factors allow each level to have a label, which improves
readability and interpretation without changing the underlying integer structure.
3. Data Integrity: Factors ensure that only predefined levels are used, which helps
avoid accidental typos or inconsistent labels (e.g., "Poor" vs. "poor").
4. Order & Ranking: When factors are ordered, you can perform meaningful
comparisons, such as greater-than or less-than operations on categorical data.
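A short illustration of points 3 and 4, reusing the rating_factor created above:

table(rating_factor)        # counts per category, reported in the defined level order
rating_factor > "Average"   # element-wise comparison is meaningful because the factor is ordered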
Why Use Factors Instead of Character Vectors
Using factors instead of character vectors in categorical data analysis has several
advantages:
 Enforced Categories: Factors enforce specific categories, reducing the risk of
misspelled or inconsistent entries.
 Statistical Modeling: Factors are integral in many statistical functions and models
(e.g., ANOVA, regression) in R. These functions can handle factors more effectively
than raw character vectors.
 Analysis & Visualization: When visualizing or summarizing categorical data, factors help group and order categories automatically, making analysis smoother and results more intuitive.
In this context, using a factor for survey ratings provides a structured way to analyze the
ratings with descriptive labels and ensures that categories like "Poor" or "Excellent" are
consistently represented and ranked properly.
3. You are tasked with storing multiple datasets (a numeric vector, a character
vector, and a matrix) in a single object for later use. Explain how you would
achieve this using a list in R. Write R code to create the list and demonstrate
how to access each individual dataset from the list.
In R, lists are a flexible data structure that can hold multiple types of data within a single
object, making them ideal for storing datasets of different types (such as a numeric vector,
character vector, and matrix). Lists allow us to access each element by name or by index.
Here’s how you can create a list containing a numeric vector, a character vector, and a
matrix, and how to access each individual dataset.
Step 1: Create the Individual Datasets
# Numeric vector
numeric_vector <- c(10, 20, 30, 40, 50)

# Character vector
character_vector <- c("apple", "banana", "cherry", "date", "elderberry")

# Matrix
matrix_data <- matrix(1:9, nrow = 3, ncol = 3)
Step 2: Store the Datasets in a List
You can combine the datasets into a list, with each dataset assigned a descriptive name.
# Create a list to store the datasets
data_list <- list(
numbers = numeric_vector,
fruits = character_vector,
matrix = matrix_data
)
Step 3: Access Each Dataset in the List

You can access each dataset within the list either by name or by index.
# Access the numeric vector
print(data_list$numbers) # By name
print(data_list[[1]]) # By index

# Access the character vector
print(data_list$fruits) # By name
print(data_list[[2]]) # By index

# Access the matrix
print(data_list$matrix) # By name
print(data_list[[3]]) # By index
Output Explanation
 data_list$numbers and data_list[[1]] both access the numeric vector stored in the
list.
 data_list$fruits and data_list[[2]] access the character vector.
 data_list$matrix and data_list[[3]] access the matrix.
Why Lists Are Suitable for Storing Different Data Types
 Flexibility: Lists can hold any R object, allowing for a mix of different data structures.
 Named Elements: By naming each element, you can easily identify and retrieve
datasets without having to remember their index position.
 Extendibility: You can add more datasets or remove items as needed, which makes
lists ideal for storing diverse, dynamically changing data.
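A brief sketch of that extendibility, continuing from the data_list created above:

# Add a new element and remove an existing one
data_list$scores <- c(95, 87, 78) # append a new named element
data_list$fruits <- NULL          # remove the character vector
names(data_list)                  # "numbers" "matrix" "scores"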
Using a list in this way keeps your data organized and accessible for analysis or further
processing in a single, cohesive object.
4. Explain the differences between various data classes in R such as numeric,
integer, character, and factor. Provide an example of converting between these
classes and discuss how improper class usage can impact the analysis.
In R, data can be stored in various classes, each with distinct characteristics suited for
different types of analysis. Here, we’ll look at the main classes: numeric, integer,
character, and factor, explain their differences, and examine how improper class usage
might affect data analysis.

1. Numeric
 Description: Represents real numbers, including decimal values.
 Usage: Numeric class is used for continuous, quantitative data.
 Example:
numeric_var <- 42.5 # A decimal or floating-point number


2. Integer
 Description: Represents whole numbers.
 Usage: The integer class is used for discrete, whole-number values.
 Example:
integer_var <- 42L # The 'L' suffix makes it an integer
Note: In R, numbers are treated as numeric by default unless explicitly defined as integers
using the L suffix or as.integer() function.
3. Character
 Description: Stores text data or strings.
 Usage: Character class is used for categorical data without a specific order, such as
names, labels, or descriptions.
 Example:
character_var <- "Hello, World!"
4. Factor
 Description: A categorical data type with a predefined set of levels. Factors can be
ordered or unordered, and they are often used to represent categorical variables in
R.
 Usage: Useful for data with a limited set of possible values (e.g., ratings, categories).
 Example:
factor_var <- factor(c("Low", "Medium", "High"),
                     levels = c("Low", "Medium", "High"), ordered = TRUE)
Converting Between Classes
R allows conversion between these classes using functions like as.numeric(), as.integer(),
as.character(), and as.factor().
Example of Conversion:
# Starting with a character vector
char_vector <- c("1", "2", "3")
# Convert to numeric
num_vector <- as.numeric(char_vector)

# Convert to integer
int_vector <- as.integer(num_vector)

# Convert to factor
factor_vector <- as.factor(char_vector)

# Convert back to character
back_to_char <- as.character(factor_vector)
Impact of Improper Class Usage on Analysis
Using an inappropriate class can cause several issues in analysis:
1. Numeric and Integer Confusion: Treating integers as numeric can lead to
unintentional rounding or truncation errors, especially in loops or aggregations where
exact counts matter.
2. Character vs. Factor: If categorical data is stored as a character instead of a factor,
it may lead to incorrect grouping in analyses and visualizations, as characters are not
inherently treated as categories.
o Example: A character vector of city names (c("Paris", "London", "New York"))
will not be treated as a categorical variable unless it is converted to a factor,
impacting analyses like frequency counts or modeling.
3. Improper Factor Usage: Factors with an incorrect order (e.g., months stored as
factor(c("March", "January", "February")) without specifying the correct order) can
lead to misleading visualizations or analysis outputs, as R might use the default
alphabetical order.
4. Coercion Errors: Converting non-numeric characters to numeric (e.g.,
as.numeric(c("apple", "banana"))) results in NA values, potentially causing
unexpected errors in calculations.
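Two of these pitfalls are easy to demonstrate directly in R:

# Point 3: months default to alphabetical level order unless levels are specified
months <- factor(c("March", "January", "February"))
levels(months)                                        # "February" "January" "March"
months <- factor(months, levels = c("January", "February", "March"))

# Point 4: coercing non-numeric text produces NA values (with a warning)
as.numeric(c("apple", "banana"))                      # NA NA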
Understanding these differences is crucial to ensure that data is treated appropriately in R,
thus improving the accuracy and interpretability of your analysis.
5. Write an R program using a for loop that calculates the factorial of a given
number n. Then, discuss the time complexity of your solution and compare it
with a recursive implementation of the factorial function. Provide the pros and
cons of using loops vs. recursion in R.
Here's an R program that calculates the factorial of a given number n using a for loop.

Factorial Calculation Using a For Loop in R


factorial_loop <- function(n) {
  if (n == 0) return(1)  # 0! = 1; without this guard, 1:0 would iterate over c(1, 0)
  result <- 1
  for (i in 1:n) {
    result <- result * i
  }
  return(result)
}

# Example usage
n <- 5
factorial_result <- factorial_loop(n)
print(factorial_result) # Output should be 120 for n = 5
Time Complexity of the Loop Solution
The time complexity of the loop-based factorial calculation is O(n), as the for loop iterates from 1 to n, multiplying each value with the result. The space complexity is O(1) because it uses a single variable (result) for storing the factorial, making this solution memory-efficient.
Recursive Implementation of the Factorial Function
Here's how you can compute the factorial using recursion in R.
factorial_recursive <- function(n) {
  if (n <= 1) {
    return(1)
  } else {
    return(n * factorial_recursive(n - 1))
  }
}
# Example usage
factorial_result <- factorial_recursive(n)
print(factorial_result) # Output should be 120 for n = 5
Time Complexity of the Recursive Solution
The time complexity of the recursive solution is also O(n), since there are n recursive calls. However, its space complexity is O(n) due to the stack memory used to hold each recursive call until the base case is reached.
Pros and Cons of Using Loops vs. Recursion in R
Pros of Using Loops
1. Memory Efficiency: Loops use constant memory, O(1), for calculations, as they do not require stack memory for multiple function calls.
2. Performance: In R, iterative solutions are generally faster than recursive ones due to
the overhead of function calls in recursion.
3. Easy to Understand: Loops are often more straightforward for beginners to
understand and debug.

Cons of Using Loops


1. Complexity in Certain Problems: For problems that are inherently recursive (like
tree traversals), loops can make the code more complex and harder to understand.
2. Limited Expressiveness: Loops may be less intuitive or natural for tasks that benefit
from recursion, such as divide-and-conquer algorithms.
Pros of Using Recursion
1. Elegant Solution for Recursive Problems: Recursion is a more natural choice for
problems that can be broken down into subproblems, such as factorials, Fibonacci
sequences, and data structure traversal.
2. Simplified Code: Recursive solutions can be more concise and expressive, reducing
the need for managing indices or counters.
Cons of Using Recursion
1. Stack Overflow Risk: For large values of n, recursion can lead to stack overflow errors, as each recursive call consumes memory on the stack.
2. Increased Memory Usage: Recursive calls add to the call stack, leading to higher memory consumption (space complexity O(n)).
3. Performance Overhead: Recursive calls incur additional function call overhead,
making recursion slower than loops for high-iteration tasks.
Summary
 Loops: Preferred for iterative tasks and better memory efficiency in R.
 Recursion: More natural for recursive problems, but risks inefficiency in R due to
stack usage and potential overhead for each function call.
Choosing between loops and recursion depends on the problem's nature and R's memory
and performance constraints. For calculating factorials, the loop solution is generally
preferred in R due to its simplicity and lower memory usage.
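For completeness, base R also offers a vectorized and a built-in route to the same result, which are usually the most practical choices outside of an exercise:

prod(1:5)    # 120; vectorized product, no explicit loop
factorial(5) # 120; base R's built-in factorial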

6. In data science, decisions often depend on multiple conditions. Write a
program in R that categorizes a person’s BMI (Body Mass Index) as
"Underweight", "Normal weight", "Overweight", or "Obesity" based on given
BMI values. Use nested if-else statements and explain how decision support
systems improve real-world data analysis.
Here’s an R program that categorizes a person’s Body Mass Index (BMI) using nested if-
else statements. The program takes a BMI value as input and categorizes it into one of four
classifications: "Underweight", "Normal weight", "Overweight", or "Obesity".
BMI Categorization Program in R
# Function to categorize BMI
categorize_bmi <- function(bmi) {
  if (bmi < 18.5) {
    category <- "Underweight"
  } else if (bmi < 25) {
    category <- "Normal weight"
  } else if (bmi < 30) {
    category <- "Overweight"
  } else {
    category <- "Obesity"
  }
  return(category)
}

# Example usage
bmi_values <- c(17.5, 22.0, 27.5, 32.0) # Sample BMI values
for (bmi in bmi_values) {
  category <- categorize_bmi(bmi)
  cat("BMI:", bmi, "- Category:", category, "\n")
}
Explanation of the Code
 Function categorize_bmi: This function takes a BMI value as input and categorizes
it using nested if-else statements.
o If the BMI is less than 18.5, it is categorized as "Underweight".
o If the BMI is at least 18.5 but below 25, it is categorized as "Normal weight".
o If the BMI is at least 25 but below 30, it is categorized as "Overweight".
o If the BMI is 30 or greater, it is categorized as "Obesity".
 Example Usage: The program processes a vector of sample BMI values and prints
the corresponding categories.
Output
When the example code is executed, the output will look like this:
BMI: 17.5 - Category: Underweight
BMI: 22 - Category: Normal weight
BMI: 27.5 - Category: Overweight
BMI: 32 - Category: Obesity
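For reference, the same categorization can be written without an explicit loop using base R's cut() function; a minimal sketch reusing bmi_values from the example above:

bmi_categories <- cut(bmi_values,
                      breaks = c(-Inf, 18.5, 25, 30, Inf),
                      labels = c("Underweight", "Normal weight", "Overweight", "Obesity"),
                      right = FALSE) # right = FALSE makes each interval [lower, upper)
print(bmi_categories)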


Decision Support Systems and Their Role in Data Analysis
Decision support systems (DSS) are computer-based tools that help individuals make
informed decisions based on data analysis. They integrate data from various sources,
provide analytical capabilities, and present the information in a user-friendly format. Here are
some ways in which DSS improve real-world data analysis:
1. Informed Decision-Making: DSS provides data-driven insights, allowing decision-
makers to evaluate options based on empirical evidence rather than intuition alone.
This leads to more rational and informed choices.
2. Handling Complexity: Many real-world problems involve multiple conditions and
variables. Decision support systems can analyze complex datasets and present
simplified results, making it easier for users to understand and act upon.
3. Scenario Analysis: DSS allows users to simulate different scenarios and outcomes
based on varying inputs. For example, a health DSS can model the impact of lifestyle
changes on BMI and suggest personalized recommendations.
4. Improved Accuracy: By relying on systematic data analysis, decision support
systems reduce the likelihood of human error, thereby improving the accuracy of
conclusions and actions taken.
5. Real-Time Data Processing: Modern DSS can process real-time data, enabling
organizations to make timely decisions that reflect current conditions and trends.
6. User-Friendly Interfaces: Many DSS incorporate visualizations and dashboards that
help users quickly grasp insights, enhancing the overall decision-making process.
Conclusion
The categorization of BMI using nested if-else statements in R serves as a simple example
of how decision-making can be automated based on conditions. In the broader context of
data science, decision support systems empower users by providing insights that improve
decision quality, efficiency, and effectiveness, making them invaluable tools in various fields,
including healthcare, finance, and business strategy.
7. Write a program in R that prompts a user to enter a numeric grade (0-100) and
returns the corresponding letter grade (A-F) using if-else statements. Then,
modify the program to handle cases where input is missing, not a number, or out
of range using is.na(), is.numeric(), and validation checks. Explain how error
handling improves code reliability.
Here's an R program that prompts a user to enter a numeric grade (0-100) and returns the
corresponding letter grade (A-F) using if-else statements. The program also includes error
handling to manage cases where the input is missing, not a number, or out of the valid range.
R Program for Grade Conversion
# Function to convert numeric grade to letter grade
convert_grade <- function(grade) {
  if (is.na(grade) || !is.numeric(grade)) {
    return("Invalid input: Please enter a numeric value.")
  }
  if (grade < 0 || grade > 100) {
    return("Invalid input: Grade must be between 0 and 100.")
  }
  if (grade >= 90) {
    return("A")
  } else if (grade >= 80) {
    return("B")
  } else if (grade >= 70) {
    return("C")
  } else if (grade >= 60) {
    return("D")
  } else {
    return("F")
  }
}

# Prompt user for input
user_input <- readline(prompt = "Enter a numeric grade (0-100): ")

# Try to convert input to numeric (non-numeric text such as "abc" becomes NA)
numeric_grade <- as.numeric(user_input)

# Call the function and print the result
grade_result <- convert_grade(numeric_grade)
cat("Letter Grade:", grade_result, "\n")
Explanation of the Code

1. Function convert_grade: This function takes a numeric grade as input and checks for
validity before converting it to a letter grade.

o Input Validation:
 It first checks if the input is NA or not numeric using is.na() and
is.numeric().

 It checks if the grade is within the range of 0 to 100.


o Grade Conversion: If the input is valid, it uses nested if-else statements to
convert the numeric grade to a corresponding letter grade.
2. User Input: The program prompts the user to enter a numeric grade using readline().
The input is then attempted to be converted to numeric using as.numeric().

3. Result Output: Finally, the program calls the convert_grade function and prints the
corresponding letter grade or error message.
Example Usage
 Valid Input: If the user enters 85, the output will be:

Letter Grade: B
 Invalid Input: If the user enters abc, the output will be:
Letter Grade: Invalid input: Please enter a numeric value.
Error Handling and Code Reliability

Error handling is a crucial aspect of programming that enhances the reliability and robustness
of code. Here are some ways it improves code reliability:
1. Graceful Degradation: Instead of crashing or producing incorrect results when faced
with unexpected input, the program provides meaningful error messages, guiding the
user to correct their input.
2. Input Validation: By checking whether the input is numeric and within an acceptable
range, the program ensures that only valid data is processed, reducing the risk of logical
errors during execution.
3. User Experience: Clear error messages improve user experience by helping users
understand what went wrong, thus allowing them to provide valid input without
frustration.
4. Debugging Ease: Error handling can make debugging easier by catching issues at an
early stage, allowing developers to identify problems without extensive troubleshooting.
5. Maintainability: Well-handled errors make the codebase more maintainable, as future
changes can be made with confidence that the program will behave predictably even
when faced with unexpected input.

Conclusion
Incorporating error handling into the R program for converting numeric grades to letter grades
significantly enhances its reliability and usability. This practice is essential in developing robust
applications that can handle real-world user input effectively.
8. Write a function in R that takes a numeric vector as input and returns a vector of
squared values. Then, discuss how writing reusable functions in R can streamline
data analysis workflows, particularly in the context of large datasets or repetitive
tasks.
R Function to Square Values

# Function to square each element of a numeric vector
square_values <- function(num_vector) {
  if (!is.numeric(num_vector)) {
    stop("Input must be a numeric vector.")
  }
  return(num_vector^2) # Square each element
}

# Example usage
numeric_vector <- c(1, 2, 3, 4, 5)
squared_result <- square_values(numeric_vector)
print(squared_result) # Output: [1]  1  4  9 16 25
Benefits of Reusable Functions in Data Analysis

1. Efficiency: Functions encapsulate repetitive tasks, allowing for quicker execution without code duplication.
2. Maintainability: Changes to a function update all instances, making code easier to
manage.
3. Reusability: Functions can be reused across projects, reducing duplication and errors.
4. Testing: Functions can be tested independently, simplifying debugging.
5. Collaboration: Clear functions help team members understand analyses without
needing to review complex code.
6. Scalability: Functions can facilitate efficient batch processing or vectorized operations for large datasets (see the sketch after this list).

7. Readability: Well-named functions improve code clarity and provide context for future
reference.
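A small sketch of that reuse, applying square_values() to every column of a (hypothetical) numeric data frame with sapply():

# Example data frame; sapply() applies the same function column by column
measurements <- data.frame(height = c(1.6, 1.7, 1.8), weight = c(60, 70, 80))
squared_cols <- sapply(measurements, square_values)
print(squared_cols)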
Conclusion
Creating reusable functions in R streamlines data analysis workflows and enhances code
efficiency, maintainability, and readability, which is crucial for handling complex or large
datasets.
Unit 5 13 marks

1. Explain how Power BI can transform raw data into actionable insights for a
business. Provide examples of how different industries could leverage Power
BI.
Power BI is a powerful business analytics tool that enables organizations to visualize their
data and share insights across the organization, or embed them in an app or website. Here’s
how it can transform raw data into actionable insights:
Transformation of Raw Data into Actionable Insights
1. Data Connectivity: Power BI connects to various data sources, including databases,
Excel spreadsheets, cloud services, and even real-time data streams. This capability
allows businesses to aggregate data from multiple sources into a single dashboard.
2. Data Cleaning and Transformation: Power BI offers data transformation features
through Power Query, allowing users to clean, reshape, and combine datasets. This
ensures that the data is accurate and relevant before analysis.
3. Data Visualization: With a wide range of visualization options—such as charts,
graphs, and maps—Power BI helps businesses present complex data in a user-
friendly manner. Interactive visualizations allow users to drill down into data for
deeper insights.
4. Real-Time Analytics: Power BI can provide real-time dashboards that update
automatically as new data comes in. This helps businesses make timely decisions
based on the most current information.
5. Natural Language Query: Users can ask questions in plain language and receive
insights in the form of visuals. This feature democratizes data access, allowing non-
technical users to explore data without needing advanced analytical skills.
6. Collaboration and Sharing: Power BI enables easy sharing of reports and
dashboards among team members, fostering collaboration. Reports can be published
to the Power BI service for wider access and can be integrated into other
applications.
Industry-Specific Use Cases
1. Healthcare:
o Patient Care Analytics: Hospitals can use Power BI to analyze patient data,
track treatment outcomes, and monitor readmission rates. By visualizing this
data, they can identify trends and improve patient care.
o Resource Allocation: Power BI can help hospitals optimize staffing and
resource allocation by analyzing patient flow and peak times, leading to
improved operational efficiency.
2. Retail:
o Sales Performance Monitoring: Retailers can use Power BI to analyze
sales data across different stores, identify top-selling products, and track
customer preferences. This helps in inventory management and targeted
marketing campaigns.
o Customer Behavior Analysis: By visualizing customer purchase patterns, retailers can tailor their offerings and promotions to enhance customer satisfaction and loyalty.
3. Manufacturing:
o Supply Chain Optimization: Manufacturers can leverage Power BI to
monitor supply chain metrics, track production efficiency, and manage
inventory levels. This data helps in identifying bottlenecks and improving
operational efficiency.
o Quality Control: By analyzing defect rates and production anomalies,
manufacturers can identify quality issues early and implement corrective
actions.
4. Finance:
o Financial Reporting: Finance departments can create interactive
dashboards to visualize key performance indicators (KPIs), budget vs. actual
spending, and revenue forecasts. This aids in strategic financial planning and
decision-making.
o Risk Analysis: Power BI can help financial institutions analyze risk exposure
by visualizing data related to loans, investments, and market fluctuations.
5. Education:
o Student Performance Tracking: Educational institutions can use Power BI
to analyze student performance data, attendance rates, and course
completion statistics. This helps educators identify at-risk students and tailor
interventions.
o Resource Management: Schools can monitor resource allocation, including
faculty utilization and facility usage, to enhance operational efficiency.
Conclusion
Power BI transforms raw data into actionable insights by providing powerful data
visualization, real-time analytics, and collaborative tools. Different industries can leverage
these capabilities to improve decision-making, enhance operational efficiency, and drive
business growth. The flexibility and user-friendly interface of Power BI make it an essential
tool for organizations aiming to harness the power of their data.
2. Analyze the role of Power BI architecture in ensuring data security, scalability,
and performance in a business environment. How does the architecture help
large-scale organizations maintain efficient reporting?
Power BI Architecture: Ensuring Security, Scalability, and Performance
1. Data Security
 Row-Level Security (RLS): Allows users to access only relevant data, enhancing
confidentiality and compliance.
 Data Encryption: Protects sensitive data both in transit and at rest.
 Azure Active Directory Integration: Ensures secure authentication and identity
management.

2. Scalability
 Cloud-Based Architecture: Easily scales to accommodate growing data and user
demands without extensive hardware investments.
 Flexible Data Storage: Supports various data storage solutions, allowing
organizations to choose based on needs.
 Incremental Data Refresh: Updates only changed data, optimizing performance and
reducing load times.
3. Performance
 In-Memory Data Storage: Utilizes columnar compression for faster querying and
data retrieval.
 Direct Query and Import Modes: Offers flexibility in accessing data based on
reporting needs.
 Aggregations: Summarizes data for improved query performance and reduced load
on sources.
4. Efficient Reporting for Large Organizations
 Centralized Data Management: Reusable datasets across reports ensure
consistency and reduce redundancy.
 Collaborative Workspaces: Facilitates teamwork while maintaining version control
and security.
 Scheduled Data Refresh: Automates updates for timely access to insights.
 Custom Dashboards: Tailored visualizations provide quick access to relevant
insights for stakeholders.
Conclusion
Power BI’s architecture effectively addresses data security, scalability, and performance,
making it ideal for large organizations. It enhances efficient reporting, supports collaboration,
and ensures reliable insights, empowering informed decision-making.
3. Critically evaluate Power BI in comparison to Tableau and QlikView. Discuss
the strengths and weaknesses of each tool in terms of ease of use, integration
capabilities, and cost-efficiency. Provide a recommendation for different
business scenarios
When evaluating Power BI, Tableau, and QlikView, it's essential to consider their strengths
and weaknesses in terms of ease of use, integration capabilities, and cost-efficiency. Here's
a critical comparison of these three leading business intelligence tools:

Comparison of Power BI, Tableau, and QlikView



Feature: Ease of Use
 Power BI. Strengths: user-friendly, especially for Microsoft users; drag-and-drop functionality; extensive support. Weaknesses: some advanced features have a learning curve.
 Tableau. Strengths: intuitive interface; strong visualization options. Weaknesses: overwhelming for beginners; complex dashboard creation.
 QlikView. Strengths: powerful data exploration; dynamic dashboards. Weaknesses: steeper learning curve; less intuitive interface.

Feature: Integration Capabilities
 Power BI. Strengths: seamless Microsoft integration; supports various data sources. Weaknesses: some third-party integrations require setup.
 Tableau. Strengths: wide data source integration; supports live connections. Weaknesses: complex configurations for some integrations.
 QlikView. Strengths: excellent handling of large datasets; strong data preparation. Weaknesses: less flexibility with certain data sources.

Feature: Cost-Efficiency
 Power BI. Strengths: affordable pricing; free version available. Weaknesses: advanced features may require higher-tier licensing.
 Tableau. Strengths: robust features justify costs for advanced analytics. Weaknesses: more expensive than Power BI and QlikView.
 QlikView. Strengths: comprehensive solution justifying costs. Weaknesses: higher total cost of ownership due to licensing and training.

Recommendation for Different Business Scenarios


1. Small to Medium-Sized Businesses:
Recommendation: Power BI
o Ideal for organizations looking for an affordable, user-friendly solution with
strong integration into Microsoft products.
2. Large Enterprises Needing Advanced Analytics:
Recommendation: Tableau
o Best for businesses that require sophisticated data visualizations and
analytics capabilities and have the budget to support higher costs.
3. Organizations Dealing with Complex Data Models:
Recommendation: QlikView

o Suitable for organizations needing powerful associative data analysis and extensive customization capabilities, especially if they have the resources for training.
Conclusion

Each tool has its strengths and weaknesses, making them suitable for different business
scenarios. Power BI excels in affordability and ease of use, Tableau offers advanced
visualizations for those who can invest more, and QlikView provides robust capabilities for
complex data analysis. The choice should align with the organization's specific needs,
technical expertise, and budget constraints.
4. How would you approach designing an efficient data model in Power BI for a
multi-department organization with diverse data sources? Discuss key
considerations such as relationships, hierarchies, and the use of calculated
tables and columns.

Designing an efficient data model in Power BI for a multi-department organization with diverse
data sources involves careful planning and consideration of several key elements. Here’s a
structured approach:
1. Understand Business Requirements
 Identify Stakeholders: Collaborate with representatives from each department to
gather their reporting needs and understand the data they require.
 Define Metrics: Determine the key performance indicators (KPIs) and metrics that are
relevant across departments, ensuring alignment with organizational goals.
2. Data Source Integration
 Diverse Sources: Connect to various data sources (databases, spreadsheets, cloud
services) while ensuring data quality and consistency. Use Power Query to clean and
transform data as needed.
 Data Types: Ensure that the data types are correctly defined for each field to avoid
issues during calculations and visualizations.
3. Data Relationships
 Establish Relationships: Create relationships between tables based on common
fields (e.g., foreign keys). Use star schema design where possible, with fact tables
(e.g., sales, transactions) connected to dimension tables (e.g., products, customers).
o One-to-Many Relationships: Common in most models, where a single record
in one table relates to multiple records in another.
o Many-to-Many Relationships: Avoid these if possible, as they can complicate
calculations and lead to performance issues. If necessary, consider bridge
tables to manage these relationships effectively.
4. Hierarchies
 Create Hierarchies: Develop hierarchies for dimensions (e.g., geography: Country >
State > City) to enable drill-down capabilities in reports and enhance data exploration.
 Use Default Relationships: Ensure that hierarchies are linked to the appropriate
tables to facilitate navigation and filtering in visuals.

5. Calculated Tables and Columns



 Calculated Columns: Use these for row-level calculations that need to be part of the
data model, such as categorizing data or creating new attributes based on existing
ones (e.g., profit margins).
 Calculated Tables: Create these for aggregating data from multiple sources or for
custom summarization needs (e.g., creating a summary table that combines sales data
from different departments).
 Performance Consideration: Use calculated columns judiciously, as they can
increase model size and complexity. Where possible, use measures for calculations
that do not need to be stored in the model.
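To make the distinction concrete, a minimal DAX sketch; the Sales table with Revenue and Cost columns is an assumed example, not a model defined elsewhere in these notes:

-- Calculated column (evaluated row by row and stored in the model):
Profit Margin = DIVIDE(Sales[Revenue] - Sales[Cost], Sales[Revenue])

-- Measure (evaluated at query time, so it adds no stored data to the model):
Total Profit Margin = DIVIDE(SUM(Sales[Revenue]) - SUM(Sales[Cost]), SUM(Sales[Revenue]))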
6. Performance Optimization
 Data Model Size: Aim to keep the data model compact by removing unnecessary
columns and tables. Use Power BI’s data reduction techniques, like aggregations, to
minimize the dataset size.
 Use Measures Instead of Calculated Columns: Measures are computed on the fly
and do not consume storage, making them more efficient for aggregations and
calculations.
7. Testing and Validation
 Test the Model: After building the model, test relationships and calculations to ensure
accuracy. Validate data against source systems to confirm correctness.
 Feedback Loop: Get feedback from users and stakeholders to refine the model and
ensure it meets reporting needs.
8. Documentation and Maintenance
 Document the Model: Keep a record of data sources, relationships, and measures to
aid future maintenance and onboarding of new users.
 Regular Updates: Establish a process for regularly updating data and refining the
model based on changing business needs or additional data sources.
Conclusion
By carefully considering relationships, hierarchies, calculated tables, and performance
optimization, you can design an efficient data model in Power BI that meets the diverse needs
of a multi-department organization. This structured approach ensures that the model is
flexible, scalable, and capable of delivering meaningful insights across departments.
5. Design a Power BI dashboard for a marketing team aiming to improve customer
engagement. What types of visualizations would you include, and how would
you ensure that the dashboard meets both real-time and historical data needs?
Designing a Power BI dashboard for a marketing team focused on improving customer
engagement involves selecting the right visualizations and ensuring the dashboard effectively
integrates both real-time and historical data. Here's how to approach this:

Dashboard Components
1. Overall Engagement Metrics
o Total Engagement Rate: A card visualization displaying the overall engagement rate (likes, shares, comments) across platforms.
o Engagement Growth Rate: A line chart showing the engagement rate trend
over time (daily, weekly, or monthly).
2. Channel Performance
o Engagement by Channel: A stacked column chart to compare engagement
metrics (likes, shares, comments) across different marketing channels (social
media, email, website).
o Channel Conversion Rates: A donut chart displaying the conversion rates of
each channel to visualize which channels drive the most conversions.
3. Audience Insights
o Demographics Breakdown: A bar chart or pie chart showcasing audience
demographics (age, gender, location) to understand who is engaging with
content.
o Customer Segmentation: A clustered column chart categorizing customers
based on engagement levels (high, medium, low).
4. Campaign Performance
o Top Performing Campaigns: A table or matrix visualization listing campaigns
with key metrics (engagement, reach, ROI).
o Campaign Timeline: A Gantt chart or timeline visualization to display the start
and end dates of campaigns, along with engagement metrics over time.
5. Content Analysis
o Post Engagement: A bar chart showing the engagement of individual posts
(likes, comments, shares) to identify high-performing content.
o Content Type Performance: A horizontal bar chart comparing engagement
rates across different content types (videos, images, blogs).
6. Real-Time Data Integration
o Real-Time Metrics Tiles: Incorporate real-time tiles for metrics like current
website visitors, live engagement counts on social media, or recent campaign
performance.
o Alerts and Notifications: Set up alerts for significant changes in engagement
metrics (e.g., sudden drops or spikes) to enable quick responses.

Ensuring Real-Time and Historical Data Needs


1. Data Connectivity

o Use Direct Query Mode: For real-time data, connect directly to live data
sources (e.g., social media APIs, CRM systems) using Direct Query mode,
which allows users to access up-to-date information without manual refreshes.
2. Scheduled Data Refresh
o Refresh Historical Data: Schedule regular refreshes (e.g., daily or hourly) for
datasets that contain historical information to ensure that users have access to
the latest historical trends.
3. Combined Data Sources
o Integrate Historical and Real-Time Data: Create a data model that combines
both historical data (imported from databases or files) and real-time data
sources. This allows for comprehensive analysis of trends over time alongside
current performance metrics.
4. Dynamic Filters and Slicers
o User-Driven Filters: Include slicers for users to filter data by date range,
marketing channels, or customer demographics, enabling them to view both
historical and real-time data flexibly.
5. User Interaction Features
o Drill-Through and Drill-Down: Enable users to drill down into specific
campaigns or content types for detailed analysis or drill-through to other related
reports.
Conclusion
By incorporating these visualizations and strategies, the Power BI dashboard will provide the
marketing team with a comprehensive view of customer engagement. It will enable them to
analyze historical performance, monitor real-time metrics, and make informed decisions to
improve engagement strategies effectively.
6. Explain how DAX (Data Analysis Expressions) functions can enhance data
analysis in Power BI. Using examples, demonstrate the use of CALCULATE,
FILTER, and RELATED functions to build complex reports.
Enhancing Data Analysis with DAX
DAX is a powerful formula language in Power BI that enables complex calculations and
aggregations, enhancing data analysis. Key functions include:

1. CALCULATE Function
Purpose: Modifies filter context for calculations.
Example: Calculate total sales for the "Electronics" category.

TotalSalesElectronics = CALCULATE(
    SUM(Sales[SalesAmount]),
    Product[Category] = "Electronics"
)
2. FILTER Function
Purpose: Returns a subset of a table based on specified conditions.
Example: Calculate total sales for customers with more than 5 purchases.
TotalSalesHighEngagement = CALCULATE(
    SUM(Sales[SalesAmount]),
    FILTER(
        Customers,
        Customers[PurchaseCount] > 5
    )
)
3. RELATED Function
Purpose: Retrieves a value from a related table by following an existing relationship; it works in a row context, such as a calculated column or inside an iterator.
Example: Add each sale's product category as a calculated column in the Sales table.
Product Category = RELATED(Product[Category])
This category column can then be used for grouping, filtering, or further calculations on the Sales table.
Combined Example
Scenario: Analyze total sales for high-engagement customers in the "Electronics" category.
TotalHighEngagementElectronicsSales = CALCULATE(
    SUM(Sales[SalesAmount]),
    FILTER(
        Customers,
        Customers[PurchaseCount] > 5
    ),
    Product[Category] = "Electronics"
)
Conclusion
DAX functions like CALCULATE, FILTER, and RELATED enhance data analysis in Power BI
by enabling dynamic measures, conditional calculations, and access to related data.

Leveraging these functions allows for tailored and insightful reporting, addressing specific
business questions effectively.
7. Discuss the challenges and best practices for sharing Power BI dashboards
across an organization. How would you ensure data security, version control,
and accessibility for different levels of users?
Sharing Power BI dashboards across an organization presents several challenges, but
following best practices can help ensure a smooth and secure experience. Here’s an overview
of the challenges and recommended best practices for data security, version control, and
accessibility.
Challenges in Sharing Power BI Dashboards
1. Data Security: Ensuring sensitive data is not exposed to unauthorized users can be
complex, especially when dashboards are shared widely.
2. Version Control: Keeping track of different versions of reports and dashboards can
lead to confusion and inconsistencies, particularly when multiple team members are
involved in updates.
3. User Access Levels: Different users may require different access levels (view, edit,
or admin), which complicates permissions management.
4. Performance Issues: High traffic to dashboards can lead to performance degradation,
especially if many users are accessing large datasets simultaneously.
5. Compatibility and Training: Users across the organization may have varying levels
of experience with Power BI, impacting their ability to navigate and utilize dashboards
effectively.
Best Practices
1. Data Security
o Row-Level Security (RLS): Restrict data access based on user roles.
o Secure Workspaces: Use Power BI workspaces with role-based access
controls.
o Access Monitoring: Regularly review user access to sensitive data.
2. Version Control
o Consistent Naming: Use clear versioning in dashboard names (e.g., "Sales
Dashboard v1.0").
o Power BI Features: Utilize versioning tools within Power BI Service.
o Documentation: Maintain records of changes made to dashboards.
3. Accessibility
o Role-Based Access: Assign permissions based on user roles for appropriate
access.
o Mobile Optimization: Ensure dashboards are accessible on mobile devices.
o Training: Provide training sessions for users on dashboard navigation.

4. Performance Optimization
o Optimize Data Models: Simplify models to improve performance.
o Scheduled Refreshes: Keep data current without affecting peak usage.
5. User Feedback
o Collect Feedback: Regularly seek user input for improvements.
o Iterate: Continuously update dashboards based on user needs.
Conclusion
By implementing these best practices, organizations can effectively share Power BI
dashboards while ensuring data security, consistent version control, and accessibility for all
users. Engaging with users for feedback further enhances dashboard utility.
8. In a large-scale enterprise with departments like sales, finance, and HR, explain
how Power BI’s features like data modeling, DAX, and dashboard sharing can
be combined to provide a holistic view of performance. What challenges might
you face, and how would you overcome them?
In a large-scale enterprise with departments like sales, finance, and HR, Power BI can create
a holistic view of performance by leveraging data modeling, DAX, and dashboard sharing.
Here’s how these features can be integrated, along with potential challenges and solutions.
Combining Power BI Features
1. Data Modeling
o Integrated Data Sources: Combine data from various systems (CRM, ERP)
for a unified view.
o Star Schema Design: Use a star schema to simplify relationships between
departments.
o Data Transformation: Clean and transform data in Power Query to ensure
accuracy.
2. DAX (Data Analysis Expressions)
o Calculated Measures: Create KPIs (e.g., revenue growth, turnover rates)
using DAX.
o Time Intelligence: Implement DAX for time-based performance analysis (a sketch follows this list).
o Conditional Analysis: Perform calculations based on specific conditions (e.g.,
sales by region).
3. Dashboard Sharing
o Role-Based Access: Use role-based access controls to protect sensitive
information.
o Collaboration Features: Enable team collaboration through shared
dashboards and comments.
o Real-Time Monitoring: Set up dashboards for continuous performance
tracking.
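As flagged under point 2, a minimal DAX time-intelligence sketch might look like the following; Sales[SalesAmount] follows the earlier examples in this unit, while 'Date'[Date] assumes a marked date table:

-- Year-over-year comparison measures
Total Sales = SUM(Sales[SalesAmount])
Sales Same Period Last Year = CALCULATE([Total Sales], SAMEPERIODLASTYEAR('Date'[Date]))
YoY Growth % = DIVIDE([Total Sales] - [Sales Same Period Last Year], [Sales Same Period Last Year])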

Challenges and Solutions


1. Data Silos
o Challenge: Fragmented data across departments.
o Solution: Establish a centralized data governance strategy to encourage data
sharing.
2. Complex Data Relationships
o Challenge: Managing intricate data connections.
o Solution: Use visual aids for clarity and train data modelers on best practices.
3. User Adoption
o Challenge: Resistance to new tools.
o Solution: Provide training and resources, and gather user feedback to improve
dashboards.
4. Performance Issues
o Challenge: Slow performance with large data sets.
o Solution: Optimize data models and schedule refreshes during off-peak hours.
5. Data Security and Compliance
o Challenge: Ensuring security and regulatory compliance.
o Solution: Implement strict access controls and regularly audit user
permissions.
Conclusion
By integrating Power BI’s data modeling, DAX, and dashboard sharing, enterprises can
achieve a comprehensive view of performance across departments. Addressing challenges
like data silos and user adoption through structured governance and training will enhance
decision-making and efficiency.
