Business Analytics

The document provides an overview of various tools and functionalities in Google Sheets and Power BI, including features like conditional formatting, pivot tables, IF functions, and data uploading in Power BI. It outlines step-by-step instructions for using these features effectively for data analysis and visualization. Additionally, it covers data modeling processes in Power BI, emphasizing the importance of relationships, transformations, and security measures.

Uploaded by

krai7937

S. NO   TOPIC
1       Google Spreadsheet
1.1     Conditional Formatting
1.2     Pivot Table
1.3     IF Then
1.4     Find & Replace
1.5     VLOOKUP
1.6     COUNTIF
2       Power BI
2.1     Introduction & Console Details
2.2     Data Uploading
2.3     Data Modeling
2.4     Graph Structure: Pie Chart, Histogram Chart
3       SPSS
3.1     Loading Data
3.2     T Test: One Sample T Test
3.3     Paired Sample T Test
3.4     Two Sample T Test
3.5     Chi-Square Test
3.6     Cluster Analysis

GOOGLE SPREADSHEET

What is Google Sheets?

Google Sheets is a web-based application that enables users to create, update
and modify spreadsheets and share the data online in real time.

Google's product offers typical spreadsheet features, such as the ability to add,
delete and sort rows and columns. But unlike other spreadsheet programs,
Google Sheets also enables multiple geographically dispersed users to
collaborate on a spreadsheet at the same time and chat through a built-in
instant messaging program. Users can upload spreadsheets directly from their
computers or mobile devices. The application saves every change
automatically, and users can see other users' changes as they are being made.

Google Sheets is included as part of the Google Docs Editors suite of free web
applications. This suite also includes Google Docs, Google Slides, Google
Drawings, Google Forms, Google Sites and Google Keep.
CONDITIONAL FORMATTING

1.1 Conditional formatting is a feature in spreadsheets that applies a format
to cells based on certain conditions. For example, you can use conditional
formatting to highlight cells that contain certain values, or to set a
cell's background colour based on its value.

Using Conditional Formatting in Google Sheets

1. Open Google Sheets

Open the spreadsheet where you want to apply conditional formatting.

2. Select the Range

Highlight the cells or range of cells to which you want to apply conditional formatting.

3. Access Conditional Formatting

Go to the Format menu.

Click on Conditional Formatting.

4. Set the Formatting Rules

A panel will open on the right.

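Under "Format cells if", choose a predefined condition, or select "Custom formula is" for more
advanced rules.

Predefined options: Greater than, Less than, Text contains, etc.

Custom formula: Use formulas like =A1>100 or =AND(A1>10, B1<5). Write the formula for the
top-left cell of the selected range; Google Sheets adjusts the relative references for the
other cells.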

5. Choose the Formatting Style

Set the format you want to apply (text color, background color, bold, etc.) from the "Formatting
style" section.

6. Apply the Rule

Click Done to apply the rule.

You can add more rules for the same range by clicking Add another rule.

7. Test Your Rules

Check the spreadsheet to ensure the rules work as expected. Adjust if necessary.
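
The logic behind the two custom formulas in step 4 can be sketched outside the spreadsheet. The following is only an illustration in Python, not how Google Sheets evaluates rules internally; the sample rows and function names are hypothetical.

```python
# Hypothetical sample rows: (value in column A, value in column B).
rows = [(12, 3), (8, 1), (150, 9), (11, 4)]

def rule_and(a, b):
    """Mirror of the custom formula =AND(A1>10, B1<5)."""
    return a > 10 and b < 5

def rule_gt_100(a, b):
    """Mirror of the custom formula =A1>100 (column B is ignored)."""
    return a > 100

# One True/False per row: True means the row would receive the formatting.
highlight_and = [rule_and(a, b) for a, b in rows]
highlight_gt = [rule_gt_100(a, b) for a, b in rows]

print(highlight_and)
print(highlight_gt)
```

Each rule is evaluated once per row, which is why a custom formula is written for the first row of the range only.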
PIVOT TABLE

1.2 A pivot table is an interactive way to quickly summarize large amounts
of data. You can use a pivot table to analyze numerical data in detail and
answer unanticipated questions about your data. It is especially designed
for querying large amounts of data in many user-friendly ways.

Using a Pivot Table in Google Sheets

1. Open Your Spreadsheet

Open the Google Sheets file containing the data you want to summarize.

2. Select Your Data

Highlight the range of cells that contains the data (including headers).

Ensure your data has column headers for easier setup.

3. Insert a Pivot Table

Go to the Insert menu.

Select Pivot Table.

Choose whether to place the pivot table in a New Sheet (recommended) or in the Existing Sheet.

4. Set Up the Pivot Table

A sidebar will appear on the right side of the screen.

Customize the pivot table by clicking Add next to each of the following sections and choosing a field:

Rows: Add fields that will appear as rows (e.g., categories like names, products, dates).

Columns: Add fields to organize data into columns.

Values: Add fields for numerical data or calculations (e.g., sum, average, count).
Filters: Add fields to filter the data dynamically.

5. Adjust Field Settings

For Values, click the dropdown arrow to choose the aggregation type (e.g., sum, average, count,
max, min).

For Rows or Columns, sort the data alphabetically or numerically.

6. Analyze Your Data

The pivot table will automatically populate based on your selections.

You can tweak the fields or filters to adjust your analysis.
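
What the Rows, Columns, and Values settings compute can be sketched as a small group-and-sum routine. This is a minimal Python illustration under assumed data; the records and the function name pivot_sum are hypothetical, and the aggregation is fixed to SUM.

```python
from collections import defaultdict

# Hypothetical source data: (region, product, sales) rows.
records = [
    ("North", "Pen", 100),
    ("North", "Book", 250),
    ("South", "Pen", 80),
    ("South", "Pen", 120),
    ("South", "Book", 300),
]

def pivot_sum(rows):
    """Rows = region, Columns = product, Values = SUM of sales."""
    table = defaultdict(lambda: defaultdict(int))
    for region, product, sales in rows:
        table[region][product] += sales
    # Convert nested defaultdicts to plain dicts for easy printing.
    return {region: dict(cols) for region, cols in table.items()}

summary = pivot_sum(records)
for region, cols in summary.items():
    print(region, cols)
```

Changing the aggregation type in step 5 corresponds to replacing the running sum with a count, average, maximum, or minimum.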

IF THEN

1.3 In Google Sheets, "IF-THEN" logic is implemented using the IF
function. It allows you to perform a logical test and return different values
based on whether the condition is true or false.

Steps for using IF-THEN logic in a spreadsheet

1. Understand the Syntax of the IF Function

The general syntax is:

=IF(condition, value_if_true, value_if_false)

condition: The logical test (e.g., A1 > 10).

value_if_true: The value or action if the condition is TRUE.

value_if_false: The value or action if the condition is FALSE.

2. Open Your Spreadsheet

Open Google Sheets, or another spreadsheet tool.

3. Identify the Condition

Decide the logic you want to apply. For example:

"If a number in column A is greater than 50, display 'Pass'; otherwise, display 'Fail'."

4. Enter the IF Formula

In a cell, type the formula based on your condition. For the example above:

=IF(A1 > 50, "Pass", "Fail")

5. Drag or Copy the Formula

If you want the formula to apply to multiple rows:

Click the cell with the formula.

Drag the fill handle (a small square at the bottom-right corner of the cell) downward or across
other cells.

6. Customize for Advanced Conditions

You can nest multiple IF statements for complex logic:

=IF(A1 > 90, "A", IF(A1 > 75, "B", IF(A1 > 50, "C", "F")))

Alternatively, use AND or OR functions within the condition:

AND Example:

=IF(AND(A1 > 50, B1 < 20), "Yes", "No")

OR Example:

=IF(OR(A1 > 50, B1 < 20), "Yes", "No")

7. Test Your Formula

Check that the formula provides the expected results for different inputs.
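
The formulas above map directly onto ordinary conditional statements. The sketch below restates them in Python to make the order of evaluation explicit; the function names are hypothetical and the thresholds are the ones used in the examples.

```python
def pass_fail(score):
    """Mirror of =IF(A1 > 50, "Pass", "Fail")."""
    return "Pass" if score > 50 else "Fail"

def grade(score):
    """Mirror of the nested IF: conditions are checked from top to bottom."""
    if score > 90:
        return "A"
    elif score > 75:
        return "B"
    elif score > 50:
        return "C"
    return "F"

def both(a, b):
    """Mirror of =IF(AND(A1 > 50, B1 < 20), "Yes", "No")."""
    return "Yes" if (a > 50 and b < 20) else "No"

def either(a, b):
    """Mirror of =IF(OR(A1 > 50, B1 < 20), "Yes", "No")."""
    return "Yes" if (a > 50 or b < 20) else "No"

for s in (95, 90, 60, 50):
    print(s, pass_fail(s), grade(s))
```

Note that the comparisons are strict, so a score of exactly 90 falls into grade "B" and a score of exactly 50 fails.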


FIND & REPLACE

1.4 On your computer, open a spreadsheet in Google Sheets and click Edit > Find and replace.
Next to "Find," type the word you want to find. If you want to replace the word, enter the new
word next to "Replace with."

Steps to use Find & Replace in Google Sheets

1. Open the Find and Replace Tool

Shortcut: Press Ctrl + H (Windows) or Cmd + Shift + H (Mac).

Menu: Click on Edit > Find and Replace.

2. Enter Your Search Criteria

Find: Type the word, number, or value you want to search for.

Replace with: Enter the value you want to replace it with.

3. Adjust Options (if needed)

Search: Choose whether to search in:

All sheets (entire workbook).

This sheet (current sheet only).

Specific range (if highlighted).

Match case: Check this if the search is case-sensitive.

Match entire cell contents: Check this to find exact matches only.

Search using regular expressions: Use regex for advanced patterns.

4. Execute the Search

Find: Click this to highlight the next occurrence of the term.

Replace: Replaces the current instance of the search term with your new value.
Replace all: Replaces all instances of the term across the selected range or sheet.

5. Review Changes

Check the replacements to ensure accuracy.
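
The effect of the "Match case" and "Match entire cell contents" options can be illustrated with a short sketch. This is an assumption-laden Python approximation, not the Google Sheets implementation; find_replace and the sample cells are hypothetical, and the search term is treated as literal text (the regular-expression option is not modelled).

```python
import re

def find_replace(cells, find, replace_with, match_case=False, entire_cell=False):
    """Replace a literal search term in every cell of a list of strings."""
    flags = 0 if match_case else re.IGNORECASE
    pattern = re.escape(find)  # treat the search term as plain text
    if entire_cell:
        pattern = r"\A" + pattern + r"\Z"  # the whole cell must equal the term
    compiled = re.compile(pattern, flags)
    # A function replacement avoids backslash interpretation in replace_with.
    return [compiled.sub(lambda m: replace_with, cell) for cell in cells]

cells = ["apple pie", "Apple", "APPLE tart"]
default_result = find_replace(cells, "apple", "pear")
case_result = find_replace(cells, "apple", "pear", match_case=True)
exact_result = find_replace(cells, "apple", "pear", entire_cell=True)

print(default_result)
print(case_result)
print(exact_result)
```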

VLOOKUP

1.5 VLOOKUP, or "Vertical Lookup", is a Google Sheets function that searches for a
value in the first column of a range and returns related information from the same row
in a different column.

Steps to Use VLOOKUP

1. Set up your data:

Ensure your data is organized in columns.

The column you are searching in (first column of the range) should contain the values you want
to search for.

2. Write the formula:

Click on the cell where you want the result.

Enter the VLOOKUP formula.

Example:
Suppose your data is in range A2:C10, and you want to search for the value "Product A" in
column A and return its price from column B. Use this formula:

=VLOOKUP("Product A", A2:C10, 2, FALSE)

3. Adjust parameters:

Replace "Product A" with a cell reference (e.g., D2) if the search key is in another cell.

Adjust the index number to match the column you want to retrieve data from (1 is the first
column of the range). Keep the last argument as FALSE to require an exact match.

4. Press Enter:

Google Sheets will display the result if the search key is found. If not, you'll see #N/A.
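
The exact-match behaviour described above can be sketched as a simple row scan. This Python version is a simplified illustration: the function name vlookup, the price table, and the "#N/A" return string are assumptions, and unlike the spreadsheet function it compares text case-sensitively.

```python
def vlookup(search_key, table, index):
    """Exact-match lookup, like VLOOKUP(search_key, range, index, FALSE).

    `table` is a list of rows; the key is compared with the first column,
    and `index` is 1-based, as in the spreadsheet function.
    """
    for row in table:
        if row[0] == search_key:
            return row[index - 1]
    return "#N/A"  # stand-in for the spreadsheet error value

# Hypothetical data standing in for the range A2:C10.
price_table = [
    ("Product A", 25.0, "Stationery"),
    ("Product B", 40.0, "Books"),
]

found = vlookup("Product A", price_table, 2)
missing = vlookup("Product Z", price_table, 2)
print(found, missing)
```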
COUNT IF

1.6 The COUNTIF function counts the cells in a range that meet a single criterion. You can
use it to track sales figures, monitor project completions, and analyse customer data.
You can also use the COUNTIFS function to count the number of times all criteria are
met across multiple ranges.

Step-by-Step Guide:

1. Open Google Sheets:

Go to your Google Sheets document or create a new one by visiting Google Sheets.

2. Select a Cell for the Formula:

Click on the cell where you want the result of the COUNTIF formula to appear.

3. Enter the COUNTIF Formula:

In the selected cell, type the following formula structure:

=COUNTIF(range, criterion)

Replace range with the range of cells you want to evaluate, and criterion with the condition for
counting.

For example:

To count how many times the word "Apple" appears in cells A1 to A10, you would type:

=COUNTIF(A1:A10, "Apple")

4. Press Enter:

After typing the formula, press Enter. Google Sheets will evaluate the condition and display the
result in the selected cell.

5. Optional – Adjust Formula as Needed:

You can adjust the range and criterion to suit your needs. For instance, you could change the
range from A1:A10 to B1:B20 or adjust the criterion to count cells greater than a specific
number, e.g., =COUNTIF(B1:B20, ">50").
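
The counting logic can be sketched in a few lines. In this Python illustration the criterion is passed as a function rather than as a text string, which is a simplification; the sample lists and the name countif are hypothetical.

```python
def countif(cells, criterion):
    """Count the cells for which `criterion` (a function) returns True."""
    return sum(1 for cell in cells if criterion(cell))

# Hypothetical column contents.
fruits = ["Apple", "Pear", "Apple", "Plum", "apple"]
scores = [45, 72, 88, 50, 91]

# Text criteria are compared without regard to case here.
apple_count = countif(fruits, lambda c: c.lower() == "apple")
# Numeric criterion equivalent to ">50".
high_scores = countif(scores, lambda c: c > 50)

print(apple_count, high_scores)
```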
POWER BI

2.1 INTRODUCTION AND CONSOLE DETAILS

Power BI provides several console and diagnostic tools that can help with
managing and troubleshooting Power BI reports, datasets, and data
models. Here are the key details about Power BI's console tools:

1. Power BI Service Console

Power BI Service (online) is a cloud-based platform for creating, sharing, and
collaborating on reports and dashboards. Users access Power BI from their browsers
or through the Power BI mobile apps.

Key features:

Workspaces: Centralized collaboration for content (reports, datasets, dashboards).

Apps: Bundled reports and dashboards shared with users or groups.

Dataflows: Data transformation and integration tools for creating datasets.

Scheduled Refresh: Automating data updates for datasets.

Usage Metrics: Insights on how reports and dashboards are being used.

2. Power BI Desktop Console

Power BI Desktop is a desktop application used for creating reports, queries, and data
models.

Key features:

Query Editor: Allows users to shape and transform data before loading it into Power
BI.

Data Modelling: Users can create relationships between tables, define calculated
columns, and build measures using DAX (Data Analysis Expressions).
Visualizations: Drag-and-drop interface for creating visual reports, charts, and tables.

Publish to Service: Once reports are built, users can publish to the Power BI Service.

3. Power BI Admin Console

The Power BI Admin Console is designed for administrators to manage the Power BI
tenant. It's available to Power BI Service admins.

Key features:

Audit Logs: View and export activity logs to track usage and access events.

Capacity Metrics: Monitor the performance and resource usage of Power BI Premium
capacities.

Tenant Settings: Configure service settings and governance policies like dataset size
limits, sharing settings, and data retention.

Security: Manage roles, user permissions, and audit security compliance.

Data Gateway Management: Administer on-premises data gateways that provide
access to local data sources.

4. Power BI PowerShell Console

Power BI provides PowerShell cmdlets for automating administrative tasks. They are useful
for bulk management or automation.

Key features:

Power BI Cmdlets: Automate tasks like dataset refreshes, managing user permissions,
workspace creation, etc.

Tenant and Workspace Management: Manage workspaces, users, and resources in bulk.

5. Power BI REST API Console

The Power BI REST API allows programmatic access to manage content and services
within Power BI.

Key features:

Embedding Reports: Integrate Power BI reports into other applications.

Dataset and Report Management: Automate processes like dataset refresh, report
publishing, or data integration.

6. Performance Analyzer (Power BI Desktop)

Performance Analyzer helps diagnose performance issues with reports in Power BI
Desktop by recording and analyzing report query performance, load times, and
rendering times.

Available in Power BI Desktop under the "View" tab.

Each of these consoles plays a critical role in managing and troubleshooting Power BI
environments, either at the individual report level or the organizational/tenant level.
2.2 Data Uploading

Data uploading in Power BI is the process of adding files to your workspace so you can
analyze and visualize data.

Steps for uploading data in Power BI

1. Open Power BI Desktop:

Launch Power BI Desktop on your computer.

2. Get Data:

On the Home ribbon, click on the Get Data button.

Choose your data source type (Excel, SQL Server, Web, Text/CSV, etc.).

3. Choose Your Data Source:

Select the appropriate data source (for example, choose Excel for an Excel file).

Browse to the file location or provide connection details (for databases or online
sources).

4. Load Data:

After selecting the data source, click Connect (for databases or online sources).

Once connected, a list of available tables or sheets will appear.

Select the tables or sheets you want to load.

You can preview the data and apply transformations if needed (click Transform Data to
open Power Query for more options).
5. Load into Power BI:

After selecting the desired tables, click Load to import the data into Power BI.

Power BI will create a dataset based on the selected data.

6. Build Your Report:

Once the data is loaded, you can start building your report by dragging fields from the
Fields pane to the report canvas.

Create visualizations, tables, or graphs based on the loaded data.

7. Save and Publish:

Save the Power BI report on your local machine or publish it to the Power BI service for
sharing and further collaboration.
2.3 DATA MODELING

Data modeling in Power BI is the process of connecting multiple data sources and establishing
relationships between them to create a foundation for analysis and reporting.

Steps for data modeling in Power BI

1. Data Collection and Import

Connect to Data Sources: Power BI allows you to connect to various data sources like databases
(SQL, Oracle), cloud services (Azure, Google Analytics), Excel files, and web data.

Load Data: After connecting, import the data into Power BI using the Power Query Editor.

2. Data Transformation (Power Query Editor)

Clean Data: Remove errors, duplicate records, and handle missing values.

Shape Data: Transform the data by filtering, splitting columns, creating new calculated columns,
changing data types, etc.

Merge/Append Queries: Combine different tables or queries if needed to create a unified
dataset.

3. Define Relationships

Identify Tables and Keys: Determine which tables are related to each other, and create
relationships based on primary and foreign keys.

Create Relationships: Use the "Model" view to define the relationships between tables (e.g.,
one-to-many, many-to-many).

Set Relationship Properties: Define cardinality, cross-filter direction, and other properties of
relationships.

4. Create Calculated Columns and Measures

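Calculated Columns: These are columns created using DAX (Data Analysis Expressions)
formulas. They are computed row by row when the data is loaded or refreshed, and the results
are stored in the model.

Measures: Measures are dynamic calculations that aggregate or analyze data (e.g., sum,
average, count). They are evaluated when a report visual uses them, based on the filters
applied at that point.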

KPIs: Key Performance Indicators can be defined using DAX for performance tracking.
5. Hierarchies and Aggregations

Create Hierarchies: Create logical groupings like Date hierarchy (Year > Quarter > Month > Day)
to drill down in reports.

Set Aggregation: Specify how Power BI should aggregate data for measures, such as sum,
average, or distinct counts.

6. Data Modeling Optimization

Star Schema vs Snowflake Schema: Organize data into a star schema (fact tables and dimension
tables) for better performance and simplicity, or snowflake schema if normalization is necessary.

Optimize Data Types: Reduce storage size by choosing appropriate data types for columns (e.g.,
Integer vs Decimal).

Handle Large Data Models: Use techniques like data reduction, aggregation tables, and
incremental data loads to improve performance.

7. Data Security (Row-Level Security)

Define Roles: Create roles in Power BI to control access to data based on user identity.

Set Row-Level Security (RLS): Define filters on tables so that users see only the data that they are
permitted to see.

8. Model Validation and Testing

Check for Errors: Ensure that all relationships, calculations, and filters are functioning as
expected.

Test the Model: Use the "Data" and "Model" views to review the tables, relationships, and
calculations for consistency and accuracy.

9. Publish and Share

Publish to Power BI Service: Once the model is ready, publish it to the Power BI Service for
sharing with users and collaborating.

Schedule Refresh: Set up scheduled refresh for the dataset so the model gets updated with new
data automatically.
10. Create Reports and Dashboards

Build Visualizations: Use the data model to create interactive charts, tables, and graphs in Power
BI reports.

Create Dashboards: Aggregate the reports into dashboards for sharing insights across teams.
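
The idea behind steps 3 and 4 (a one-to-many relationship between a dimension table and a fact table, with a measure that sums across it) can be sketched without Power BI. The Python below is only a conceptual illustration under assumed data; it is not DAX, and the table contents and function name are hypothetical.

```python
# Hypothetical dimension table: one row per product (primary key: product_id).
products = {1: "Pen", 2: "Book"}

# Hypothetical fact table: many rows per product (foreign key: product_id).
sales = [
    {"product_id": 1, "amount": 100},
    {"product_id": 2, "amount": 250},
    {"product_id": 1, "amount": 60},
]

def total_sales_by_product(fact_rows, dimension):
    """Follow the one-to-many relationship and sum the fact amounts."""
    totals = {}
    for row in fact_rows:
        name = dimension[row["product_id"]]  # look up the related dimension row
        totals[name] = totals.get(name, 0) + row["amount"]
    return totals

totals = total_sales_by_product(sales, products)
print(totals)
```

Keeping descriptive attributes in the dimension table and numeric events in the fact table is the star-schema layout mentioned in step 6.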
2.4 PIE CHART

A pie chart in Power BI is a visual representation of data that displays it as a proportion of the
whole. Pie charts are a type of shape chart, which are charts without axes. When a numeric
field is dropped onto a shape chart, it calculates the percentage of each value to the total.

To create a pie chart in Power BI, follow these steps:

1. Open Power BI Desktop:

Launch Power BI Desktop application.

2. Load Your Data:

Import your dataset by clicking on the "Home" tab and selecting "Get Data". Choose your data
source and load the data into Power BI.

3. Create a New Report:

Once your data is loaded, click on the "Report" view (the canvas area).

4. Select Pie Chart:

In the Visualizations pane (on the right), click on the Pie Chart icon (it looks like a circle divided
into sections).

This will add a blank pie chart visual to the report canvas.

5. Assign Data Fields:

Drag and drop the data fields into the Values and Legend sections:

Values: This is typically the numerical data you want to visualize (e.g., sales, revenue).

Legend: This field represents categories (e.g., product categories, regions) that will split the pie
chart.

Optionally, you can add additional fields for tooltips or filters.

6. Customize the Pie Chart:

To customize the chart, use the Format pane (paint roller icon). Here you can adjust:

Title: Add or modify the chart title.

Data colors: Change the colors of each segment.

Legend: Show or hide the legend, and adjust its position.

Details: Control the slice labels, percentages, and other visual details.

7. Resize and Position:

Resize and position the pie chart on the canvas as needed.

8. Save the Report:

Once satisfied with your pie chart, save the report by clicking "File" > "Save As".
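
The percentage that each slice represents is the category value divided by the total. A minimal Python sketch of that arithmetic follows; the regional figures and the name pie_shares are hypothetical.

```python
def pie_shares(values_by_category):
    """Return each category's share of the total, in percent (1 decimal)."""
    total = sum(values_by_category.values())
    return {name: round(100 * value / total, 1)
            for name, value in values_by_category.items()}

# Hypothetical Values (sales) split by Legend (region).
shares = pie_shares({"North": 50, "South": 30, "West": 20})
print(shares)
```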
2.5 HISTOGRAM CHART
A histogram in Power BI is a visualization chart that shows how data is distributed in a dataset.
It's a bar chart that groups data points into bins and shows the number of data points in each
bin.

To create a histogram chart in Power BI, follow these steps:

1. Prepare your data

Ensure you have the data you want to visualize in Power BI. A histogram typically requires
continuous numerical data to create bins.

2. Load data into Power BI

Open Power BI Desktop.

Load your dataset by clicking on Home > Get Data and selecting the appropriate source (Excel,
SQL Server, etc.).

Once the data is loaded, you will see it in the Fields pane.

3. Select the 'Histogram' visualization

In the Visualizations pane, there is no direct "Histogram" chart type, but you can use the Column
chart or Bar chart and adjust it to resemble a histogram.

Select the Clustered Column Chart from the visualization pane.

4. Prepare the data for binning

To create bins (ranges for the histogram), you need to define the bin size using the grouping
feature:

In the Fields pane, right-click the numeric field (e.g., Age) and select New Group. Set the
group type to Bin, then choose the bin size or the number of bins.

5. Add the data to the chart

Drag the binned field created in the previous step (e.g., the Age bins group) to the Axis well
of the chart.

Drag the same numeric field, or another field to count, to the Values well and set its
aggregation to Count (this counts the number of entries in each bin).

6. Customize the bins (optional)

You can further adjust the bin size by modifying the grouping in the Fields pane. Right-click the
group and choose Edit Group to adjust bin size or range.

For more flexibility, consider using DAX functions to create custom bins or calculate bin ranges.

7. Format the chart

After you’ve set up the histogram, you can format it as needed. Click on the Format pane (paint
roller icon) and customize the chart:

Adjust colors, labels, gridlines, and axis formatting.

Turn on data labels for better visibility of counts.

8. Review the histogram

Once the data is visualized, ensure that the bins and frequencies are displayed correctly.

Modify the chart further as needed (e.g., by adjusting the axis, labels, or appearance).

These steps will help you create a basic histogram in Power BI.
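
The binning in steps 4 and 5 amounts to assigning each value to a fixed-width range and counting the values per range. A minimal Python sketch under assumed data (the ages and the function name histogram are hypothetical):

```python
from collections import Counter

def histogram(values, bin_size):
    """Count values per bin; each bin is labelled by its lower bound."""
    counts = Counter((v // bin_size) * bin_size for v in values)
    return dict(sorted(counts.items()))

# Hypothetical Age column, grouped into bins of width 10.
ages = [21, 25, 29, 30, 34, 41, 47, 48, 52]
bins = histogram(ages, 10)

for lower, count in bins.items():
    print(f"{lower}-{lower + 9}: {count}")
```

A smaller bin size gives more, narrower bars; a larger one smooths the distribution.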
SPSS
SPSS (Statistical Package for the Social Sciences), also known as IBM SPSS Statistics since 2009,
is a user-friendly software package used for the analysis of statistical data and to make data-
driven decisions.

3.1 Loading Data

Loading data in SPSS refers to the process of importing or entering data into the SPSS software
to perform statistical analysis. There are various ways to load data into SPSS:

1. Entering Data Manually:

You can enter data directly into the SPSS Data View, similar to entering data in a spreadsheet.
Each row represents an observation or case, and each column represents a variable.

2. Importing Data from External Files:

Excel Files: You can import data from Excel (.xlsx, .xls) files by going to File > Open > Data and
selecting the Excel file.

CSV Files: Comma-separated value files (.csv) can also be loaded into SPSS by selecting File >
Open > Data and choosing the .csv file.

Text Files: Text files (.txt) can be imported using the Text Wizard to specify delimiters and other
file parameters.

Databases: SPSS can also connect to databases like SQL, allowing you to import data directly.

3. Using Syntax:

SPSS also supports syntax commands to load data programmatically, which can be useful for
automation or reproducibility. For example:

GET FILE='C:\path\to\datafile.sav'.

After loading data, SPSS can perform a variety of statistical operations, such as descriptive
statistics, hypothesis testing, regression analysis, etc., based on the imported data.
3.2 One Sample T test
A one-sample t-test is a statistical test used to determine whether the mean of a single sample
is significantly different from a known or hypothesized population mean. It compares the
sample mean to the population mean while accounting for the sample's size and variability.

To perform a one-sample t-test in SPSS, follow these steps:

Step 1: Open SPSS and Load Data

Open SPSS and load your dataset. If you’re entering data manually, go to the Data View and
enter your variable in one column (for example, "Scores" or "Weights").

Step 2: Click on "Analyze"

In the top menu, click on Analyze.

Step 3: Select "Compare Means"

From the dropdown menu, select Compare Means, then choose One-Sample T Test.

Step 4: Select the Test Variable

In the "One-Sample T Test" dialog box, move your test variable (e.g., the column with the data
you want to test) into the Test Variable(s) box. This is the variable you want to compare to the
population mean.

Step 5: Enter the Test Value

In the Test Value box, enter the population mean (the value you want to compare your sample
mean to). For example, if you are comparing the sample mean of a group of students' test scores
to a known population mean of 75, enter 75.

Step 6: Choose Options (Optional)

Click on the Options button if you want to change the confidence level or other settings. By
default, SPSS uses a 95% confidence level.

Step 7: Run the Test

Click OK to run the test.

Step 8: Review the Output

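SPSS will generate an output with the t-test results, including the t-value, degrees of freedom
(df), the significance value (p-value), the mean difference, and its confidence interval.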
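
The t-value in the output is the difference between the sample mean and the test value, divided by the standard error of the mean. The Python sketch below shows that calculation only (no p-value); the scores are hypothetical and the test value 75 follows the example in step 5.

```python
import math
import statistics

def one_sample_t(sample, test_value):
    """Return (t, df) for H0: population mean == test_value."""
    n = len(sample)
    mean = statistics.mean(sample)
    sd = statistics.stdev(sample)  # sample standard deviation (n - 1)
    t = (mean - test_value) / (sd / math.sqrt(n))
    return t, n - 1

# Hypothetical test scores compared with a population mean of 75.
scores = [72, 78, 81, 69, 75, 80, 77, 74]
t_stat, df = one_sample_t(scores, 75)
print(round(t_stat, 4), df)
```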

3.3 Paired Sample T Test

To perform a paired sample t-test in SPSS, follow these steps:

1. Open Your Data:

Open your dataset in SPSS where you have the two related variables (e.g., pre-test and post-test
scores for the same subjects).

2. Go to the 'Analyze' Menu:

Click on Analyze in the top menu.

3. Select 'Compare Means':

From the dropdown, choose Compare Means and then select Paired-Samples T Test.

4. Select the Paired Variables:

In the window that appears, you’ll see a list of your variables.

Select the two variables you want to compare (e.g., pre-test and post-test scores) and move
them to the Paired Variables box. SPSS will ask you to specify pairs, so ensure the correct pairing.

5. Check Options:

You can click Options to set the confidence interval and other statistics if needed, but the default
settings are typically fine.

6. Run the Test:

Click OK to run the paired sample t-test.

7. Interpret the Output:

In the output window, you will see the paired sample statistics, the t-value, degrees of freedom
(df), and the significance value (p-value).

If the p-value is less than your alpha level (usually 0.05), you can reject the null hypothesis,
indicating a significant difference between the paired groups.
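
A paired-samples t-test is a one-sample t-test on the per-subject differences. The sketch below computes the t-value and degrees of freedom in Python; the pre-test and post-test scores are hypothetical, and the p-value lookup is left out.

```python
import math
import statistics

def paired_t(before, after):
    """Return (t, df) for the mean of the paired differences (after - before)."""
    diffs = [a - b for b, a in zip(before, after)]
    n = len(diffs)
    mean_d = statistics.mean(diffs)
    sd_d = statistics.stdev(diffs)  # sample standard deviation of differences
    return mean_d / (sd_d / math.sqrt(n)), n - 1

# Hypothetical scores for the same five subjects.
pre = [60, 65, 70, 58, 62]
post = [66, 68, 75, 60, 66]
t_paired, df_paired = paired_t(pre, post)
print(round(t_paired, 4), df_paired)
```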
3.4 Two Sample T Test

To perform a two-sample t-test in SPSS, follow these steps:

1. Open SPSS and load your data file.

2. Check Your Data: Ensure that your data is in the correct format:

One column for the dependent variable (the measurement you want to compare).

One column for the independent variable (the grouping variable, which indicates the two groups
being compared).

3. Navigate to the t-test Option:

Click on Analyze in the top menu.

Choose Compare Means.

Select Independent-Samples T Test.

4. Set Up the T-Test:

In the dialog box, move the dependent variable (the one you are comparing) to the Test
Variable(s) box.

Move the grouping variable (the one that defines the two groups) to the Grouping Variable box.

5. Define Groups:

Click on the Define Groups button.

In the pop-up window, specify the values for the two groups you want to compare (e.g., Group
1: 1 and Group 2: 2, depending on your data).

Click Continue.
6. Select Additional Options (optional):

If needed, click on Options to specify confidence intervals or other settings.

Click Continue.

7. Run the Test:

Click OK to run the test.

8. Interpret the Output:

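SPSS will generate an output with several tables, including Group Statistics and the
Independent Samples Test table, which reports Levene's test for equality of variances along
with the t-value, degrees of freedom (df), and p-value.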
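
For reference, the t-value for two independent groups under the equal-variances assumption uses a pooled variance. This Python sketch covers only that case (it does not compute the unequal-variances version or the p-value), and the two groups are hypothetical.

```python
import math
import statistics

def independent_t(group1, group2):
    """Return (t, df) for two independent samples, assuming equal variances."""
    n1, n2 = len(group1), len(group2)
    m1, m2 = statistics.mean(group1), statistics.mean(group2)
    v1, v2 = statistics.variance(group1), statistics.variance(group2)
    # Pooled variance weights each group's variance by its degrees of freedom.
    pooled = ((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2)
    se = math.sqrt(pooled * (1 / n1 + 1 / n2))
    return (m1 - m2) / se, n1 + n2 - 2

# Hypothetical measurements for Group 1 and Group 2.
g1 = [10, 12, 14, 16, 18]
g2 = [8, 9, 10, 11, 12]
t_ind, df_ind = independent_t(g1, g2)
print(round(t_ind, 4), df_ind)
```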

3.5 Chi Square Test
A Chi-square test in SPSS is a statistical method used to determine if there is a significant
association between two categorical variables. It's commonly used for testing relationships in
contingency tables (cross-tabulation).

To perform a Chi-square test in SPSS, follow these steps:

1. Prepare your data

Ensure your data is in the correct format for the Chi-square test. Each variable should be
categorical (nominal or ordinal), and each row should represent an individual observation.

2. Open SPSS

Start SPSS and load the dataset you want to analyze.

3. Go to Crosstabs

Click on Analyze in the top menu.

Select Descriptive Statistics, then click on Crosstabs.

4. Select Variables

In the Crosstabs dialog box:

Move one categorical variable (e.g., Gender) into the Row(s) box.

Move the other categorical variable (e.g., Voting Preference) into the Column(s) box.

5. Set Up the Chi-Square Test

Click on the Statistics button.

In the dialog that appears, check the box next to Chi-square.

Click Continue to return to the Crosstabs window.

6. View Cell Counts (Optional)

If you want to see the observed and expected frequencies:

Click on the Cells button.

Check both Observed and Expected counts.

Click Continue.

7. Run the Test

Click OK to run the test.

8. Interpret the Output

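SPSS will provide the results in the Output Viewer, including the crosstabulation and a
Chi-Square Tests table with the Pearson Chi-Square value, degrees of freedom (df), and
significance value (p-value). If the p-value is less than your alpha level (usually 0.05),
there is a significant association between the two variables.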
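
The Pearson chi-square statistic compares each observed count with the count expected if the two variables were independent. The Python sketch below computes the statistic and degrees of freedom from a crosstab; the 2 x 2 counts are hypothetical and the p-value is not computed.

```python
def chi_square(table):
    """Pearson chi-square statistic and df for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, df

# Hypothetical crosstab: rows = Gender, columns = Voting Preference.
observed = [[30, 20],
            [20, 30]]
chi2, chi_df = chi_square(observed)
print(chi2, chi_df)
```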

3.6 Cluster Analysis

To perform cluster analysis in SPSS, you can follow these steps:

Step 1: Prepare Your Data

Ensure that your data is well-organized, with each variable as a column and each observation as
a row.

Check for missing values, as they can interfere with the analysis. You might want to either
remove or impute missing values before proceeding.

Step 2: Access Cluster Analysis

1. Open your dataset in SPSS.

2. Click on Analyze in the top menu.

3. Select Classify.

4. Choose K-Means Cluster (for partitioning cases into a specified number of clusters) or
Hierarchical Cluster (for a data-driven approach).

Step 3: Perform K-Means Cluster Analysis (for fixed clusters)

1. Select Variables: Choose the variables you want to use for clustering (e.g., those that describe
your subjects or cases).

2. Choose the Number of Clusters: Under "Number of Clusters," specify the number of clusters
you want to form.

3. Options: You can set the criteria for convergence under Iterate. If the variables are on
different scales, standardize them before running the analysis, since K-Means is sensitive to scale.

4. Click OK to run the analysis.

Step 4: Perform Hierarchical Cluster Analysis (for dynamic clusters)

1. Select Variables: Choose the variables you want to cluster.

2. Choose Clustering Method: In the "Method" section, select a method (e.g., Ward's method,
Average Linkage).

3. Choose a Distance Measure: Typically, Squared Euclidean distance is used for continuous data.

4. Dendrogram: A dendrogram is a tree diagram that shows the hierarchical relationship
between clusters. Request it under Plots, then decide where to cut the tree for the desired
number of clusters.

5. Click OK to run the analysis.

Step 5: Review Results

K-Means: SPSS will give you the cluster centers, the number of cases in each cluster, and the
ANOVA table to check the between-group differences.

Hierarchical: The output includes a dendrogram, which helps visualize the hierarchical structure
of your data. You can also get a cluster membership table to see which case belongs to which
cluster.

Step 6: Interpret the Results

Review the cluster centers (for K-Means) or the dendrogram (for hierarchical) to understand the
characteristics of each cluster.

You can visualize clusters using scatter plots or other graphical methods, especially if you
reduced the number of variables through factor analysis.

Step 7: Save Cluster Membership

You can save the cluster membership to a new variable by clicking Save in the cluster analysis
dialog box and selecting Cluster membership.
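
The K-Means procedure from Step 3 can be sketched as a loop that assigns each case to its nearest center and then recomputes the centers. The Python below is a plain textbook version for two variables, not the SPSS algorithm itself; the points, the starting centers, and the fixed iteration count are assumptions.

```python
def nearest(point, centers):
    """Index of the center with the smallest squared Euclidean distance."""
    distances = [(point[0] - c[0]) ** 2 + (point[1] - c[1]) ** 2 for c in centers]
    return distances.index(min(distances))

def k_means(points, centers, iterations=10):
    """Plain k-means on 2-D points, starting from the given centers."""
    centers = list(centers)
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for p in points:
            clusters[nearest(p, centers)].append(p)
        # Move each center to the mean of its cluster; keep it if the cluster is empty.
        centers = [
            (sum(x for x, _ in cl) / len(cl), sum(y for _, y in cl) / len(cl)) if cl else c
            for cl, c in zip(clusters, centers)
        ]
    membership = [nearest(p, centers) for p in points]
    return centers, membership

# Hypothetical cases measured on two variables, with two starting centers.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
final_centers, membership = k_means(points, [(1, 1), (8, 8)])
print(final_centers)
print(membership)
```

The membership list plays the role of the saved cluster variable: one cluster number per case.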

No ratings yet
College Management System ER Diagram PDF
4 pages
At Lease 10 Difference Between Oracle 8i and SQL Server 2000
No ratings yet
At Lease 10 Difference Between Oracle 8i and SQL Server 2000
26 pages
Data Analysis Functions in A Nutshell
No ratings yet
Data Analysis Functions in A Nutshell
33 pages
Database Management Systems 1 Removed
No ratings yet
Database Management Systems 1 Removed
168 pages
Top 15 Most Useful Google Sheets Tips and Tricks
No ratings yet
Top 15 Most Useful Google Sheets Tips and Tricks
23 pages
D49656GC10 39 SF
No ratings yet
D49656GC10 39 SF
5 pages
Advanced Spreadsheet Skills: Lesson 2
No ratings yet
Advanced Spreadsheet Skills: Lesson 2
43 pages
1415 Sem 1
No ratings yet
1415 Sem 1
6 pages
Spread Sheet Manual (Form 7) : Microsoft Excel
No ratings yet
Spread Sheet Manual (Form 7) : Microsoft Excel
22 pages
Chap2.Basic Function
No ratings yet
Chap2.Basic Function
101 pages
Advantage of Auxiliary Cloud Services
No ratings yet
Advantage of Auxiliary Cloud Services
5 pages
Data Analysis Functions in A Nutshell
No ratings yet
Data Analysis Functions in A Nutshell
33 pages
Data Analysis Functions
No ratings yet
Data Analysis Functions
33 pages
Empowerment Technologies Quarter 2 Module 1
No ratings yet
Empowerment Technologies Quarter 2 Module 1
44 pages
Excel Formulas: Functions: Saqer Al-Shra'ah
No ratings yet
Excel Formulas: Functions: Saqer Al-Shra'ah
15 pages
Autumn Term Exam Scores: Jones Smith Bloggs Drury Evans
No ratings yet
Autumn Term Exam Scores: Jones Smith Bloggs Drury Evans
28 pages
Excel DSS Functions
No ratings yet
Excel DSS Functions
13 pages
Functions in A Nutshell
No ratings yet
Functions in A Nutshell
33 pages
Neighbourhood Blocking For Record Linkage
No ratings yet
Neighbourhood Blocking For Record Linkage
10 pages
Simple Notes
No ratings yet
Simple Notes
12 pages
Advanced Excel - Using The IF Function in Excel To Program Your Spreadsheets
No ratings yet
Advanced Excel - Using The IF Function in Excel To Program Your Spreadsheets
7 pages
Sheets - Advanced
100% (1)
Sheets - Advanced
39 pages
Google Sheets 101-The Beginner's Guide To Online Spreadsheets
No ratings yet
Google Sheets 101-The Beginner's Guide To Online Spreadsheets
23 pages
Adv Excel WKBK
No ratings yet
Adv Excel WKBK
26 pages
Excel Concept
No ratings yet
Excel Concept
12 pages
Spreadsheets
No ratings yet
Spreadsheets
17 pages
Complete Worksheet
No ratings yet
Complete Worksheet
21 pages
Chapter 3
No ratings yet
Chapter 3
30 pages
Intro To Google Sheets
No ratings yet
Intro To Google Sheets
17 pages
Vinith Siripuram Data Engineer
No ratings yet
Vinith Siripuram Data Engineer
5 pages
ENGG1003 02 SpreadsheetApplication
No ratings yet
ENGG1003 02 SpreadsheetApplication
55 pages
SWAT - A System-Wide Approach To Tunable Leakage Mitigation in Encrypted Data Stores
No ratings yet
SWAT - A System-Wide Approach To Tunable Leakage Mitigation in Encrypted Data Stores
17 pages
2019 DBMS
No ratings yet
2019 DBMS
4 pages
Basic Excel
No ratings yet
Basic Excel
16 pages
Formulas Guide @excelbychris NEW
No ratings yet
Formulas Guide @excelbychris NEW
18 pages
Cenario or Sensitivity Analysis-1
No ratings yet
Cenario or Sensitivity Analysis-1
24 pages
2 - Google-Sheets-Course-Outline
No ratings yet
2 - Google-Sheets-Course-Outline
5 pages
Information Tech. Notes
No ratings yet
Information Tech. Notes
45 pages
Sandipan BA Practical File
No ratings yet
Sandipan BA Practical File
34 pages
Google Sheets Cheat Sheet
No ratings yet
Google Sheets Cheat Sheet
2 pages
Spreadsheet Functions
No ratings yet
Spreadsheet Functions
25 pages
Excel - Session - 1
No ratings yet
Excel - Session - 1
19 pages
Google Sheet
No ratings yet
Google Sheet
1 page
Dbms Jennys Lectures Watermarked
No ratings yet
Dbms Jennys Lectures Watermarked
92 pages
Excel - Part IV
No ratings yet
Excel - Part IV
9 pages
C Functions
No ratings yet
C Functions
17 pages
Google Sheets
No ratings yet
Google Sheets
5 pages
Chapter 1
No ratings yet
Chapter 1
6 pages
ENGG1003 02 SpreadsheetApplication
No ratings yet
ENGG1003 02 SpreadsheetApplication
57 pages
Handout M4
No ratings yet
Handout M4
7 pages
Google Sheets Guide
No ratings yet
Google Sheets Guide
11 pages
STATAPP
No ratings yet
STATAPP
4 pages
Grade 9 - Spreadsheet
No ratings yet
Grade 9 - Spreadsheet
12 pages
Basic Skills and Applications of IT
No ratings yet
Basic Skills and Applications of IT
55 pages
ENGG1003!02!03 SpreadsheetApplication C
No ratings yet
ENGG1003!02!03 SpreadsheetApplication C
57 pages
Ethio Coders
100% (7)
Ethio Coders
4 pages
Key Points For Students - Microsoft Excel - Introducing Spreadsheets
No ratings yet
Key Points For Students - Microsoft Excel - Introducing Spreadsheets
2 pages
Summary of Week 2 Material 1710450914
No ratings yet
Summary of Week 2 Material 1710450914
36 pages
Key Points For Students - Google Sheets - Introducing Spreadsheets
No ratings yet
Key Points For Students - Google Sheets - Introducing Spreadsheets
2 pages
Google Sheets Tutorial With PDF Mastering Google Sheets in 2025
No ratings yet
Google Sheets Tutorial With PDF Mastering Google Sheets in 2025
8 pages
DeepSeek - Google Spreadsheet Tutorials
No ratings yet
DeepSeek - Google Spreadsheet Tutorials
13 pages
Computer Reviewer (Q4)
No ratings yet
Computer Reviewer (Q4)
4 pages
13 Spreadsheet
No ratings yet
13 Spreadsheet
17 pages
Excel Notes
No ratings yet
Excel Notes
8 pages
Spreadsheet Notes
No ratings yet
Spreadsheet Notes
7 pages
Digital Tools Grade 6
No ratings yet
Digital Tools Grade 6
9 pages
Google Sheets Cheat Sheet - Google Workspace Learning Center
No ratings yet
Google Sheets Cheat Sheet - Google Workspace Learning Center
4 pages
Bossing Spreadsheets: A Girl's Guide to Data Analysis: Bossing Up
From Everand
Bossing Spreadsheets: A Girl's Guide to Data Analysis: Bossing Up
Sophie Johnson
No ratings yet
Pivot Tables In Depth For Microsoft Excel 2016
From Everand
Pivot Tables In Depth For Microsoft Excel 2016
Suljan Qeska
3.5/5 (3)