Data Analytics Unit 1
Data analytics takes raw data and turns it into useful information. It uses various tools and methods to discover patterns and solve problems with data.
Data analytics helps businesses make better decisions and grow.
Companies around the globe generate vast volumes of data daily, in the form of log files, web server records, transactional data, and various kinds of customer-related data. In addition, social media websites generate enormous amounts of data.
Companies ideally need to use all of their generated data to derive value out of it and make impactful business decisions. Data analytics is used to drive
this purpose.
Now that you have looked at what data analytics is, let’s understand how we can use data analytics.
1. Improved Decision Making: Data analytics eliminates guesswork and manual tasks. Whether it is choosing the right content, planning marketing campaigns, or developing products, organizations can use the insights they gain from data analytics to make informed decisions, leading to better outcomes and greater customer satisfaction.
2. Better Customer Service: Data analytics allows you to tailor customer service according to their needs. It also provides personalization and builds
stronger relationships with customers. Analyzed data can reveal information about customers’ interests, concerns, and more. It helps you give better
recommendations for products and services.
3. Efficient Operations: With the help of data analytics, you can streamline your processes, save money, and boost production. With an improved understanding of what your audience wants, you spend less time creating ads and content that aren’t in line with your audience’s interests.
4. Effective Marketing: Data analytics gives you valuable insights into how your campaigns are performing, which helps in fine-tuning them for optimal outcomes. You can also find potential customers who are most likely to interact with a campaign and convert into leads.
Let’s now look at one of the most widely used tools for data analytics: Microsoft Excel.
Data analysis with Excel is a common and accessible way for individuals and businesses to analyze and visualize data. Microsoft Excel provides a range
of tools and functions for performing basic to advanced data analysis tasks.
The software enables users to seamlessly import and organize data from various sources, facilitating a structured foundation for analysis. Data
cleaning becomes an intuitive process with Excel’s capabilities, allowing users to identify and rectify issues like missing values and duplicates.
PivotTables, a hallmark feature, empower users to swiftly summarize and explore large datasets, providing dynamic insights through customizable
cross-tabulations.
To familiarize yourself with the Excel user interface, you can follow these steps:
Open Excel:
• The first step is to open Excel on your computer. You can either open a blank workbook or an existing one.
• Familiarize yourself with the Ribbon: The Ribbon is the main menu bar that appears at the top of the Excel window. It contains various tabs, each
with different groups of commands that you can use to perform tasks in Excel. Spend some time exploring each tab to get an idea of what it
contains.
• Get to know the Quick Access Toolbar: The Quick Access Toolbar is a customizable toolbar that appears next to the Ribbon. You can add frequently
used commands to the toolbar for quick and easy access.
• Learn about the different views: Excel has different views, including Normal view, Page Layout view, and Page Break Preview. Each view offers a
different way to work with your data and design your worksheets.
• Understand the different elements of a worksheet: A worksheet in Excel is made up of cells, rows, and columns. Each cell is identified by a unique
cell reference, which is a combination of the column letter and row number.
• Explore the different formatting options: Excel offers a variety of formatting options that you can use to customize your worksheets, including font
styles, colors, borders, and cell alignment.
• Practice entering and editing data: Enter some sample data into a worksheet and practice editing it. You can use the cut, copy, and paste
commands to move data around, and the fill handle to quickly fill a series of cells with data.
• Use the Formula Bar: The Formula Bar is located above the worksheet and displays the contents of the active cell. You can use it to enter and edit
formulas and functions.
If you use Microsoft Excel in your professional role, you may process different kinds of data for different projects. Each type may require different formulas and
commands. Learning about the different types of data in Excel may help you understand how and when to use them for your own professional spreadsheets.
In this article, we explore what Excel data types are, the different types and some tips you can use.
Excel data types are the four different kinds of values in Microsoft Excel. The four types of data are text, number, logical and error. You may perform different
functions with each type, so it’s important to know which ones to use and when to use them. You may also consider that some data types may change when
exporting data into a spreadsheet.
Here’s a list of the four data types you can find in Microsoft Excel, with information about the ways you can use them:
1. Number data
Data in this category includes any kind of number, from large numbers to small fractions, and may be quantitative or qualitative. It’s important to remember the difference between quantitative and qualitative number values because some numbers may not represent an amount of something. For example, you might enter a number that represents financial earnings in one cell and a number that represents a date in another. Both count as number data, but may be entered differently in the spreadsheet. Make sure you use the appropriate symbols and formats to ensure Excel reads your number data accurately. Examples of number data include:
• Monetary totals
• Whole numbers
• Percentages
• Decimals
• Dates
• Times
• Integers
• Phone numbers
2. Text data
This kind of data includes characters such as alphabetical, numerical and special symbols. The primary difference between number data and text data is that
you can use calculations on number data but not text data. Since there can be overlap between these two types of data, you may manually change the format
of a cell to ensure it operates the way you want. You may also use text data to label columns or rows to help keep track of different categories. For example,
you may label a row “revenue” and a column “January 2022.”
Excel may categorize figures it doesn’t recognize as text data by default, so it’s important to format your data to fit the type you want. Examples of text data
may include:
• Words
• Sentences
• Dates
• Times
• Addresses
3. Logical data
Data in this type is either TRUE or FALSE, usually as the product of a test or comparison. This means you can use a function to determine whether the data in your spreadsheet meets different measures. For example, you may want to use your spreadsheet to set sales goals and measure whether your sales performance matches them. You may conduct these tests using logical functions for different scenarios. The four logical functions are:
• AND: An AND function may help you determine whether your data meets multiple conditions. For example, you might use this function to test if
data in one cell is larger than a certain amount and the data in another cell is also larger than another amount.
• OR: You may use this function to determine that at least one of your arguments meets your conditions. If none of the data matches your conditions,
Excel produces a FALSE value.
• XOR: This function stands for “Exclusive Or”: with two arguments, it returns TRUE when exactly one of them is TRUE (with more than two arguments, Excel returns TRUE when an odd number of them are TRUE). For example, you might use this function to ensure that only one of two cells contains a certain value.
• NOT: This function reverses the logical value of its argument, returning TRUE when the argument is FALSE. You might use it to filter for arguments that don’t match your conditions so you can assess possible patterns in that data.
4. Error data
This type of data occurs when Excel recognizes a mistake or missing information while processing your entry. For example, if you attempt to run a function on
a cell that contains text data, Excel produces the error value #VALUE!. This helps you identify where the issue is so you can correct it and produce the result
you want. A “#” character at the beginning of each error value can help you easily recognize these instances. Knowing the different error values can help you
understand how to resolve different mistakes or add the appropriate information. These values are:
• #NAME?: You may see this value if you have a value inside a formula without quotes or with a beginning or end quote missing. It may also populate
if there’s a typo in the formula.
• #DIV/0!: This error value arises if you try dividing a number by zero (or by an empty cell). Since the result is undefined, Excel uses #DIV/0! to show where you can try a different equation.
• #REF!: An invalid cell reference error value may result if you remove or paste items in a cell or range of cells where you previously entered a formula.
To correct this issue, you can undo your previous action and place your new data in a cell or cell range that doesn’t contain a formula.
• #NUM!: A #NUM! value may appear if you enter an invalid formula or function. It may also appear if the total that a formula or function produces is
too large for Excel to represent in a cell.
• #N/A: You may enter this error value when you want to indicate to yourself areas where you can enter a value later. Excel may also automatically
populate this value if imported data contains empty or unreadable cells.
• #VALUE!: This error indicates that an argument or operator in a function or formula is invalid. For example, if you try to calculate the sum of a range of cells where one cell contains alphabetical characters, you can get a #VALUE! result.
• #NULL!: If you’re referencing the intersection between a range of cells in a function, you may see this error value because those cells don’t actually
intersect. It may also appear if a range of cells for a function are missing separating commas.
Microsoft Excel is a software that you can use to organize data for your work and everyday life. Learn about formulas, functions, and more that you can apply
when using Excel.
Microsoft Excel can be an incredibly powerful tool to learn for your career, with benefits for everyone from data analysts to social media marketers. It has capabilities for the everyday user to create charts, graphs, and more to organize and visualize data.
In this article, you’ll learn what Excel is and does, formulas and functions to know, and some resources to help you get started.
What is Excel?
Excel is part of Microsoft’s 365 suite of software, alongside Word, PowerPoint, Teams, Outlook, and more. Microsoft Excel is a spreadsheet program that
allows users to organize, format, and calculate data in a spreadsheet. Excel users can create pivot tables and graphs to help them compute and visualize
complex data sets.
Excel and Google Sheets offer similar capabilities and features. The main difference is that Google Sheets offers a free version where several users can edit
the doc at the same time, which makes it convenient for real-time collaboration. When you share your Google Sheets link with others, they can then edit the
file.
There’s no shortage of things you can do with an Excel spreadsheet. Here are just a few common documents you can create:
• Balance sheet
• Budgets
• Calendar
• Data report
• Forms
• Income statement
• Invoice
• Mailing list
• Planning document
• Time sheet
• To-do list
All of these documents can be applied to your business or personal life. Excel is a versatile tool that can help you stay organized and calculate important
information.
When using Excel, you’ll want to be sure to know the basics of a spreadsheet program. Once you’re familiar with its interface and features, you can add data to
the cells or create a document by formatting the cells to your liking. Then, you can learn formulas and functions to calculate sums of money, for example, or
the number of products needed for a launch.
Basics of Excel
Excel formulas
• There are many formulas available in Excel that you can use to work with data. Every formula in Excel begins with an equal sign (=), written in the cell where you want the formula’s result to appear.
• These are some of the basic formulas to keep in mind:
• Add: To add the values of two or more cells, use the plus sign (+). Example: =A4+D5
• Subtract: To subtract the values of two or more cells, use the minus sign (-). Example: =A4-D5
• Multiply: To multiply the values of two or more cells, use the asterisk (*). Example: =A4*D5
• Divide: To divide the values of two or more cells, use the forward slash (/). Example: =A4/D5
• You can use parentheses to combine these operations into a larger formula. Example: =((A4+C4)/(D5-C5)*3)
Excel functions
• In Excel, you can use “functions” to automate calculations you would otherwise write out in a formula. Instead of using the plus sign to add a range of cells, you can use the SUM function. Let’s go through a few popular functions:
• SUM: The SUM function adds up a range of cells. To input the function, use parentheses to indicate the range of cells. If you are summing up the
numbers in cell A1 through A17, your formula would be: =SUM(A1:A17).
• AVERAGE: Similar to the SUM function, the AVERAGE function calculates the mean of the values of a range of cells. For example: =AVERAGE(A1:A17).
• IF: With the IF function, you can ask Excel to return values based on a logical test. The syntax looks like: IF(logical_test, value_if_true,
[value_if_false]). For example: =IF(A1>B1,”Over Budget”,”OK”).
• VLOOKUP: The VLOOKUP function searches the first column of a table for a value and returns a value from another column in the same row. The syntax looks like: VLOOKUP(lookup value, table array, column number, Approximate match (TRUE) or Exact match (FALSE)). For example: =VLOOKUP([@Engineer],tbl_Engineers,7,TRUE).
• COUNTIF: The COUNTIF function is another useful one that returns the number of cells that meet certain criteria. The syntax looks like:
COUNTIF(range, criteria). For example: =COUNTIF(A1:A17,”San Francisco”).
Importing Data
Sometimes you may need to perform data analyses on related data from multiple sources. To import data from another source, you can use the Get External Data commands on the Data tab of the Excel ribbon.
Query Tables
Sources such as Access Database can be used as query tables. Query tables are unique in that the data imported into Excel will update as the original source
is updated and the Excel data is refreshed.
You can connect your Excel table to the query table by means of a common key between the two tables.
VLOOKUP
VLOOKUP is a function that makes Excel search for a certain value in a column in order to return a value from a different column in the same row.
=VLOOKUP(What you want to look up, where you want to look for it, the column number in the range containing the value to return, return an Approximate or
Exact match – indicated as 1/TRUE, or 0/FALSE).
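The exact-match mode of VLOOKUP (0/FALSE) can be sketched in a few lines of Python; the function name and sample table below are hypothetical:

```python
# A minimal sketch of VLOOKUP's exact-match behaviour: search the first
# column of a table for a value and return a column from the matching row.

def vlookup_exact(lookup_value, table, col_index):
    """Return table[row][col_index - 1] for the first row whose first
    column equals lookup_value (column index is 1-based, as in Excel)."""
    for row in table:
        if row[0] == lookup_value:
            return row[col_index - 1]
    return "#N/A"  # Excel returns the #N/A error when no match is found

# Illustrative table: name, discipline, salary
engineers = [
    ["Ada", "Civil", 72000],
    ["Grace", "Software", 95000],
]
```

Here `vlookup_exact("Grace", engineers, 3)` plays the role of =VLOOKUP("Grace", table, 3, FALSE), returning the value from the third column of the matching row.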
• The Excel import tool can be used to create multiple container records at a time in your inventory. Any information to be associated with your
containers, including custom fields, can be imported.
• The import tool allows you to upload an Excel file listing information about the containers to be created. If you are setting up a new inventory in
ChemInventory this tool can be useful when migrating your data from external tools.
• Imports can be run at any time, including after an inventory has already been set up. Imported containers are always appended to your inventory as
new container records; if you would like to update or replace existing records, the Bulk Update Tool should be used instead.
• During an import, ChemInventory will assign a range of information to your containers automatically. This includes chemical structures, GHS safety
information and chemical synonyms.
Before starting an import, ensure that you have your container data ready to transfer into an Excel file. An Excel template that is compatible with the import
tool is available to download at the link below. While the use of the template is not mandatory, we recommend you transfer your existing information into this
file to ensure that formatting is correct.
The layout of import-compatible files is simple: each column in the spreadsheet represents a container field (such as name or CAS number), while each row
represents one container record to be created.
Data cleaning is the process of fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or incomplete data within a dataset. When combining
multiple data sources, there are many opportunities for data to be duplicated or mislabeled. If data is incorrect, outcomes and algorithms are unreliable, even
though they may look correct. There is no one absolute way to prescribe the exact steps in the data cleaning process because the processes will vary from
dataset to dataset. But it is crucial to establish a template for your data cleaning process so you know you are doing it the right way every time.
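As one possible template, the cleaning steps described above (standardizing formats, dropping incomplete rows, removing duplicates) might look like the following in Python. The record layout and the drop/deduplicate policies are assumptions for illustration; a real template would be adapted per dataset, as the text notes:

```python
# A minimal data-cleaning sketch. Assumed record layout: dicts with
# "name" and "email" keys.

def clean_records(records):
    """Trim whitespace, lowercase emails, drop incomplete rows, and
    remove exact duplicates while preserving order."""
    seen = set()
    cleaned = []
    for rec in records:
        name = (rec.get("name") or "").strip()
        email = (rec.get("email") or "").strip().lower()
        if not name or not email:
            continue  # incomplete record: drop (one possible policy)
        key = (name, email)
        if key in seen:
            continue  # duplicate of a record we already kept
        seen.add(key)
        cleaned.append({"name": name, "email": email})
    return cleaned
```

Keeping such steps in one reusable function is one way to "establish a template" so the same cleaning rules are applied consistently every time.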
Having clean data will ultimately increase overall productivity and allow for the highest-quality information in your decision-making.
In the rapidly evolving landscape of data-driven decision making and innovation, the ability to effectively leverage information is paramount. With an
increasing reliance on data analytics and data science, it is essential to recognize that raw data often requires a touch of refinement before it can be
harnessed for valuable insights. Enter the world of data transformation: a vital step that ensures your dataset is well-suited for accurate analysis and model
training.
Data transformation involves a range of techniques designed to make a dataset more suitable for analysis and other applications, such as training machine
learning models. This can include cleaning, formatting, and deleting data as required, making the information more accessible, structured, and easy to
interpret. As datasets vary in quality and structure, transforming them into a usable format is crucial for extracting value and driving better outcomes.
There are numerous methods available for effective data transformation, each catering to different project requirements and dataset characteristics. In this
blog post, we will outline the most common data transformation techniques, highlight their benefits, and help you choose the best techniques for you. By
mastering these methods, you’ll be well-equipped to prepare your data for insightful analysis and to build more accurate, reliable machine learning models.
Data transformation involves a series of steps that can vary depending on the specific needs and goals of a project. Here are a few key steps that are typically
followed:
• Data Discovery: Explore and understand data sources and their structure.
• Data Mapping: Define relationships between data elements from different sources. Document mapping specifications to guide the data
transformation process and maintain a clear record of changes.
• Code Generation: Develop scripts, algorithms, or tools to automate data transformation processes. Implement the data mapping and
transformation rules defined during the data mapping phase. Many data analysts prefer Python for this stage.
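A minimal Python sketch of the Code Generation step: the mapping rules documented during the data-mapping phase are expressed as small functions and applied in sequence. The field names and rules below are invented for illustration:

```python
# Hypothetical mapping rules from a data-mapping spec, expressed as
# one small function per rule.

def map_fields(row):
    """Documented mapping: source column 'amt' -> target column 'amount'."""
    return {"amount": row["amt"], "region": row["region"]}

def to_float(row):
    """Standardize the amount field as a float."""
    row["amount"] = float(row["amount"])
    return row

def transform(rows, steps):
    """Apply each transformation rule to every row, in order."""
    for step in steps:
        rows = [step(row) for row in rows]
    return rows
```

Running `transform(rows, [map_fields, to_float])` executes the mapping and standardization rules in the order the mapping document specifies, which keeps the pipeline auditable.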
Before we cover the data transformation methods, let’s first understand some of the benefits of data transformation:
• Improved data quality: Data transformation helps identify and correct inconsistencies, errors, and missing values, leading to cleaner and more
accurate data for analysis.
• Enhanced data integration: By converting data into a standardized format, data transformation enables data integration from multiple sources,
fostering collaboration and data sharing among different systems.
• Better decision making and business intelligence: With clean and integrated data, organizations can make more informed decisions based on
accurate insights, which improves efficiency and competitiveness.
• Scalability: Data transformation helps teams manage increasing volumes of data, allowing organizations to scale their data processing and
analytics capabilities as needed.
• Data privacy: Protect data privacy and comply with data protection regulations by transforming sensitive data through techniques like anonymization, pseudonymization, or encryption.
• Improved data visualization: Transforming data into appropriate formats or aggregating it in meaningful ways makes it easier to create engaging and
insightful data visualizations.
• Easier machine learning: Data transformation prepares data for machine learning algorithms by converting it into a suitable format and addressing
issues like missing values or class imbalance, which can improve model performance.
• Time and cost savings: By automating data transformation processes, organizations can reduce the time and effort needed for data preparation, allowing data scientists and analysts to focus on higher-value tasks.
As we explore the world of data transformation, it is essential to understand the various techniques available, which can be broadly categorized into four
groups:
• Constructive Transformations: Constructive transformations create new data attributes or features within the dataset, or enhance existing ones to
improve the quality and effectiveness of data analysis or machine learning models. These transformations add value to the dataset by generating
additional information or by providing better representation of existing data, making it more suitable for analysis.
• Destructive Transformations: Destructive transformations remove unnecessary or irrelevant data from the dataset, and streamline the information
to be more focused and efficient for analysis or modeling. This can include data cleaning (removing duplicates, correcting errors), dealing with
missing values (imputation or deletion), and feature selection (eliminating redundant or irrelevant features). By reducing noise and distractions,
destructive transformations contribute to more accurate insights and improved model performance.
• Aesthetic Transformations: Aesthetic transformations deal with the presentation and organization of data, ensuring it is easily understandable and
visually appealing for human interpretation. These transformations include data standardization (converting data to a common format), sorting,
and formatting. While aesthetic transformations may not directly affect the analytical or predictive power of the data, they play a vital role in
facilitating efficient data exploration and communication of insights.
• Structural Transformations: Structural transformations involve modifying the overall structure and organization of the dataset, making it more
suitable for analysis or machine learning models. They are useful in time series analysis, multi-source data integration, preparing data for machine
learning, data warehousing, and data visualization.
In this section, we will discuss various data transformation techniques, the problems they solve, scenarios where they can be useful, and a brief explanation
of how they work.
Data Manipulation:
Problem Solved: Data manipulation addresses data quality issues such as errors, inconsistencies, and inaccuracies within a dataset.
How it works: Techniques include removing duplicate records, filling missing values, correcting typos or data entry errors, and standardizing formats. Data
manipulation ensures that the dataset is reliable and accurate for analysis or machine learning models.
Normalization:
Problem Solved: Data normalization scales numerical features to a standard range, typically [0, 1] or [-1, 1]. This prevents features with larger scales from
dominating the model and causing biased results.
Scenarios: Normalization is particularly important when working with machine learning algorithms that are sensitive to the scale of input features.
How it works: Techniques include min-max scaling and z-score standardization, which transform the original feature values to a standard range or
distribution, making them more suitable for analysis and modeling.
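Both techniques can be sketched in a few lines of Python (the population standard deviation is assumed for the z-score):

```python
# Min-max scaling and z-score standardization on a list of numbers.

def min_max_scale(values):
    """Rescale values linearly to the [0, 1] range."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def z_score(values):
    """Standardize values to mean 0 and (population) std dev 1."""
    n = len(values)
    mean = sum(values) / n
    std = (sum((v - mean) ** 2 for v in values) / n) ** 0.5
    return [(v - mean) / std for v in values]
```

After min-max scaling, every feature lies in the same [0, 1] range, so no single feature dominates a scale-sensitive model; z-scores instead express each value as a number of standard deviations from the mean.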
Attribute Construction (Feature Engineering):
Problem Solved: Attribute construction creates new features or modifies existing ones to improve the performance of machine learning models.
Scenarios: Feature engineering can be useful in various scenarios, such as combining or aggregating features to capture higher-level patterns, applying
mathematical transformations (e.g., log, square root) to address skewed distributions, or extracting new information from existing features (e.g., creating day
of the week from a timestamp).
How it works: Feature engineering can be accomplished through various methods, such as mathematical transformations, aggregation, binning, and
dimensionality reduction techniques. The goal is to create new data attributes that are more representative of the underlying patterns in the data and that help
to improve the performance of the machine learning model.
Generalization:
Problem Solved: Generalization reduces the complexity of data by replacing low-level attributes with high-level concepts.
Scenarios: Generalization can be useful in scenarios where the dataset is too complex to analyze, such as in image or speech recognition.
How it works: Techniques include abstraction, summarization, and clustering. The goal is to reduce the complexity of the data by identifying patterns and
replacing low-level attributes with high-level concepts that are easier to understand and analyze.
Discretization:
Discretization converts continuous data or variables into discrete intervals, making them more suitable for analysis.
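A minimal Python sketch of interval binning, one way to discretize a continuous variable; the bin edges and labels below are invented for illustration:

```python
# Map a continuous value to a discrete interval label.

def discretize(value, edges, labels):
    """Return the label of the first bin whose upper edge exceeds value;
    values past the last edge fall into the final bin."""
    for edge, label in zip(edges, labels):
        if value < edge:
            return label
    return labels[-1]

# Illustrative bins: bucket ages into discrete groups
edges = [18, 40, 65]  # upper bounds of the first three bins
labels = ["minor", "adult", "middle-aged", "senior"]
```

For example, an age of 30 falls below the edge 40 and is labeled "adult", turning a continuous age column into a small set of categories.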
Data Aggregation:
Aggregation combines data at different levels of granularity, making it easier to analyze and understand.
Data Smoothing:
Smoothing removes noise and fluctuations from data, making it easier to analyze and interpret.
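One common smoothing technique, the simple moving average, can be sketched as follows: each point is replaced by the mean of a sliding window, which damps short-term fluctuations.

```python
# Simple moving average: the list of window-sized running means.

def moving_average(values, window):
    """Return the mean of each consecutive window of the given size."""
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]
```

A larger window produces a smoother (but shorter and more lagged) series, so the window size is a trade-off between noise reduction and detail.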
• Most analytics projects will encounter three possible types of missing data values, depending on whether there’s a
relationship between the missing data and the other data in the dataset:
• Missing completely at random (MCAR): In this case, there may be no pattern as to why a column’s data is missing. For
example, survey data is missing because someone could not make it to an appointment, or an administrator misplaces the
test results he is supposed to enter into the computer. The reason for the missing values is unrelated to the data in the
dataset.
• Missing at random (MAR): In this scenario, the reason the data is missing in a column can be explained by the data in other
columns. For example, a school student who scores above the cutoff is typically given a grade. So, a missing grade for a
student can be explained by the column that has scores below the cutoff. The reason for these missing values can be
described by data in another column.
• Missing not at random (MNAR): Sometimes, the missing value is related to the value itself. For example, higher income people
may not disclose their incomes. Here, there is a correlation between the missing values and the actual income. The missing
values are not dependent on other variables in the dataset.
o Delete Rows/Columns: Remove rows or columns containing missing data if they are not critical to your analysis. This
should be done cautiously, as it can result in data loss and may not always be an appropriate solution.
o Fill with Zeros: If missing values indicate a lack of data (e.g., for numeric data), you can fill the empty cells with zeros.
This approach is suitable when zeros won’t distort the analysis.
o Fill with Mean/Median/Mode: For numeric data, you can replace missing values with the mean (average), median
(middle value), or mode (most frequent value) of the column. This helps maintain the overall statistical properties of
the dataset.
o Interpolation: Interpolate missing values based on the values of adjacent data points. Linear interpolation assumes a
linear relationship between data points, while other methods may consider different patterns.
o Use Conditional Formulas: Excel’s built-in functions like IF, ISBLANK, and IFERROR can be used to create conditional
formulas. You can set conditions to replace missing values with specific values or calculate replacements based on
certain criteria.
o Fill Down or Fill Up: You can manually fill missing values by selecting a cell with data above or below the missing cell
and then dragging the fill handle (a small square at the cell’s corner) up or down to copy the adjacent data.
o Data Validation: Use Excel’s Data Validation feature to set rules for data entry. This can help prevent missing data at
the input stage.
o PivotTables: When summarizing data using PivotTables, you can choose to either exclude rows with missing data or
show them separately as “(blank)” in the PivotTable.
o Data Cleaning Tools: Excel offers various data cleaning and transformation tools, such as the “Remove Duplicates”
and “Text to Columns” functions, which can help you manage missing data.
o External Data Sources: If you have access to external data sources or databases, consider importing data that may fill
in the missing information.
o Data Imputation: In some cases, you can use more advanced techniques like regression analysis, K-nearest
neighbors imputation, or machine learning algorithms to predict and fill missing values based on the available data.
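To make two of the techniques above concrete, here is a minimal Python sketch (function names and sample data are our own, not Excel's) of mean imputation and simple linear interpolation over a list where `None` marks a missing value:

```python
from statistics import mean

def fill_with_mean(values):
    """Replace None entries with the mean of the known values."""
    known = [v for v in values if v is not None]
    m = mean(known)
    return [m if v is None else v for v in values]

def interpolate_linear(values):
    """Fill a single None gap between two known neighbours with their midpoint
    (the simplest case of linear interpolation)."""
    filled = list(values)
    for i, v in enumerate(filled):
        if v is None and 0 < i < len(filled) - 1:
            lo, hi = filled[i - 1], filled[i + 1]
            if lo is not None and hi is not None:
                filled[i] = (lo + hi) / 2
    return filled

print(fill_with_mean([10, None, 20, 30]))  # [10, 20, 20, 30]
print(interpolate_linear([10, None, 30]))  # [10, 20.0, 30]
```

Note how the two methods can give different replacements for the same gap: the mean uses the whole column, while interpolation uses only the neighbouring points.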
By default, all data entered in an Excel sheet uses the same formatting, which can make it look monotonous, dull, and difficult to read. Excel
provides a pool of formatting tools that customize the appearance of the data without affecting its content.
Conditional Formatting in Excel enables you to highlight cells with a certain color depending on a condition. It is an excellent way to visualize data in a spreadsheet.
You can also create rules with your own custom formulas. This guide provides step-by-step examples of the most popular conditional formatting
functions.
Conditional formatting is a feature in Microsoft Excel that allows you to apply specific formatting to your cells according to certain criteria. It enables you to
make sense of your data and spot significant trends.
• Conditional formatting is a powerful tool for improving data visualization by applying formatting rules based on the data itself. Here’s how you can
use it effectively:
• Highlighting Data Trends: Apply color scales to visualize data trends. For example, use a green-to-red color scale to highlight low-to-high values,
making it easier to spot patterns.
• Identifying Outliers: Use conditional formatting to highlight outliers or extreme values. For instance, apply a bold font or a different color to values
that exceed a certain threshold.
• Comparing Data Sets: Use icon sets to compare data sets visually. For instance, use up and down arrows to indicate whether values have
increased or decreased relative to a baseline.
• Creating Data Heatmaps: Apply conditional formatting to create data heatmaps, where colors represent data intensity. This is particularly useful for
analyzing large datasets and identifying patterns at a glance.
• Improving Readability: Use conditional formatting to improve the readability of your data. For example, you can apply alternating row colors to make
it easier to follow rows across a large dataset.
• Highlighting Important Values: Use conditional formatting to draw attention to specific values that are of particular interest. For example, you can
apply bold or italic formatting to key performance indicators.
• Customizing Rules: Tailor conditional formatting rules to suit your specific data and visualization needs. Experiment with different formatting
options to find the most effective way to communicate your data insights.
• By applying these conditional formatting techniques, you can enhance the visual appeal and clarity of your data visualizations, making it easier to
interpret and derive insights from your data.
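Conditional formatting itself lives in Excel's interface, but the rule logic behind a color scale is simple to state. A rough Python sketch (bucket thresholds and names are illustrative assumptions, not Excel's exact algorithm) of a three-bucket green-to-red scale:

```python
def colour_scale(values, low="green", mid="yellow", high="red"):
    """Assign each value a colour bucket based on where it falls in the range."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1  # avoid division by zero when all values are equal
    colours = []
    for v in values:
        position = (v - lo) / span  # 0.0 at the minimum, 1.0 at the maximum
        if position < 1 / 3:
            colours.append(low)
        elif position < 2 / 3:
            colours.append(mid)
        else:
            colours.append(high)
    return colours

print(colour_scale([5, 50, 95]))  # ['green', 'yellow', 'red']
```

Excel's built-in color scales interpolate continuously rather than in three fixed buckets, but the idea is the same: formatting is computed from each value's position within the data range.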
• Advanced Excel functions and formulas can significantly enhance your ability to analyze and manipulate data. Here’s an introduction to some key
concepts:
• VLOOKUP and HLOOKUP: These functions allow you to search for a value in a table and return a corresponding value from another column or row.
• INDEX and MATCH: A powerful combination used for looking up values in a table based on the row and column headings, offering more flexibility
than VLOOKUP.
• SUMIFS, COUNTIFS, and AVERAGEIFS: These functions enable you to sum, count, or average values based on multiple criteria, providing more
sophisticated filtering options.
• IFERROR: Helps to handle errors in formulas by returning a specified value if an error occurs.
• ARRAY formulas: These formulas perform calculations on multiple values simultaneously. They can be complex but offer powerful capabilities for
advanced data analysis.
• PivotTables: A dynamic tool for summarizing, analyzing, and presenting large volumes of data in a customizable format.
• Data Validation: Allows you to control the type of data entered into a cell, ensuring accuracy and consistency.
• Conditional Formatting: Formats cells based on specified conditions, making it easier to visually identify trends, patterns, and outliers in your data.
• Named Ranges: Assigning names to cells or ranges of cells can make formulas easier to read and maintain.
• Macros and VBA: For automating repetitive tasks or creating customized functions, Visual Basic for Applications (VBA) can be used to write scripts
within Excel.
These are just a few examples of the many advanced functions and features Excel offers. Experimenting with these tools and gradually incorporating them into
your workflow will help you become more proficient in Excel data analysis and manipulation.
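To make the lookup idea concrete, here is a rough Python analogue of VLOOKUP's exact-match behaviour (the table layout and the `prices` data are invented for the example):

```python
def vlookup(key, table, col_index):
    """Exact-match VLOOKUP analogue: find `key` in the first column of `table`
    and return the value from the 1-based `col_index` column of that row."""
    for row in table:
        if row[0] == key:
            return row[col_index - 1]
    return None  # Excel would show #N/A here

prices = [
    ("apple", 1.20, 30),   # item, unit price, stock
    ("banana", 0.50, 45),
]
print(vlookup("banana", prices, 2))  # 0.5
```

INDEX/MATCH generalizes this by letting the match column and the return column be chosen independently, which is why it is often preferred over VLOOKUP for tables whose layout may change.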
Data analytics encompasses a wide range of techniques for interpreting and extracting insights from data. Some common techniques include:
• Descriptive Analytics: Describing what has happened in the past, often using summary statistics, charts, and graphs.
• Diagnostic Analytics: Exploring data to understand why certain events happened, identifying patterns or correlations.
• Predictive Analytics: Forecasting future outcomes based on historical data and statistical algorithms, such as regression analysis or machine
learning models.
• Prescriptive Analytics: Recommending actions to optimize future outcomes based on predictive models and business constraints.
• Exploratory Data Analysis (EDA): Investigating data sets to summarize their main characteristics, often using visual methods like scatter plots or
histograms.
• Time Series Analysis: Analyzing data collected over time to identify trends, patterns, and seasonality.
• Cluster Analysis: Grouping similar observations together based on their characteristics, useful for segmentation and pattern recognition.
• Text Mining and Natural Language Processing (NLP): Extracting insights from unstructured text data, such as sentiment analysis, topic modeling,
and text classification.
• Machine Learning: Using algorithms to build models that can learn from data and make predictions or decisions without being explicitly
programmed.
• Data Visualization: Presenting data visually through charts, graphs, and dashboards to facilitate understanding and communication of insights.
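As a small illustration of descriptive analytics, Python's standard library can produce the usual summary statistics; the `monthly_sales` figures below are hypothetical data invented for the example:

```python
from statistics import mean, median, stdev

monthly_sales = [120, 135, 150, 160, 155, 170]  # hypothetical data

print("mean:", round(mean(monthly_sales), 2))    # 148.33
print("median:", median(monthly_sales))          # 152.5
print("stdev:", round(stdev(monthly_sales), 2))  # sample standard deviation
```

Descriptive statistics like these are usually the first step before the diagnostic, predictive, and prescriptive techniques listed above are applied.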
• Excel is a powerful tool for data analysis, and there are several functions and tools you can use for various analytical tasks. Here are
some commonly used ones:
• SUM, AVERAGE, MIN, MAX: Basic functions for summarizing numerical data.
• COUNT, COUNTA, COUNTIF: Counting functions for different types of data, including counting cells with values, non-empty cells, and cells
meeting specific criteria.
• IF, IFERROR: Conditional functions for applying logic to your data, allowing you to perform different calculations based on specified conditions.
• VLOOKUP, HLOOKUP, INDEX/MATCH: Functions for searching and retrieving data from a table based on certain criteria.
• PivotTables: Powerful tool for summarizing, analyzing, exploring, and presenting large amounts of data from different angles.
• Charts and Graphs: Excel offers various chart types for visualizing data, such as bar charts, line graphs, pie charts, etc., which can help identify
trends and patterns.
• Data Validation: Ensures that data entered into a cell meets certain criteria, helping maintain data integrity.
• Filters and Sorting: Quickly filter and sort data to focus on specific information or analyze it in different orders.
• Conditional Formatting: Apply formatting to cells based on specified conditions, making it easier to visually identify important data points.
• Data Analysis ToolPak: Excel add-in that provides advanced data analysis tools, including regression analysis, histograms, Fourier analysis, and
more.
These are just a few examples, but Excel offers a wide range of functions and tools for data analysis, making it a versatile choice for various analytical tasks.
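Functions such as COUNTIF and SUMIFS apply a criterion over a range of rows. A rough Python equivalent (the `orders` data and function names are invented for illustration) shows the underlying filter-then-aggregate pattern:

```python
orders = [
    {"region": "East", "amount": 200},
    {"region": "West", "amount": 150},
    {"region": "East", "amount": 300},
]

def countif(rows, predicate):
    """COUNTIF analogue: count the rows satisfying a condition."""
    return sum(1 for r in rows if predicate(r))

def sumifs(rows, field, predicate):
    """SUMIFS analogue: sum `field` over the rows satisfying a condition."""
    return sum(r[field] for r in rows if predicate(r))

print(countif(orders, lambda r: r["region"] == "East"))           # 2
print(sumifs(orders, "amount", lambda r: r["region"] == "East"))  # 500
```

In Excel the predicate is written as a criteria string (e.g. `"East"` or `">100"`) rather than a function, but the computation is the same.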