Data Analysis3
Data Analysis3
•Descriptive analytics answers: ‘What is the current prevalence of HIV across the
country?’
•Diagnostic analytics answers: ‘Why are HIV patients stopping to use medications l?‘
•Predictive analytics answers: 'Which HIV paients are at risk of stopping medications in
near future?‘
•Prescriptive analytics answers: ‘'What actions can be taken to reduce number of HIV
Patients stopping to use medications ?'
• 1. Descriptive analytics
• Descriptive analytics focuses on answering the question, ‘What is
happening?’ or ‘What has happened?’ by analyzing past data.
• Of all the types of data analytics, this is the most straightforward
approach as it summarizes and simplifies the main features and
characteristics of complex datasets through interactive visualizations.
• 2. Predictive analytics
• Predictive analytics uses historical data to answer the question, ‘What
may happen next?’ This model to predict future outcomes, find
patterns, and identify risks or growth opportunities.
• While descriptive analytics serves as a reflective mirror, showing us a
holistic picture of our past activities, predictive analytics acts as a
crystal ball, providing a sneak peek into the future.
• 3. Prescriptive analytics
• Unlike predictive analytics, which focuses on future outcomes,
prescriptive analytics helps decision-makers identify the best course
of action to help them achieve their business goals.
• The primary goal of this model is to answer the question: ‘What
should we do?’
• 4. Diagnostic analytics
• Diagnostic analytics examines past data to identify the root causes
behind a particular outcome. This type of analytics aims to answer the
question, ‘Why did this happen?’
• It focuses on uncovering insights into historical data patterns,
anomalies, and correlations to facilitate a deeper understanding of a
particular business problem.
DATA ENTRY
• Data entry is the process of digitizing data by entering it into a
computer system for organization and management purposes
• Data entry is often done with a keyboard and at times also using a
mouse,[7
• Although most data entered into a computer are stored in a database,
a significant amount is stored in a spreadsheet.[17] The use of
spreadsheets instead of databases for data entry can be traced to the
1979] although a manually-fed scanner may be involved.[8]
TYPES OF DATA ENTRY
• Manual data entry
• This type of data entry involves inputting data directly into online
forms or systems.it is commonly used for tasks such as online surveys
or customer registration.
DATA CLEANING
• Data cleaning refers to a process of fixing or removing incorrect ,
corrupted , incorrectly formatted , duplicate or incomplete data
within data set
• There several ways to find the centrality of a data. However, the most
popular ones are mean, mode and median. These three summarises
the distribution of the dataset.
• Mean
• Mean is used to calculate the numerical average of a dataset.
Arithmetic mean is calculated by adding all the values of the given
dataset and dividing it by the by number of items therein. The
mathematical formula is as follows:
x = ∑x/n
• Here, ‘∑’ represents ‘summation’
‘n’ represents ‘number of items’
• For example: consider the following heights of 10 men in centimeters
(cm): 165, 167, 169, 169, 171, 173, 175, 176, 176, 169
• The mean height is calculated by adding the heights for the ten men
and dividing the sum by 10.
Arithmetic mean = 165 + 167 + 169 + 169 + 171 + 173 + 175 + 176 + 176
+ 169 /10
x̄ = 1710/10 = 171 cm
Mode
sn 1 2 3 4 5 6 7 8
Value 24 34 38 46 47 53 53 61