0% found this document useful (0 votes)

8 views29 pages

Data Preparation

Uploaded by

Hanish verma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views29 pages

Data Preparation

Uploaded by

Hanish verma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 29

Guru Gobind Singh Indraprastha University

(GGSIPU)

Submitted in partial fulfilment of requirement of

Masters in Business Administration (MBA)

LAB FILE
OF
DATA PREPARATIONS AND EXPLORATION

Submitted to: Submitted by:

Dr. Shitika DIVYA
00416619824
DESCRIPTIVE_STATISTICS

EX 1
Q Given the following series of data on Gender and Height for 8 patients, fill in two frequency
tables one for each Variable, according to the model below. Then add a graph. Then create a
contingency table end describe the relation between Gender and Height using appropriate
statistical summaries.
id Height, cm Gender: 1=M, 2=F
1 165 M
2 157 F
3 168 F
4 178 M
5 171 F
6 182 M
7 182 M
8 153 F

modality Absolute freq. Percent freq. Cumulative freq.

SOLUTION 1
Variable Gender:

For Gender we DON'T compute cumulative frequencies as it is a non-ordered qualitative

variable. We will compute the cumulative frequencies for Height, which is quantitative (and thus
necessarily ordered) and continuous.
For Gender, an appropriate graph is with columns (or bars), i.e. with two separate rectangles one
for each modality, M and F, with height proportional to the percentage. It is in general correct to
use a vertical axis going from 0% to 100% in order to avoid a distorted perception of the
importance of the frequencies.

For Height, we consider a division in classes. Let us assume we know the minimum (140 cm)
and the maximum value (200 cm) (notice: a different choice of the extremes as well as of the
number and width of the classes will lead to results slightly different from the following):

The appropriate graphical representation is the histogram, drawn on a cartesian diagram, putting
the classes on the horizontal axis, and drawing for each class a rectangle with height equal to the
frequency density of the class, so that the area of the rectangle equals the frequency of the class.
(Notice the difference with the column chart used for qualitative variables: rectangles here are
contiguous and their area, not their height, is equal to the frequency)
Contingency table:

To describe the relation between Gender and Height, we can compute separately for calcoliamo
M and F the percentages for each age class: these are also called 'row profiles' or conditional
distributions of Height (conditional on Gender, i.e. "restricted to" a specific gender):
This table suggests that M are taller than F. Let us observe also that for Males the Mode is the
class 170 -| 200, while for Females the Mode is 140 - | 160.

EX 2
Q 186 patients were given a therapy for a certain disease; 122 had therapy A, the other 64 had
therapy B.
In group A, there were 37 responders (patients who had benefit from the therapy). In group B,
there were 32 responders. Which was the best treatment? How can we measure the advantage
with this treatment?
Among responders, how many had treatment B?
SOLUTION 2
We fill-in a table with the data given (grey cells) and we complete the missing cells:

(e.g. 85=122-37, 69=37+32. It is a good exercise to repeat what we read in the table, e.g. "there
were 122 patients in group A, 37 of them responded to treatment. In total, 189 patients were
treated, and 69 of them responded")
Now we compute the appropriate percentages:
The best treatment seems to be treatment B. It is intuitive, and it will be seen in the course, that
to compare the percentages in two groups it is useful to compute the ratio. This measure of
comparison of percentages or probabilities is called Risk Ratio (see Probability. The comparison
of numbers via a ratio is illustrated in the Appendix I, among Prerequisites):
RR=50/30.3=1.65
Thus, B has a response percentage superior by 65% with respect to treatment A.
Among responders, the percent of those who got treatment B was:
32/69=46,4%
This percentage is obtained by considering the column profile, or in other terms the distribution
of Treatment conditional on Response equal to Yes.

EX 3
Q Compute the mean and the median class for the following distribution of the number of nurses
in 23 medical institutes.
nurses n
1 – 10 6
11 – 20 13
21 – 40 4
23

SOLUTION 3
The variable Number of Nurses observed in a sample of 23 medical institutes (statistical units) is
a quantitative discrete variable, which we can treat as if it was continuous since it has many
modalities (the numbers from 1 to 40); in fact, the distribution is described by frequencies in
classes. (Let us remark that usually the classes should be contiguous while here there are gaps in
between: this is due to the discrete nature of the variable. This fact has no consequences in our
exercise but it could be annoying when making a graphical representation. For example, in order
to make a representation using a histogram, e.g. the class 1-10 should be considered as 1 |- 11; if
we adopt this convention in our exercise, the mid-values are different from the ones used below)
To compute the mean, we chose a representative value for each class: we take the value in the
middle, computed as (lower limit + upper limit)/2. The total amount of nurses for each class is
then found as this mid-value times the frequency. The mean is the total amount among the
classes divided by the total sample size (the number of statistical units) which is 23.
To identify which is the median class, or in other terms the class that contains the median, we use
the cumulative frequencies.

Mean=365.5 / 23 = 15.5
Median: modality that occupies the rank 12. Looking at the column of cumulative frequencies, it
belongs to the class 11-20 (in fact, the first class includes only the first 6 units; as another
example, the modality that occupies the 20th rank belongs to the class 21-40)

EX 4
Q We know the values of haemoglobin for 6 patients before and after a course of chemotherapy:
we wish to compute the mean reduction. What is the relation between the latter and the means of
the values "before" and "after"?
before after
13.0 9.4
12.8 11.5
11.0 11.5
13.2 13.1
12.5 10.2
11.9 12.0

SOLUTION 4
The Reduction is the difference between the value Before and the value After; in some case a
reduction is negative, this happens since the variable X (haemoglobin) has actually increased.
We can compute the reduction for each of the six patients (statistical units) and then compute
their average. Another way to compute the mean reduction is by using the property of
LINEARITY: given the variables X and Z, if we apply a linear transform such as
Y=aX+bZ
it is always true that mean(Y)=a·mean(X)+b·mean(Z).
In our exercise, a=1 and b= (-1), and thus the mean of the difference is the difference of the
means: mean (Before-After) = mean (Before)-mean (After). All computations are in the table
below.
Notice: the demonstration of such properties will not be a subject of the course tests, as in
general the theoretical results are not part of the knowledge required to the students. This
exercise is illustrated as a complement to the classes.
Another exercise in this document uses again the property of linearity, in the version:
mean(a+bx) =a+bx. Think of a situation when it could be useful to compute the mean of a+bX
when only a, b and mean(X) are known.

EX 5
Q A certain treatment is used in two different centres, A and B; patients in centre A were 25 and
were on average 54 years old; patients treated in centre B were 62 and had mean age equal to 58
years. What is the overall mean among all patients who got the treatment?
SOLUTION 5
We need to compute a weighted average, i.e. the average of two means (54 and 58) weighted by
the size of the two groups (25 and 62).
overall Mean = (54·25 + 58·62) / (25+62) = 4946 / 87 = 56.85

EX 6
Q Pregnant women (within month 4) who are being followed-up by a nutritionist had weights
(kg) equal to: 64.3; 65.2; 70.0; 54.5; 58.8; 81.5; 61.0; 62.0. What was the mean? and the median?
Do data suggest a strong skewness of the distribution of the Weight?
SOLTUION 6
We can sort the observed values and identify the values that occupies the position (rank) 4 and 5
(since we have 8 statistical units - pregnant women).
A slightly different way of illustrating this same procedure is assigning to each value the
corresponding rank in the following table (in other terms, we avoid to write down the values
sorted, but we need to sort to assign the rank!):

Sum of values = 517.3 Mean = 517.3 / 8 = 64.66

Central values (look at ranks 4 and 5): 62 e 64.3 Median = (62 + 64.3) / 2 = 63.15
Since Mean and Median are not very far from each other, the data don't suggest that the variable
Weight has a strongly skewed distribution.
EX 7
The following data regard 10 male adults; we consider age, value of FEV1 (Forced Espiratory
volume in 1 second) and diastolic pressure. Compute median and standard deviation of the three
variables. Then tell what is the variable with higher variation, using an appropriate measure for
comparisons.
Age FEV1 pressure
25 2.5 85
32 1.8 71
28 1.5 92
21 2.5 80
33 4.5 87
33 2.1 83
34 3.4 70
24 1.2 101
41 2.8 90
26 3.9 83

SOLUTION 7
We have 3 quantitative continuous variables. Mean (arithmetic mean) and standard deviation
provide a synthesis of position and variability. The mean is computed as the sum of all values
divided by 10 (n=10 sample size). Let us apply the "fast" formula for computing the standard
deviation. Computations are reported in the table.
To compare variability of these three variables it is NOT sufficient to look at standard deviations,
that by the way are expressed in different units of measurement and describe variables with
different nature! We must express variability in relative terms with respect to the mean, using the
coefficient of variation. The variable with highest variability is FEV1, 4 times more variable than
Pressure and 2 times more variable than Age (notice that FEV1 had apparently the smallest value
for the standard deviation ...)
EX 8
The Age quartiles in a sample of participants in a clinical trial were respectively 27, 41 e 59.
a) This means that:
o 1 out of 4 was younger than … years o 1
out of 4 was older than … years o 2 out of 4
were between … and … years old o half of
them was more than … years old
b) Additionally, we know that mean and standard deviation were respectively equal to 42 and 12.
Can we say whether the distribution was approximately Normal or not?
c) Which index of position is appropriate to give a synthetic description of the distribution?

SOLUTION 8

Point a):
1 out of 4 was younger than 27 years: this is the definition of first quartile, ¼=25% of observed
values was lower than Q1=27
1 out of 4 was older than 59 years: similarly, this is the definition of third quartile, ¾=75%
of observations were lower than Q3, and the other 25% was larger than Q3=59
2 out of 4 are …. We can claim: "between 0 years and the median 41 years", but also
"between Q1 and Q3" and also "between the median 41 and the maximum age" (but we don0t
know the latter).
Half of them was more than 41 years old: this comes from the definition of the Median.
b) First, we notice that the mean is 42 and it is very close to the median, in fact their
distance (equal to 1) is small (1/12) compared to the standard deviation. Thus, the observed
distribution is rather symmetric. But the Normal is not the only kind of symmetric distribution;
we can go further by looking at the quartiles. In a normal curve the first and third quartile should
be at a specific distance from the mean, given by 0.67 times the standard deviation, thus
0.67·12=8. Thus, if the observed distribution was approximately Normal the first and third
quartiles were expected to be 34 e 50. Our observed quartiles are instead 27 and 59, rather distant
from the ones of a normal distribution with the same mean and standard deviation. Thus, our
distribution is NOT Normal-shaped; it was symmetric but not bell-shaped; it could have been a
distribution with very high tails and few observations in the middle, possibly a distribution with
two modes.
Given what we just remarked, neither the mean nor the median are good indexes to describe the
distribution; if it was bi-modal, we should use the two modes, and if we could recognize the
presence of two subpopulations, we should use the means or the medians of these
subpopulations.

EX 9
Consider the 6 patients with values of haemoglobin before and after chemotherapy of EX 4. We
computed the means: respectively 12.40 and 11.28 - and thus 1.12 for the reduction.
Now compute the standard deviation for the variable Before, After and for the Reduction (Before
- After): does the linearity property hold?
SOLUTION 9
The computation of the standard deviations for Before and After is let for the student; the results
are 0.822912 and 1.313646 respectively. For the Reduction, using the "fast" formula:
The variance is:

and the standard deviation is its squared root: 1.595514

Thus, for the standard deviation linearity does not hold - this is because its calculation requires
computing the power 2 of the values and the squared root, and these operations do not follow
linearity: (a + bx)2 ≠ a2 + bx2
We now give the answer to the question of EX 4: using the linearity for the mean is useful for
example when the values are transformed into another unit of measurements of the kind y = a +
bx.
For example, a mean temperature is expressed in Fahrenheit degrees and we want to express it
in Celsius degrees. This won't be possible for the standard deviation!

EX 10
The weight distribution of a sample of adults with physical inabilities is approximately Normal,
with mean 72 and standard deviation 8. Find an interval of values around the mean such that:
a) includes 95% of the observed values
b) includes almost all observed values (and thus coincides with the range, min-max)
c) includes 50% of the observed values

SOLUTION 10
we use the properties of each Normal distribution.
In an interval given by mean ± 2·st.dev. we have about 95% of the values (more precisely, we
should use 1.96 as a factor instead of 2). This answers to question a). similarly for question b)
we shall compute the interval with radius 3·st.dev. which includes 99.7% of values: a) 72 ± 2·8
= (56,88)
b) 72 ± 3·8 = (48,96)
For the last point, let us remark that an interval around the mean=median which includes 50% of
observations is by definition delimited by the first and third quartile (Q1, Q3), thus we can
compute the limits as:
c) 72 ± 0.67·8 = (66.64,77.36)
ASSIGNMENT-1 (date 27-8)

1. Generate 100 random nos. between 25 to 100

2. Create a table with the following headers

a. Employee ID
b. Department (H.R, Accounts, Admin)
c. Date of Joining [YYYY-MM-DD]
d. Salary before tax
e. Income tax
i. If salary is less than 2 lakhs, income tax charged should be 5% of salary
ii. If salary is in between 2lakhs to 10 lakhs, income tax charged should be
10% of salary
iii. If salary is more than 10 lakhs, income tax charged should be 30% of
salary

1) Find out the average salaries offered by the company.

2) Also find the Count of employees in each department along with the total income tax
paid by them.

3) It should also highlight the employee ID with maximum earnings/ salary.

Assignment-2 (date 30-8)

You have decided to use Pivot Table to analyse your data and see how business can be improved.

1. How much in $ did you sell in each city?

2. Per each home appliance item, how many items were sold by your company?
3. Now create a table which shows a breakdown of total revenue from each of the items
sold (in the rows), and the regions each item was sold (in columns).

4. Now, based on your last Pivot Table, apply filter so only Microwave, Oven and
Refrigerators will show in "Item".
Additionally, remove NA from the "Region" field.

5. Now, show the Average Discount % per Sales Person. Apply an external filter that
will show only sales made in Columbia. Sort the results in Ascending Order. Who's
the best sales agent.
6. Create a report detailing the $ Sales by Region, Country and Store, in the following
format:
---------
Assignment-2 (date 27-9)
SPSS

3.
4.
EXCEL CASES SPORTS DAY

Q. SPORTSMEN DATA STANDARDIZATION

Q. SPORTSMEN DATA FORMATTING
Q SUMMARIZE THE DATA

1 Introduction To Biostatistics
100% (3)
1 Introduction To Biostatistics
52 pages
Programme Guide PGDAST 2022
No ratings yet
Programme Guide PGDAST 2022
52 pages
MCQs Statistic Master Revision
No ratings yet
MCQs Statistic Master Revision
10 pages
Topic 1 - W1-3 Introduction To Biostatistics
No ratings yet
Topic 1 - W1-3 Introduction To Biostatistics
52 pages
Biostatics and Epidemiology 2022 1
No ratings yet
Biostatics and Epidemiology 2022 1
17 pages
SCM Assignment 3 - Cost Estimation
No ratings yet
SCM Assignment 3 - Cost Estimation
12 pages
7 Measures of Central Tendency Data Location and Variability
No ratings yet
7 Measures of Central Tendency Data Location and Variability
61 pages
Introduction To Statistics and SPSS
100% (1)
Introduction To Statistics and SPSS
110 pages
In-House Validation of Analytical Methods: Procedures & Calculation Sheets
No ratings yet
In-House Validation of Analytical Methods: Procedures & Calculation Sheets
21 pages
Template - MATHEMATICS AS A TOOL (Measures of Central Tendency)
100% (1)
Template - MATHEMATICS AS A TOOL (Measures of Central Tendency)
11 pages
Handout - INTRO TO BIOEPI
No ratings yet
Handout - INTRO TO BIOEPI
22 pages
Blocking Factors
100% (1)
Blocking Factors
56 pages
Epidemiology Notes
No ratings yet
Epidemiology Notes
17 pages
Biostatistics Teaching
No ratings yet
Biostatistics Teaching
283 pages
Indicator Variables: Variable or Dummy Variables
No ratings yet
Indicator Variables: Variable or Dummy Variables
11 pages
DA1-Types of Data
100% (1)
DA1-Types of Data
17 pages
Descriptive Statistics Tutorial 1 Solutions
No ratings yet
Descriptive Statistics Tutorial 1 Solutions
5 pages
Statistics Introduction
No ratings yet
Statistics Introduction
55 pages
0 Ppt1 Introduction To Biostatistics123
No ratings yet
0 Ppt1 Introduction To Biostatistics123
59 pages
Contact Details:: Dr. Joy C. Chavez
No ratings yet
Contact Details:: Dr. Joy C. Chavez
101 pages
Contact Details:: Dr. Joy C. Chavez
No ratings yet
Contact Details:: Dr. Joy C. Chavez
54 pages
Bio-Statistics and RD Lecture Note
No ratings yet
Bio-Statistics and RD Lecture Note
176 pages
Biostatistics CN
No ratings yet
Biostatistics CN
79 pages
Case Solution On Agricomp
No ratings yet
Case Solution On Agricomp
10 pages
Probability and Statistics: BY Engr. Jorge P. Bautista
No ratings yet
Probability and Statistics: BY Engr. Jorge P. Bautista
172 pages
IP 200 L Biostatistics
No ratings yet
IP 200 L Biostatistics
86 pages
One-Way ANOVA Two-Way ANOVA
No ratings yet
One-Way ANOVA Two-Way ANOVA
31 pages
STAT1008 Cheat Sheet
100% (1)
STAT1008 Cheat Sheet
1 page
احصاء حيوي
No ratings yet
احصاء حيوي
37 pages
Basic Concepts in Biostatistics-1
No ratings yet
Basic Concepts in Biostatistics-1
40 pages
Stat1 Formulas and Tables For Statistics 2022
No ratings yet
Stat1 Formulas and Tables For Statistics 2022
34 pages
Biostatistics
No ratings yet
Biostatistics
56 pages
Biostatistics
No ratings yet
Biostatistics
56 pages
Review of Sessions 1-7 PUBH 614 Spring 2019
No ratings yet
Review of Sessions 1-7 PUBH 614 Spring 2019
68 pages
Statistics For Everyone Workshop Fall 2010
No ratings yet
Statistics For Everyone Workshop Fall 2010
47 pages
Final Lecture Measures of Central Tendency
No ratings yet
Final Lecture Measures of Central Tendency
38 pages
Basics of Statistics Unit-I SCLS
No ratings yet
Basics of Statistics Unit-I SCLS
135 pages
MODULE 1 Introduction To BIOSTAT
No ratings yet
MODULE 1 Introduction To BIOSTAT
49 pages
MATH103 M2 Data Presentation
No ratings yet
MATH103 M2 Data Presentation
43 pages
Data Presentation
No ratings yet
Data Presentation
67 pages
Basics of Statistics Unit-I SCLS
No ratings yet
Basics of Statistics Unit-I SCLS
127 pages
01 - Introduction
No ratings yet
01 - Introduction
27 pages
Final Defense
No ratings yet
Final Defense
25 pages
Introduction To Biostatistics and
No ratings yet
Introduction To Biostatistics and
114 pages
Section 1: Organizing Data: Line List Line Listing
No ratings yet
Section 1: Organizing Data: Line List Line Listing
45 pages
Summry Biostatstics
No ratings yet
Summry Biostatstics
32 pages
3
No ratings yet
3
46 pages
Biostatistics Module Sep2023 240520 122333
No ratings yet
Biostatistics Module Sep2023 240520 122333
65 pages
Statistics
No ratings yet
Statistics
47 pages
02-Descriptive Statistics of Numerical Data - 52
No ratings yet
02-Descriptive Statistics of Numerical Data - 52
61 pages
QUANTIFYING 315 Week 7
No ratings yet
QUANTIFYING 315 Week 7
63 pages
Assignment 2-Data Analysis and Report Writing
No ratings yet
Assignment 2-Data Analysis and Report Writing
2 pages
2.4 General Epidemiological Measures
No ratings yet
2.4 General Epidemiological Measures
32 pages
Bio Statistics
No ratings yet
Bio Statistics
56 pages
Chapter 02 Regression - Ongoing
No ratings yet
Chapter 02 Regression - Ongoing
57 pages
3. measures of central tedency - ١١١٨٠٦
No ratings yet
3. measures of central tedency - ١١١٨٠٦
29 pages
5 Marks Statistics
No ratings yet
5 Marks Statistics
43 pages
1 Introduction To Biostatistics
No ratings yet
1 Introduction To Biostatistics
30 pages
Chap 14 - Statistical Process Control
No ratings yet
Chap 14 - Statistical Process Control
38 pages
Module 2 MIDTERM StatProb
No ratings yet
Module 2 MIDTERM StatProb
23 pages
Data Processing and Analysis
No ratings yet
Data Processing and Analysis
11 pages
Chapter 3
No ratings yet
Chapter 3
45 pages
Hoare Hoe - Statistics 2 Final
No ratings yet
Hoare Hoe - Statistics 2 Final
18 pages
Q1 MCQ
No ratings yet
Q1 MCQ
25 pages
17 Describing The Form Shape Trend Direction and Variation Strength Based On A Scatter Plot
No ratings yet
17 Describing The Form Shape Trend Direction and Variation Strength Based On A Scatter Plot
20 pages
Methodology: Fig 3.1: Geographical Location of San Vicente, Kapatagan, Lanao Del Norte
No ratings yet
Methodology: Fig 3.1: Geographical Location of San Vicente, Kapatagan, Lanao Del Norte
15 pages
4.introduction To Biostatistics
No ratings yet
4.introduction To Biostatistics
30 pages
ML Labmanual
No ratings yet
ML Labmanual
33 pages
2 - Measures of Centrel Tendency
No ratings yet
2 - Measures of Centrel Tendency
35 pages
1 - Practice Exercise 1 Data Descriptives
No ratings yet
1 - Practice Exercise 1 Data Descriptives
10 pages
BOT 315 Slide
No ratings yet
BOT 315 Slide
20 pages
Lecture 1 NOTES Variables and Distributions 2020-21
No ratings yet
Lecture 1 NOTES Variables and Distributions 2020-21
11 pages
Statistical Method Lecture Note
No ratings yet
Statistical Method Lecture Note
13 pages
LO1 - Prepare For The Application of Health Survey.
No ratings yet
LO1 - Prepare For The Application of Health Survey.
31 pages
Two-Way ANOVA Final Notes
No ratings yet
Two-Way ANOVA Final Notes
10 pages
Document 1
No ratings yet
Document 1
6 pages
Biostatistics Practical Answers - 2023
No ratings yet
Biostatistics Practical Answers - 2023
11 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Contrast Sensitivity Measures by Two Diferent Test
No ratings yet
Contrast Sensitivity Measures by Two Diferent Test
8 pages
Kahan Et Al 2015 - Risk of Selection Bias in Randomised Trials
No ratings yet
Kahan Et Al 2015 - Risk of Selection Bias in Randomised Trials
8 pages
Anne Bio-Stat Assignment Edited
No ratings yet
Anne Bio-Stat Assignment Edited
9 pages
GKPotazo - BSN-1A - Module 3 - Lesson 4 Hands On
No ratings yet
GKPotazo - BSN-1A - Module 3 - Lesson 4 Hands On
4 pages
Quantitative Techniques For Disaster Managment
No ratings yet
Quantitative Techniques For Disaster Managment
5 pages
Sample Size in Animal Studies
No ratings yet
Sample Size in Animal Studies
4 pages
Statistics Ready Reckoner
No ratings yet
Statistics Ready Reckoner
4 pages
1 Introduct
No ratings yet
1 Introduct
9 pages
Machine Learning With R
No ratings yet
Machine Learning With R
2 pages
Stats
No ratings yet
Stats
2 pages
Variance of Sample Variance
No ratings yet
Variance of Sample Variance
0 pages
Econometrics: A Simple Introduction
From Everand
Econometrics: A Simple Introduction
K.H. Erickson
3.5/5 (5)

Data Preparation

Uploaded by

Data Preparation

Uploaded by

Guru Gobind Singh Indraprastha University

Submitted in partial fulfilment of requirement of

Submitted to: Submitted by:

modality Absolute freq. Percent freq. Cumulative freq.

For Gender we DON'T compute cumulative frequencies as it is a non-ordered qualitative

Sum of values = 517.3 Mean = 517.3 / 8 = 64.66

and the standard deviation is its squared root: 1.595514

1. Generate 100 random nos. between 25 to 100

2. Create a table with the following headers

1) Find out the average salaries offered by the company.

3) It should also highlight the employee ID with maximum earnings/ salary.

1. How much in $ did you sell in each city?

Q. SPORTSMEN DATA STANDARDIZATION

You might also like