0% found this document useful (0 votes)
41 views

Processing of Data

The document discusses the process of data processing and analysis. It explains that after data collection, the data must be processed by editing, coding, classifying, and tabulating it so that it can be analyzed. This processing involves examining the data for errors, assigning codes to responses, grouping the data into homogeneous categories based on common attributes or numerical ranges, and arranging the data in tables for further analysis and comparison. The principles of tabulation include giving tables a clear title and number, using brief column and row headings with measurement units indicated, and sources noted. This processing prepares the raw data for meaningful analysis and allows relationships and differences to be identified.

Uploaded by

Satyanshu Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
41 views

Processing of Data

The document discusses the process of data processing and analysis. It explains that after data collection, the data must be processed by editing, coding, classifying, and tabulating it so that it can be analyzed. This processing involves examining the data for errors, assigning codes to responses, grouping the data into homogeneous categories based on common attributes or numerical ranges, and arranging the data in tables for further analysis and comparison. The principles of tabulation include giving tables a clear title and number, using brief column and row headings with measurement units indicated, and sources noted. This processing prepares the raw data for meaningful analysis and allows relationships and differences to be identified.

Uploaded by

Satyanshu Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Processing of Data

DATA PROCESSING AND ANALYSIS

Data processing and Analysis

The data, after collection, has to be processed and analyzed in accordance


with the outline laid down for the purpose at the time of developing the
research plan. This is essential for a scientific study and for ensuring that we
have all relevant data for making contemplated comparisons and analysis.
Technically speaking, processing implies editing, coding, classification and
tabulation of collected data so that they are amenable to analysis. The term
analysis refers to the computation of certain measures along with searching
for patterns of relationship that exist among data-groups. Thus, “in the
process of analysis, relationships or differences supporting or conflicting with
original or new hypotheses should be subjected to statistical tests of
significance to determine with what validity data can be said to indicate any
conclusions.

Processing Operations

1. Editing: Editing of data is a process of examining the collected raw data


(specially in surveys) to detect errors and omissions and to correct these
when possible. As a matter of fact, editing involves a careful scrutiny of the
completed questionnaires and/or schedules. Editing is done to assure that the
data are accurate, consistent with other facts gathered, uniformly entered, as
completed as possible and have been well arranged to facilitate coding and
tabulation.

With regard to points or stages at which editing should be done, one can talk
of field editing and central editing. Field editing consists in the review of the
reporting forms by the investigator for completing (translating or rewriting)
what the latter has written in abbreviated and/or in illegible form at the time
of recording the respondents’ responses. This type of editing is necessary in
view of the fact that individual writing styles often can be difficult for others to
decipher.
Central editing should take place when all forms or schedules have been
completed and returned to the office. This type of editing implies that all forms
should get a thorough editing by a single editor in a small study and by a
team of editors in case of a large inquiry. Editor(s) may correct the obvious
errors such as an entry in the wrong place, entry recorded in months when it
should have been recorded in weeks, and the like.

2. Coding: Coding refers to the process of assigning numerals or other


symbols to answers so that responses can be put into a limited number of
categories or classes. Such classes should be appropriate to the research
problem under consideration. They must also possess the characteristic of
exhaustiveness (i.e., there must be a class for every data item) and also that
of mutual exclusively which means that a specific answer can be placed in
one and only one cell in a given category set.

Another rule to be observed is that of unidimensionality by which is meant


that every class is defined in terms of only one concept.

3. Classification: Most research studies result in a large volume of raw data


which must be reduced into homogeneous groups if we are to get meaningful
relationships. This fact necessitates classification of data which happens to be
the process of arranging data in groups or classes on the basis of common
characteristics. In this way the entire data get divided into a number of groups
or classes. Classification can be one of the following two types, depending
upon the nature of the phenomenon involved:

Classification according to attributes: As stated above, data are classified on


the basis of common characteristics which can either be descriptive (such as
literacy, sex, honesty, etc.) or numerical (such as weight, height, income, etc.).
Descriptive characteristics refer to qualitative phenomenon which cannot be
measured quantitatively; only their presence or absence in an individual item
can be noticed. Data obtained this way on the basis of certain attributes are
known as statistics of attributes and their classification is said to be
classification according to attributes.

Classification according to class-intervals: Unlike descriptive characteristics,


the numerical characteristics refer to quantitative phenomenon which can be
measured through some statistical units. Data relating to income, production,
age, weight, etc. come under this category. Such data are known as statistics
of variables and are classified on the basis of class intervals. For instance,
persons whose incomes, say, are within Rs 201 to Rs 400 can form one group,
those whose incomes are within Rs 401 to Rs 600 can form another group and
so on. In this way the entire data may be divided into a number of groups or
classes or what are usually called, ‘class-intervals.

4. Tabulation: When a mass of data has been assembled, it becomes


necessary for the researcher to arrange the same in some kind of concise and
logical order. This procedure is referred to as tabulation. Thus, tabulation is the
process of summarizing raw data and displaying the same in compact form
(i.e., in the form of statistical tables) for further analysis. In a broader sense,
tabulation is an orderly arrangement of data in columns and rows.

Tabulation is essential because of the following reasons.

1. It conserves space and reduces explanatory and descriptive statement to a


minimum.

2. It facilitates the process of comparison.

3. It facilitates the summation of items and the detection of errors and


omissions.

4. It provides a basis for various statistical computations.

Generally accepted principles of tabulation: Such principles of tabulation,


particularly of constructing statistical tables, can be briefly states as follows:

1. Every table should have a clear, concise and adequate title so as to make
the table intelligible without reference to the text and this title should always
be placed just above the body of the table.

2. Every table should be given a distinct number to facilitate easy reference.

3. The column headings (captions) and the row headings (stubs) of the table
should be clear and brief.

4. The units of measurement under each heading or sub-heading must


always be indicated.

5. Explanatory footnotes, if any, concerning the table should be placed directly


beneath the table, along with the reference symbols used in the table.
6. Source or sources from where the data in the table have been obtained
must be indicated just below the table.

7. Usually the columns are separated from one another by lines which make
the table more readable and attractive. Lines are always drawn at the top and
bottom of the table and below the captions.

8. There should be thick lines to separate the data under one class from the
data under another class and the lines separating the sub-divisions of the
classes should be comparatively thin lines.

9. The columns may be numbered to facilitate reference.

10. Those columns whose data are to be compared should be kept side by
side. Similarly, percentages and/or averages must also be kept close to the
data.

You might also like