Unit 2 RM Literature Survey Correct
Unit 2 RM Literature Survey Correct
SYLLABUS:
Importance of literature survey - Sources of information - Assessment of quality of journals and articles -
Information through internet. Effective literature studies, approaches, analysis, plagiarism, and research
ethics. Data - Preparing, Exploring, examining and displaying. 5 Hours
Ex: If a paper R as shown in figure 2.1 uses an information found previously in paper C published prior to
paper R, then:
R contains a reference to C
This ratio is published annually in Thompson Scientific Journal Citation Reports (JCR). The scientific JCR
impact factors are based on data from Journals indexed in Web of Science. Scopus and Web of Science
are global databases for discovering academic literature. They allow users to search for articles,
conference proceedings, trade publications, and book chapters on a topic. They also help to find author
information, such as ‘h’-index, and lists of publications.
The calculation of ‘h’-index: SCOPUS uses a measure called "h-index", which was developed by Hirsch in
2005 to evaluate the impact of journals. h-index is based on a list of publications ranked in descending
order by the citation count. The value of h is equal to the number of
papers (N) in the list that have N or more citations. One can obtain
the h-index using freely available databases-Google Scholar and
Publish or Perish, subscriber databases such as SCOPUS and Web of
Science. The h-index can also be used to evaluate the productivity and
impact of a group of scientists, in a Department or University or a
country. Science Citation Index (SCI) is owned by Thompson Reuters
and is called "Science Citation Index Expanded".
It covers more than 6500 world's leading and significant journals
across more than 150 disciplines and the selection process is very
rigorous. It is available through Web of Science database, a part of
Web of Knowledge collection databases.
A literature review is a comprehensive review and analysis of the published literature on a specific topic
or research question. The literature that is reviewed contains: books, articles, academic articles,
conference proceedings, association papers, and dissertations. It contains the most pertinent studies and
points to important past and current research and practices. It provides background and context, and
shows how your research will contribute to the field.
(Q: What should an editor do when encountering obviously incorrect entries in the data, and when is it
appropriate to contact the respondent for clarification?
Q: Explain: a. Coding rules. b. Spreadsheet data entry. c. Bar codes. d. Content analysis. e. Missing data.
f. Optical mark recognition.
Q: How should the researcher handle “don’t know” responses?)
Editors should correct obvious data errors, like mismatched time units or misplaced entries. Ex: When a
respondent clearly specifies time in days (Ex: 13) when it was requested in weeks (Ex: 4 or less) or data
is entered in the wrong place or when replies are inappropriate (out of range or not related to the question
asked) or missing. They can find the correct answer by reviewing other data, but this should be limited to
clear cases. If possible, better to contact the respondent for clarification.
Following rules guide editors in their work:
i. Understand the instructions for interviewers and coders.
ii. Do not erase or damage the original responses; they should stay readable.
iii. Use a distinctive color and standard format for all edits in the data.
iv. Initial any changes or added answers.
v. Include initials and the date of editing.
ii. Coding
It is the process of assigning numbers and other symbols to answers so that the responses can be grouped
into categories. In coding, categories are the partitions of a data set of a given variable Ex: if the variable
is gender, the partitions are male and female. Categorization is the process of using rules to partition a
body of data. Both closed- and open-response questions must be coded. Open-ended questions allow
Fig. 2.5 Sample Codebook of Questionnaire Items Fig. 2.6 Sample Questionnaire Items
Open-ended questions are more difficult to code since answers are not prepared in advance. Content
analysis is a systematic method for analyzing open-ended questions. It uses preselected sampling units to
produce frequency counts and other insights into data patterns. It is described as “a research technique
for the objective, systematic, and quantitative description of the manifest content of a communication.”
“Don’t know” replies: They are evaluated in light of the question’s nature and the respondent. While
many DKs are legitimate, some result from questions that are ambiguous or from an interviewing situation
that is not motivating. It is better to report DKs as a separate category.
Exploratory Data
Analysis (EDA)
includes following
methods for
displaying data:
i. Frequency tables
They array data from
lowest to highest
values with counts
and percentages.
They are most useful for inspecting the range of responses and their repeated occurrence.
Ex: Figure 2.8 shows a frequency table of the perceived minimum age for owning a social networking
account.
ii. Bar charts and pie charts
They are suitable for relative comparisons of nominal data.
Ex: The same data of minimum age for social networking are presented in figure 2.9 using a pie chart and
a bar chart. The values and percentages are more readily understood in this graphic format.
Figure 2.9 Nominal displays of minimum age for social networking data. i) Pie chart ii) Bar chart
iii. Histograms
They are used with continuous variables when it is possible to group the variable’s values into intervals.
They are useful for (1) displaying all intervals in a distribution, even those without observed values, and
(2) examining the shape of the distribution for skewness, kurtosis, and the modal pattern. Histograms can
help answer the questions: Is there a single mode? Are subgroups identifiable when multiple modes are
present? Are straggling data values (Outliers) detached from the central concentration?
Note: A histogram cannot be used for a nominal variable like minimum age for social networking (Figure
2.5) that has no order to its categories.
Figure 2.10 Average Annual Purchases of PrimeSell’s Top Figure 2.11 Histogram of PrimeSell’s Top
50 Customers. 50 Customers’ Average Annual Purchases
Ex: Figure 2.10 gives average annual purchases and figure 2.11 presents a histogram for same. The
midpoint for each interval for the variable of interest (average annual purchases) is shown on the
horizontal axis; the frequency or number of observations in each interval, on the vertical axis. The height
of the bar corresponds with the frequency of observations in the interval.
iv. Stem-and-leaf displays
v. Pareto diagrams
Pareto diagrams are named after a 19th-century Italian economist and are used in quality management
to illustrate the 80/20 rule, where 80% of quality
improvements can come from addressing 20% of
the causes. In these diagrams, responses are
sorted by importance, with bars arranged from
tallest to shortest.
Ex: Figure 2.13 shows MindWriter customer
complaints, highlighting that the top two issues
accounted for 80% of perceptions of inadequate
repair service.
vi. Boxplots
Boxplots convey a detailed picture of the distribution’s location, spread, shape, tail length, and outliers.
They use the five-number summary that consists of the median, the upper and lower quartiles, and the
largest and smallest observations.
2.4.2.2 Cross-tabulation
Cross-tabulation is used to examine relationships involving categorical variables. The tables used for this
purpose consist of cells and marginals (row and column total). The cells may contain combinations of
count, row, column, and total percentages. It serves as a framework for later statistical testing. Computer
software for cross-classification analysis allows for efficient table-based data visualization and decision-
References:
1. Ganesan R, Research Methodology for Engineers, MJP Publishers, Chennai. 2011
2. Cooper, Donald R. and Schindler, Pamela S., Business Research Methods, Tata McGraw-hill Publishing Company
Limited, New Delhi, India. 2012
(Note: Questions are given for illustrative purpose only)