Secondary Data Analysis Guidebook 21
Secondary Data Analysis Guidebook 21
For some topics and in some circumstances collecting primary data can be difficult or
inappropriate. Below we outline and approach to developing a research report using secondary
data, that is existing data that can be used by you to answer you research question(s). It is
important to remember when reading in this area that the word “text” will be used a lot. This comes
from (amongst other sources) philosophers such as Derrida and Deleuze, however, when using
the word “text” we mean “written”, spoken and “symbolic”. So in these terms analysing a text can
e.g. mean analysing a recording of an interview.
Content Analysis
1. Content analysis identifies patterns in recorded (written or spoken or graphic) communication.
To conduct content analysis, you systematically collect data from a set of texts, which can be
written, oral, or graphic:
Books, newspapers and magazines.
Speeches and interviews.
Web content and social media posts.
Photographs and films.
2. Content analysis can be quantitative (focused on counting and measuring) and/or qualitative
(focused on interpreting and understanding). In both types, you categorize or “code” words,
themes, and concepts within the texts and then analyse the results.
3. Quantitative Content analysis can be used to quantify the occurrence of certain words,
phrases, subjects or concepts in a set of historical or contemporary texts. To research the
importance of Sustainability in project reports, you could analyse the reports for the frequency
of terms related to environmental responsibilities and use statistical analysis to find differences
over time or between firms.
4. Content analysis can be used to make qualitative inferences by analysing the meaning and
semantic relationship of words and concepts. To gain a more qualitative understanding of
Sustainability issues in projects, you could locate the word “sustainability ” in project reports,
identify what other words or phrases appear next to it (such as “social”, “community”, “ethics”,
“procurement” etc.), and analyse the meanings of these relationships to better understand the
intentions of different stakeholders or their understanding of what sustainability means to them.
Select the content you will analyse, based on your research question. You need to decide:
The medium (e.g. newspapers, speeches or websites) and genre (e.g. opinion pieces,
political campaign speeches, marketing copy …).
The criteria for inclusion (e.g. newspaper articles that mention a particular event,
speeches by a certain politician or websites selling a specific type of product …).
The parameters (or limits of what is included) in terms of date range, location, etc.
If there are only a small number of texts that meet your criteria, you might analyse all of
them. If there is a large volume of texts, you can select a sample.
Define the units and categories of analysis – e.g. is the unit of analysis each individual
politician (so individuals) or politicians from each of the political parties (so parties).
Next, you need to determine the level at which you will analyse your chosen texts. This means
defining:
The unit(s) of meaning that will be coded. For example, are you going to record the
frequency of individual words and phrases, the characteristics of people who produced
or appear in the texts, the presence and positioning of images or the treatment of
themes and concepts that fit with your coding system?
The set of categories that you will use for coding. Categories can be objective
characteristics (e.g. aged 40-50, lawyer, …) or more conceptual (e.g. trustworthy,
corrupt, family oriented …).
Once coding is complete, the collected data is examined to find patterns and draw
conclusions in response to your research question.
In quantitative content analysis, you can use basic descriptive tables to show frequencies
of coded categories. Further, you might use statistical analysis to find correlations or
trends, discuss your interpretations of what the results mean, and make inferences about
the creators, context and audience of the texts.
In qualitative content analysis, you need to produce a coding tree to show your
first-order/level, second-order/level codes and final main themes emerged.
Discourse Analysis
9. Discourse analysis is a research method for studying written or spoken language in relation to
its social context. It aims to understand how language is used in real life situations. Discourse
analysis is a qualitative and interpretive method of analysing texts (in contrast to more
systematic methods like content analysis). You make interpretations based on both the details
of the material itself and on contextual knowledge.
Grant et al. (2005) quote Fairclough and Wodak (1997, p. 277) when considering the
contribution of discourse analysis to an understanding of organizational change:
In other words, language does not have a fixed, objective meaning, but its meaning is changed
or effected by a whole range of contextual factors: the author's belief system, the surrounding
political, economic and social context, any professional community to which the person belongs
– which will have its own jargon (as in medical or legal) – as well as the immediate situation in
which the words were uttered.
A simple example of this might be that if you say “How are you doing.” to an English person
they will probably assume you mean “Hello.”. If you say “How are you doing.” to a French
person they will probably assume you are asking about their health and may give you a list of
their ailments. The words are the same but in different cultural contexts they have different
meanings. Again a simple example is that if you are visiting an English person who is a patient
in a hospital and you say “How are you doing?” they will probably assume you are asking about
their health and may give you a list of their ailments. The same words will be interpreted
differently because of a different social context.