Open Coding
Open Coding
Introduction
We need to give names to our ideas and concepts to define, analyze and share with others. O e it s
defined, we can begin to examine them comparatively and ask questions to systematically specify the
states and to imply possible relations with others. It s also important that we name our concepts
appropriately; because people act toward things based on the meaning those things have for them;
and these meanings are derived from social interaction and modified through interpretation. [1]
To build concepts from a textual data source, we need to open up the text and expose the meaning, idea
and thoughts in it. One of the processes of analyzing textual content is Open Coding. Open Coding
includes labeling concepts, defining and developing categories based on their properties and
dimensions. It is used to analyze qualitative data and part of many Qualitative Data Analysis
methodologies like Grounded Theory.
In general, noticing means taking notes based on observation, recording events or interviews, gathering
documents etc. In the analysis phase, when you are going through the data you often mark important
sections and add descriptive name or code to it. This is the first step of coding and called Open Coding.
This article begins with a description of Open Coding and provides walk through on different open
coding techniques.
Building Concepts
The first step in qualitative data analysis is to go through the data (i.e. text) to break down in to pieces to
examine closely, compare for relations, similarities and dissimilarities. Different parts of the data are
arked ith appropriate la els or odes to ide tif the for further analysis.
A concept is a labeled section of data that a researcher identifies as significant to some facts that data
represent. Concepts are abstract representations of events, objects, actions or interactions and they
allow researchers to group similar information to better understand the data.
Concepts can be of various types; communication, storm or private company includes example of
concepts. Concepts may incite certain natural imagery as they have their own properties. For example,
we can think of data set representing telephone conversations between two participants and we can
label them as Telephone Co u i atio . So a labeled thing is something that can be location and
placed in a class of similar objects. Anything under a classification has one or more familiar properties or
characteristics; like sending information is a property of communication. It is important to understand
that concepts can be classified differently, it depends on the different properties of data the researcher
is focusing on and how he/she is translating them.
While analyzing the data, we sometimes get events or objects with common characteristics yet other
properties may separate them. We can use the common properties to group them under same concept.
Different researchers may think of different names from the same data set but in general, it should be
based on the context.
Following is a partial transcript of an interview with women in her early 20s and is about drug use by
teens. The interviewer did not have preset questions to ask. He continued his questions based on the
i ter ie ee s respo se.
As you can see in the above interview transcript, we have grouped the similar information using abstract
labels (i.e. drug talk). Some of the names for the labels are selected directly from the data (i.e. hard-core
use). This process of going through line by line data to assign codes is called line-by-line coding.
Memo: The first thi g that strikes e i this se te e is the ork use . This is a stra ge ter e ause,
when taken out of the context of drug taking, the work means that an object or a person is being
employed for some purpose. It implies a willful and directed act. In making a comparison, when I think
about a computer, I think about employing it to accomplish a task. I think of it as being at my disposal.
Source: Basics of Qualitative Research, Second Edition by Anselm Strauss & Juliet Corbin [6]
Glaser (1978) offered guidelines for preparing effective memos to generate substantive theory including
the following [3]:
Defining Categories
As we continue to create codes for new concepts, it s not unexpected to come to a point when we will
have more than few pages of codes. At that stage, we should analyze the codes to find the similarities
and group them into categories based on their common properties. We may also consider dimensions of
the codes that represent the location of the property along a continuum or range. The name of the
category can be different from the codes to express its scope better and if necessary, we can also create
sub-categories from the codes then link to categories.
Fro the a o e set of odes, e a group the o epts: E ail , Telepho e Co ersatio , Te t
message & Voi e Mail i to a ategor a d a e it Co u i atio .
Group conversation helps to take important decisions (i.e. single out phenomena for coding,
decide which existing concepts to use for coding or when to create new concept) [Berlin]
Concept definitions become more exact and differentiations get more precise
The data perspective is maintained more consistently
Generally, more number of phenomena are discovered and processed.
Use of Open Coding
Open Coding is generally the initial stage of Qualitative Data Analysis. After completing the Open Coding,
depending on the methodology we use, we can do Axial Coding and Selective Coding. At later stage of
the research, these coding help us to build theories in an inductive process (i.e. Grounded Theory). Open
Coding can be used with inductive, deductive or verification modes of inquiry too.
Exercise
No that e k o ho to do Ope Codi g, let s tr to use it. Following is part of an interview transcript
with a woman in her 20s and is about drug use by teens. We would like to use open coding to analyze
the data.
Interviewer: You say you did drugs for the experience. Do kids talk about experience?
Respondent: it’s ore of shari g the e perie e rather tha talki g a out the e perie e. You talk a out
doi g drugs ore tha hat it’s like he ou take drugs. It depe ds upon what level you are into it, I
guess. Most kids are doing it because it is a trend in high school. They are not doing it because of the
experience in some higher sense. They are doing it because they are following the crowd.
Source: Basics of Qualitative Research, Second Edition by Anselm Strauss & Juliet Corbin [6]
So, how are we going to do that? One option is to printout the content, high light the important
concepts and write the codes. But a manual open coding approach is not a good process especially when
we have to deal with large amount of data. As you are limited to search by reading only, it can also
cause unwanted errors. This process is simply impractical for a large scale data analysis with open
coding.
We can use software applications like Saturate [4] or Atlas.ti [5] to do Open Coding. Atlas.ti is a
commercial qualitative data analysis tool and Saturate is a free web based tool developed by Dr. Sillito
from University of Calgary. We will use this tool in our exercise. A video tutorial for the tool can be
found here [4].
No that e ha e soft are a d k o ho to use it, let s start uildi g o epts. Take a look i to the
response: I think teens use drugs as a release from their parents . This looks like an act of rebellion. So
e a ode it as re ellious a t . The ter use looks like ea i g so ethi g ore. If e takeout the
context of drug use for a second and think about it – it may mean that they are being used for some
other reason which e are ot sure at this state. “o, e should take a ote Me o for future
reference. We have to continue to analyze the data line-by-line and add codes as necessary as long as
we find significantly new concepts. Tools like Saturate can help us in both improving the efficiency and
better manage the data.
As we continue on coding the data, we may find similar concepts and can classify them under common
o epts i.e. drug talk , egati e o e tio . Following code snippet shows a part of the interview
transcript with codes.
Respondent: I think teens use drugs as a release fro their pare ts [ rebellious act ]. Well, I do ’t k o . I a o l
talk for self. For e, it as a e perie e [ experience ] [i i o ode]. You hear a lot a out drugs [ drug talk ].
You hear the are ad for ou [ negative connotation to the drug talk ]. There is a lot of the arou d
[ available supply ]. You just get i to the e ause the ’re a essi le [ easy access ] a d e ause it’s ki d of a
e thi g [ novel experience ]. It’s ool! You k o , it’s so ethi g that is ad for ou, ta oo, a o [ negative
connection ]. E er o e is agai st it [ adult negative stance ]. If ou are a tee ager, the first thi g ou are goi g to
do is try the [ challenge the adult negative stance ].
In the process of line-by-line coding, we will soon be able to group the concepts into categories like
drug use for o epts like hard- ore use a d soft ore . Once we start getting too many old concepts,
we can stop labeling and move on to next step (i.e. selective coding, axial coding) based on our research
methodology.
Problems
Although Ope Codi g is a i porta t tool for Qualitati e Data A al sis ut it s also a er ti e
o su i g a d tedious ork. “o eti es it s hard decide when to stop line-by-line coding and if the
researcher misses any important concept, he/she may have to restart the boring task again.
Annotated Bibliography:
John V. Seidel. Qualitative Data Analysis [2]
This document was originally part of the manual for the Ethnograph v4. It explains the process of Open
Coding and also Qualitative Data Analysis in a broad sense.
References
[1] Symbolic Interactionism. Bulmer H. (1969) [Link to Google Books]
[3] Page 582, Nursing research: principles and methods by Denise F. Polit, Cheryl Tatano Beck
[Link to Google Books]
[4] Saturate, a web-based Open Coding tool developed by Dr. Sillito. University of Calgary
http://www.saturateapp.com
[5] Atlas.Ti, A commercial desktop application for Qualitative Data Analysis. http://www.atlasti.com/
[6] Chapter 8, Basics of Qualitative Research, Second Edition by Anselm Strauss & Juliet Corbin [Link]
[7] A Coding Scheme Development Methodology Using Grounded Theory for Qualitative Analysis of
Pair Programming. Institut für Informatik, Freie Universität Berlin. [Link]
[8] Building Inductive Theory of Collaboration in Virtual Teams: An Adapted Grounded Theory
Approach. Sarker, S. Lau, F. Sahay, S. [Link]
[9] Using physical memorabilia as opportunities to move into collocated digital photo-sharing.
Michael Nunes, Saul Greenberg, Carman Neustaedter. [Link]