Chapter 1 Data Interpretation
Chapter 1 Data Interpretation
1
Data Interpretation Overview
“We see the world, not as it is, but as we are e or, as we are conditioned to see it.”
Stephen R. Covey (The 7 Habits of Highly Effective People, p. 28)
is an expectation that DNA analyst training involves developing an understanding and mastery of
data interpretation as well as report writing and statistical analysis used in reaching conclusions on
typing results.
The general steps and workflow involved in forensic DNA typing are illustrated in Figure 1.1.
The companion volume to this book entitled Advanced Topics in Forensic DNA Typing: Methodology
Understanding
Results Obtained
Gathering the Data & Sharing Them
Collection/Storage/ Extraction/ Amplification/ Separation/
Data Stats Report
Characterization Quantitation Marker Sets Detection
FIGURE 1.1 Steps involved in the overall process of forensic DNA typing. This book focuses on understanding the data
through data interpretation and statistical interpretation.
(Butler 2012) covered many aspects of gathering the data used in DNA testing. Picking up where that
book left off, the purpose of this book, Advanced Topics in Forensic DNA Typing: Interpretation, is to
help readers understand data obtained from the STR typing process, with a focus on interpreting
and reporting results. Data interpretation is covered in Chapters 1 through 8 where we address
the question, “What are the data obtained from a set of samples?” Statistical interpretation is
reviewed in Chapters 9 through 15 to help discuss, “How significant are the data?” Chapter 16 fo-
cuses on drawing conclusions and report writing to assess, “What do the data mean when compar-
isons are made between evidentiary and reference sample results?”
Everyone may think that their way of DNA analysis is correct. However, misinterpretations of
some fundamental principles have given rise to a variety of approaches being undertaken in labs
today, some of which are not optimal, or even border on being incorrect for certain scenarios of
use. Unfortunately, often times the approaches taken for interpretation are subjective, and therefore
become the weakest part of the overall DNA typing process. I have written this book because I
believe that a better understanding of fundamental principles will aid consistency and quality of
work being performed in forensic DNA laboratories around the world.
In February 2009, the U.S. National Academy of Sciences released a report entitled “Strengthening
Forensic Science in the United States” (NAS 2009). The report emphasized that good (forensic) science
includes: (1) valid and reliable methodologies, and (2) practices that minimize the threat of bias in
data interpretation. My Methodology volume demonstrates that valid and reliable methodologies
can be achieved with forensic DNA typing. This Interpretation volume seeks to help minimize the
threat of bias in data interpretation.
Good science takes time and effort to do well. It is worth noting that some measurements and in-
terpretations are more reliable than others. Hence, uncertainty in measurements and interpretation
should be reflected in the reports generated in a forensic case investigation. As will be described
throughout this book, it is important that assumptions made during the interpretation process be
documented and conveyed as clearly as possible. This documentation will aid those individuals
reviewing the lab report to appropriately assess the results obtained and the conclusions drawn. It
is important for analysts to offer what they know from the data obtained in a case in a fashion
Copyright © 2014. Elsevier Science & Technology. All rights reserved.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
THE INTERPRETATION PROCESS 5
wavelengths of light that represent the digital signature of a DNA profile. When these data points are
plotted with time on the x-axis and fluorescence intensity on the y-axis, an electropherogram is
created. This electropherogram, sometimes referred to as an EPG or e-gram, is then evaluated using
STR genotyping software to produce a final results table representing the biological sample’s DNA
profile.
An overview of the components and processes involved in data interpretation are illustrated in
Figure 1.2. A sample data file contains time and fluorescence information for a PCR-amplified sample
along with an internal size standard. The sample data file, which has a file extension of .fsa or .hid, is
loaded into genotyping software along with an allelic ladder data file containing the same internal
size standard to enable the sample and allelic ladder results to be correlated. Along with the allelic
ladder sample, STR kit manufacturers provide a computer file specific for each STR kit containing
bins (that define the allele repeat number for each STR locus) and panels (that define the STR loci pre-
sent in the kit). When combined with the allelic ladder data file, bins and panels provide genotyping
software with the capability to transform DNA size information into an STR allele repeat number for
each observed peak.
A primary purpose of SOPs is to provide consistent results across DNA analysts within a labora-
tory as well as across cases analyzed by the same DNA analyst. The hope is that by following well-
defined directions in a laboratory’s SOP, the same result can be obtained on a particular DNA sample
by any qualified analyst or data reviewer.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
6 1. DATA INTERPRETATION OVERVIEW
TABLE 1.1 Information Flow in the Data Interpretation Process Correlated with Chapters in This Book
4 Allele Heterozygote or Homozygote or Allele(s) Peak heights and peak height ratios;
missing stochastic threshold
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
THE INTERPRETATION PROCESS 7
As described in D.N.A. Box 1.1, the .fsa files are in a binary file format known as ABIF (Applied
Biosystems, Inc. Format) and are similar to Tag Image File Format (TIFF), which has been used for
graphics files (Applied Biosystems Genetic Analysis Data File Format 2009). Although we will
focus on the electronic information used to create a DNA profile, it is worth noting that other
diagnostic information, such as laser power and run current, is also stored in the computer file
during data collection; this information can be helpful in troubleshooting efforts described in
Chapter 8.
During the process of data analysis and genotyping, information from the .fsa or .hid data
file is converted from time (scan) points to DNA size relative to an internal size standard
and then to an STR allele call relative to STR typing kit-specific bins and panels and allelic
ladders.
Sophisticated software has been developed to take sample electrophoretic data rapidly through
the STR genotyping process (Ziegle et al. 1992). Life Technologies/Applied Biosystems (Foster
City, CA), which manufactures the Genetic Analyzer CE instruments used in forensic DNA lab-
oratories, supplies software for processing the .fsa or .hid files generated by their CE instruments.
This software enables peaks to be defined and STR alleles designated using kit-specific allelic
ladders and bins and panels. GeneScan and Genotyper software programs were used originally
with early Mac and NT versions of data collection from the ABI 310 and 3100 series instruments.
In more recent years, GeneMapperID v3.2 and GeneMapperID-X v1.1 or 1.2 have replaced
GeneScan/Genotyper functions (in 2012, GeneMapperID-X v1.4 expanded data analysis capabil-
ities to 6-dyes).
GeneMarkerHID software (Holland & Parson 2011) from Soft Genetics (State College, PA) can also
process .fsa and .hid files directly as can Cybergenetics’ TrueAllele (Pittsburgh, PA; see also Kadash
2004) and Qualitype’s GenoProof (Dresden, Germany). In addition, the National Center for Biotech-
nology Information (NCBI) has produced an open-source STR genotyping software program (Goor
et al. 2011) called OSIRIS, which stands for Open Source Independent Review and Interpretation
System (OSIRIS 2014).
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
8 1. DATA INTERPRETATION OVERVIEW
various versions of their .fsa file format schema, alternative genotyping software programs such
most recently in September 2009, to enable other as GeneMarkerHID (SoftGenetics, State College,
software developers to create products that can PA), GenoProof (QualiType, Dresden, Germany),
utilize Genetic Analyzer data. Unfortunately, the TrueAllele (Cybergenetics, Pittsburgh, PA), and
full details of their .hid file format have not yet OSIRIS (National Center for Biotechnology
been publicly released. Information, Bethesda, MD) can process .hid file
However, both .fsa and .hid files appear to formats. The new .hid file format captures addi-
consist of the same basic structure: (1) a header tional information in the sample file including the
that points to (2) a directory of tags which then 3500 radio frequency identification (RFID) infor-
points via a file offset to (3) electrophoretic data. mation used for instrument consumables, such as
The electrophoretic data are collected by scan the polymer and buffer lot numbers. In addition,
number (time) and fluorescence signal in speci- normalization capabilities exist with ABI 3500
fied dye-channels. In the developer toolkit on the .hid files. Normalization, which enables signal to
Applied Biosystems website, a detailed descrip- be equalized between different ABI 3500 or 3500xl
tion of the file tags are provided for (a) ABI 3100 instruments, is performed by multiplying the
and 3100-Avant, (b) ABI 3130 and 3130xl, (c) ABI data points by a factor calculated from the in-
3500 and 3500xl, and (d) ABI 3730 and 3730xl tensity of some of the internal size standard
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
THE INTERPRETATION PROCESS 9
peaks in the LIZ600v2 size standard. This feature how programs perform baseline subtraction and
is only available with ABI STR kits. With ABI other forms of signal processing.
3500 .hid files, GeneMapperID-X and other non-
Applied Biosystems software programs show Source: Applied Biosystems Genetic Analysis Data File
Format, Sept 2009 (available at http://www.appliedbiosystems.
similar-sized peaks for data displayed without com/absite/us/en/home/support/software-community/tools-for-
the 3500 normalization feature turned on. How- accessing-files.html); information shared by George Riley, Douglas
ever, minor differences in relative fluorescence Hoffman, and Robert Goor from OSIRIS development (see http://
www.ncbi.nlm.nih.gov/projects/SNP/osiris/)
unit (RFU) peak heights may exist depending on
Expert System
(for single-source samples)
Color-separated
Raw Data Time Points DNA Sizes STR Alleles Genotype
Color separation
Interpretation
Allele Calling
Sizing
14,16
FIGURE 1.4 An example of the transformation of sample information that occurs at a single STR locus during the course of
data interpretation. This same transformation occurs with all other STR loci that are PCR-amplified in a multiplex kit.
Additional information (indicated across the bottom) helps convert the initial data through steps of color separation, sizing,
and allele calling. An analyst must review the initial software results as part of the interpretation process. Expert system
software can take a sample from raw data to genotype for high-quality, single-source samples.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
10 1. DATA INTERPRETATION OVERVIEW
to determine the overall STR profile representative of the original DNA template. Understanding
each step facilitates the troubleshooting efforts that reviewed in Chapter 8.
Through calibration to an internal size standard run with every sample, data points measured in
time (scan number) on the x-axis are converted to a relative size typically expressed to the one-
hundredth of a nucleotide. While we may sometimes refer to the DNA size of a PCR product in
base pairs (bp), in the denaturing environment of the capillary electrophoresis instrument we are
actually examining single-stranded DNA so nucleotides (nt) is a more correct unit of size. Thus,
example DNA size results might be 107.23 nt or 315.02 nt.
Different sizing algorithms are available in the GeneMapperID software, with the default method
being local Southern sizing (Elder & Southern 1983, Mayrand et al. 1992). Local Southern involves
determining the size of a DNA fragment by utilizing two peaks from the size standard larger and
two peaks smaller than the DNA fragment being sized.
Validation Studies
The purpose of validation studies is to observe, document, and understand variation in the data
generated under specific laboratory conditions. Validation helps define the scope or range of condi-
tions under which reliable results may be obtained. Throughout this book, suggestions are made for
validation studies that can be performed and the means for translating this information into param-
eters and thresholds used to assess and interpret data. By operating within validated ranges, uncer-
tainty in measurements made on evidentiary samples with the technique can be accurately conveyed
in laboratory reports.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
CHARACTERISTICS OF IDEAL DATA 11
Locus 1 Locus 2 Locus 3 Locus 4
2500
8,8
2000
1500
13 14 29 31 10 13
1000
500
FIGURE 1.5 An artificial electropherogram with ideal STR typing data demonstrating perfect intra-locus (100% peak
height ratios) and perfect inter-locus balance (heterozygous alleles from different loci are the exact same height). With this
electropherogram, homozygous alleles in Locus 3 stack to produce a peak height exactly twice that of heterozygous alleles in
other loci. Shaded vertical bins reflect potential alleles defined by a previously run allelic ladder. Horizontal dashed lines
represent potential stochastic and analytical thresholds. Numbers above the peaks represent STR allele calls. The y-axis is in
relative fluorescence units (RFUs). Hypothetical data image created with EPG Maker (SPM v3) kindly provided by Steven
Myers.
three repeats (Locus 4). The homozygous alleles in Locus 3 (each possessing 8 repeats and a signal of
1,000 RFU) stack to produce a signal of 2,000 RFU.
In this example, there is 100% interlocus balance between all four loci. Peak heights of the alleles
present in the three heterozygous loci are all 1,000 RFU. Within each locus, the sum of alleles present
is 2,000 RFU. This perfect interlocus balance enables the double signal from stacked homozygous
alleles to be easily recognized. Furthermore, no observable stutter artifacts interfere with the ability
to decipher whether a minor component from a second contributor might be present in this sample
result. In fact, the absence of any other detectable alleles provides great confidence in assuming that
Copyright © 2014. Elsevier Science & Technology. All rights reserved.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
12 1. DATA INTERPRETATION OVERVIEW
whether performed within a single laboratory or with database results generated from different
laboratories. Unfortunately, with real-world data, the situation is more complex and interpretation
is more challenging.
Real-World Challenges
Stochastic (random) variation in sampling each allele at a locus during PCR amplification leads to
variation in peak heights and peak height ratios for heterozygous samples (see Chapter 4). DNA
quality and quantity play a major factor in the degree of stochastic variation. Degraded DNA tem-
plates may make some STR allele targets unavailable. Alleles may fail to amplify if a sequence differ-
ence exists (due to mutation relative to the standard template sequence) in a PCR primer binding
region. These alleles, which are present in the original sample but fail to amplify, are termed “silent”
or “null” (see Chapter 4). PCR inhibitors present in forensic evidentiary samples may reduce effi-
ciency in amplifying some loci and/or alleles resulting in an imbalance in the signal obtained across
the DNA profile. The PCR process is also highly dependent on DNA sample quantity; this can lead to
great variability in amplifying individual alleles (if retesting the identical DNA extract) when ampli-
fying smaller amounts of DNA.
The existence of PCR amplification artifacts, such as stutter products or STR alleles that are not
fully adenylated and possess A peaks (see Chapter 3), complicates interpretation, particularly
when a mixture of DNA templates from more than one individual may be present. The possibility
of tri-allelic patterns (see Chapter 5) can further complicate the ability to recognize and discern
whether a DNA profile originates from more than one individual.
Technological artifacts can arise due to fluorescent dye impurities in the primer synthesis (dye
blobs), failure of the spectral calibration due to signal saturation (pull-up or bleedthrough between
dye channels), and other anomalies such as electrophoretic spikes. See Chapter 8 for more informa-
tion on these artifacts.
To complicate factors even more, variability exists between CE instruments due to the individual
instruments’ optics used to detect the fluorescence signal arising from the dye-labeled PCR products.
Copyright © 2014. Elsevier Science & Technology. All rights reserved.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
GUIDANCE FOR DNA INTERPRETATION 13
discusses methods and produces guidance documents to aid the forensic DNA community
(including revisions to the QAS). A helpful guidance document is the 2010 SWGDAM Interpretation
Guidelines for Autosomal STR Typing by Forensic DNA Testing Laboratories (SWGDAM 2010). The
September 2011 Quality Assurance Standards for Forensic DNA Laboratories are available online
(QAS 2011).
Other groups around the world that play a similar role as SWGDAM include the DNA Commis-
sion of the International Society for Forensic Genetics, the European Network of Forensic Science
Institute’s (ENFSI 2014) DNA Working Group, and the Australia/New Zealand Biology Specialist
Advisory Group (BSAG).
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
14 1. DATA INTERPRETATION OVERVIEW
from more than 60 countries promoting scientific knowledge in the field of genetic markers as applied
to forensic science. Since 1989, the ISFG has issued recommendations on a variety of important topics
in forensic DNA analysis through a DNA Commission. These recommendations have included
naming of STR variant alleles and STR repeat nomenclature, mitochondrial DNA and Y-STR issues,
DNA mixture interpretation, paternity testing biostatistics, disaster victim identification, use of ani-
mal DNA in forensic genetic investigations, and coping with potential allele drop-out and drop-in
through probabilistic genotyping. For more information on the ISFG DNA Commission, see their
website (ISFG 2014).
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
GUIDANCE FOR DNA INTERPRETATION 15
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
16 1. DATA INTERPRETATION OVERVIEW
In early 2010, SWGDAM approved and released “SWGDAM Interpretation Guidelines for
Autosomal STR Typing by Forensic DNA Testing Laboratories.” With the availability of these inter-
pretation guidelines, laboratories were “encouraged to review their standard operating procedures
and validation data and to update their procedures as needed” (SWGDAM 2010).
As will be emphasized throughout this book, a forensic DNA laboratory should develop STR
interpretation guidelines based upon its own validation studies. Information from STR kit and instru-
ment manufacturers and results reported in the literature can be helpful. Practical experience with
instrumentation and results from performing casework are also important factors in developing an
interpretation strategy.
statistical analysis in support of any inclusion that is determined to be relevant in the context of a
case, irrespective of the number of alleles detected and the quantitative value of the statistical anal-
ysis” (SWGDAM 2010). Providing an appropriate weight to the evidence provides an opportunity to
reflect the uncertainty in the result obtained e particularly with partial profiles where more ambigu-
ity may exist.
If a match is observed between a suspect (known sample “K”) and crime-scene evidence (question
sample “Q”), then three possibilities exist: (1) the suspect deposited the sample, (2) the suspect did
not provide the sample but has the profile by chance, and (3) the suspect did not provide the sample
and the matching result is a false positive due to a sample switch or some other kind of error.
The first explanation is the basis behind the use of DNA testing in the criminal justice system. The
second possibility depends on population genetics principles, covered in the second half of this book,
specifically Chapter 10, from which the probability of a random match is determined. The third
explanation of why a match might occur concerns the possibility of laboratory mistakes. Chapter 7
in Advanced Topics: Methodology (Butler 2012) discusses quality assurance measures that are in place
to prevent or reduce the possibility of error in performing DNA testing. Generally speaking, a great
deal of effort goes into ensuring reliable forensic DNA testing.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
A MATCH OR NOT A MATCH: THAT IS THE QUESTION. 17
TABLE 1.2 Characteristics of Autosomal STR Loci Present in 31 Commercially Available STR Kits
Hexaplex ESS
Nonaplex ESS
Decaplex SE
ESSplex SE
NGM SElect
GlobalFiler
SEfiler Plus
SGM Plus
SinoFiler
ESSplex
MiniFiler
VeriFiler
Profiler
IDplex
NGM
Promega STR kits Life Technologies (ABI) STR kits Qiagen STR kits
1q31 F13B AAAT 6 to 11 2
Xp, Yp Amelogenin -- -- 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1
Allele range is from the NIST 1036 data set (D.N.A. Box 1.4). Numbers inside the colored boxes indicate relative size position for that locus within a dye
channel for the specific STR kit.
When utilizing data comparisons with DNA databases that may have data coming from many
sources, it is important to recognize that different PCR primer sets may detect or not detect an allele
(allele dropout) due to primer binding site mutations (see Chapter 4).
In forensic DNA Q-K comparisons (as currently practiced in many parts of the world), if any STR
locus fails to match when comparing the genotypes between two or more samples, then the compar-
ison of profiles between the questioned and reference sample is usually declared a non-match,
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
18 1. DATA INTERPRETATION OVERVIEW
TABLE 1.3 Characteristics of Y-STR Loci and Y-Chromosome Sex-Typing Markers in Commercial Kitsa
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
D.N.A. BOX 1.4
Life Technologies/ABI STR Kits (Internal Size Standard LIZ GS500 – 5-dye; LIZ GS600 – 6-dye)
Promega STR Kits (Internal Size Standard CXR ILS600 – 4-dye; CXR ILS 550 – 5-dye)
200 bp 300 bp 400 bp
100 bp
16plex
(4-dye)
16
24plex
Fusion
FIGURE 1.7 Layout of loci by dye channel and relative size in selected Promega PowerPlex STR kits (Promega 2012).
Copyright © 2014. Elsevier Science & Technology. All rights reserved.
regardless of how many other loci match. This binary (match/no-match) approach becomes problem-
atic with low-level evidentiary DNA samples where stocastic allele dropout is likely. Probabilistic
approaches are under development to help with these difficult situations (Gill et al. 2012).
As noted in Chapter 14, paternity testing is an exception to this “single mismatch leads to exclu-
sion” rule because of the possibility of mutational events. When analyzing and reporting the results
of parentage cases, an allowance for one or even two possible mutations is often made. In other
words, if 13 loci are used and the questioned parentage is included for all but one locus, the data
from the non-inclusive allele is usually attributed to a possible mutation.
In the end, interpretation of results in forensic casework is a matter of professional judgment and
expertise. Interpretation of results within the context of a case is the responsibility of the case analyst
with supervisors or technical leaders conducting a follow-up verification of the analyst’s interpreta-
tion of the data as part of the technical and administrative review process (see Chapter 16). When
coming to a final conclusion regarding a match or exclusion between two or more DNA profiles,
laboratory interpretation guidelines should be adhered to by both the case analyst and the supervi-
sor. However, as experience using various analytical procedures grows, interpretation guidelines
will evolve and improve. These guidelines should always be based on the proper use of controls
and validated methods.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
STR LOCI, KITS, AND POPULATION DATA 21
STR LOCI, KITS, AND POPULATION DATA
At the time this book is being written, three commercial manufacturers provide more than two
dozen different STR kits. These kits examine subsets of markers from a total of 29 autosomal STR
loci, a sex-typing marker named amelogenin, and a Y-STR marker DYS391. Table 1.2 lists the char-
acteristics of these STRs, including their chromosomal location, primary repeat motif, and allele
range. For Y-chromosome analysis, up to 29 Y-STR loci can be examined with commercial kits avail-
able as of early 2014 (Table 1.3).
U.S. population data from 1,036 individuals has been collected on these 29 autosomal STR loci and
23 Y-STR loci (D.N.A. Box 1.4). Data generated from these DNA samples will be used throughout
the book.
The STR locus dye color and size range for several commonly used STR kits are laid out in
Figure 1.6 for Life Technologies kits (Life Technologies 2012), Figure 1.7 for Promega kits (Promega
2012), and Figure 1.8 for Y-STR kits from Life Technologies and Promega. As these STR kits will be
referred to in many of the following chapters, we include them here as a helpful reference.
Y-STR Kits
(Internal Size Standard CXR 600 – 4-dye; LIZ GS500 or CC5 ILS 500 – 5-dye)
FIGURE 1.8 Layout of loci by dye channel and relative size in Y-chromosome STR kits from Life Technologies and
Promega.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
22 1. DATA INTERPRETATION OVERVIEW
SUMMARY
DNA interpretation with STR markers involves utilizing genotyping software and laboratory
SOPs to evaluate CE data. Peaks in multi-colored CE electropherograms generated as CE mobility
time points are translated into DNA size information and then to allele repeat number for each
STR locus. In both evidentiary and reference samples, decisions are made for each peak above an
analytical threshold regarding whether or not the peak is an allele or an artifact, whether or not alleles
at an STR locus can be paired to form a genotype, whether it is possible for some alleles to be missing
from the data, and whether or not the sample originated from a single-source or a mixture of multiple
contributors. Validation studies are essential for setting parameters used in a laboratory’s SOPs to
make these decisions. Guidance on validation studies and data interpretation has been provided
from organizations such as SWGDAM and the ENFSI.
Applied Biosystems. (2004). GeneMapperID Software Version 3.2: Human Identification Analysis Tutorial. Foster City,
California.
Applied Biosystems. (2009). Genetic Analysis Data File Format, Sept 2009. Available at http://www.appliedbiosystems.com/
absite/us/en/home/support/software-community/tools-for-accessing-files.html. Accessed March 18, 2014.
BatchExtract. ftp://ftp.ncbi.nih.gov/pub/forensics/BATCHEXTRACT. Accessed March 18, 2014.
GeneMapperID-X (from Applied Biosystems): http://www.lifetechnologies.com/us/en/home/technical-resources/
software-downloads/genemapper-id-x-software.html. Accessed March 18, 2014.
GeneMarker HID (from Soft Genetics): http://www.softgenetics.com/GeneMarkerHID.html. Accessed March 18, 2014.
GenoProof (from Qualitype AG): http://www.genoproof.de/en/. Accessed March 18, 2014.
Goor, R. M., et al. (2011). A mathematical approach to the analysis of multiplex DNA profiles. Bulletin of Mathematical Biology,
73(8), 1909e1931.
Holland, M. M., & Parson, W. (2011). GeneMarkerÒ HID: a reliable software tool for the analysis of forensic STR data. Journal
of Forensic Sciences, 56(1), 29e35.
Kadash, K., et al. (2004). Validation study of the TrueAllele automated data review system. Journal of Forensic Sciences, 49,
660e667.
OSIRIS (Open Source Independent Review and Interpretation System). http://www.ncbi.nlm.nih.gov/projects/SNP/osiris/.
Accessed March 18, 2014.
TrueAllele (from Cybergenetics). http://www.cybgen.com. Accessed March 18, 2014.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
READING LIST AND INTERNET RESOURCES 23
DNA Sizing
Elder, J. K., & Southern, E. M. (1983). Measurement of DNA length by gel electrophoresis II: comparison of methods for
relating mobility to fragment length. Analytical Biochemistry, 128, 227e231.
Mayrand, P. E., et al. (1992). The use of fluorescence detection and internal lane standards to size PCR products automatically.
Applied and Theoretical Electrophoresis, 3(1), 1e11.
Rosenblum, B. B., et al. (1997). Improved single-strand DNA sizing accuracy in capillary electrophoresis. Nucleic Acids Research,
25, 3925e3929.
Ziegle, J. S., et al. (1992). Application of automated DNA sizing technology for genotyping microsatellite loci. Genomics, 14(4),
1026e1031.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.
24 1. DATA INTERPRETATION OVERVIEW
Butler, J. M. (2006). Genetics and genomics of core short tandem repeat loci used in human identity testing. Journal of Forensic
Sciences, 51, 253e265.
Butler, J. M., & Hill, C. R. (2012). Biology and genetics of new autosomal STR loci useful for forensic DNA analysis. Forensic
Science Review, 24(1), 15e26.
FBI. (2012). Planned process and timeline for implementation of additional CODIS core loci. Available at http://www.fbi.gov/about
e us/lab/codis/planned e process e and e timeline e for e implementation e of e additional e codis e core e loci.
Gill, P., et al. (2006a). The evolution of DNA databases e Recommendations for new European STR loci. Forensic Science
International, 156, 242e244.
Gill, P., et al. (2006b). New multiplexes for Europe e amendments and clarification of strategic development. Forensic Science
International, 163, 155e157.
Hares, D. R. (2012a). Expanding the CODIS core loci in the United States. Forensic Science International: Genetics, 6(1), e52ee54.
Hares, D. R. (2012b). Addendum to expanding the CODIS core loci in the United States. Forensic Science International: Genetics,
6(5), e135.
Life Technologies (2012). http://www.invitrogen.com/site/us/en/home/Products-and-Services/Applications/Human-
Identification/globalfiler_str_kit.html. Accessed March 18, 2014.
GlobalFiler information (2014). http://www.invitrogen.com/site/us/en/home/Products-and-Services/Applications/
Human-Identification/globalfiler_str_kit/resources.html. Accessed March 18, 2014.
Mulero, J. J., & Hennessy, L. K. (2012). Next-generation STR genotyping kits for forensic applications. Forensic Science Review,
24(1), 1e13.
Promega. (2012). PowerPlex Fusion System. http://www.promega.com/products/pm/genetic-identity/powerplex-fusion.
Accessed March 18, 2014.
Ruitberg, C. M., et al. (2001). STRBase: a short tandem repeat DNA database for the human identity testing community.
Nucleic Acids Res., 29, 320e322.
STRBase: http://www.cstl.nist.gov/strbase. Accessed March 18, 2014.
Hill, C. R., et al. (2013). U.S. population data for 29 autosomal STR loci. Forensic Science International: Genetics, 7, e82ee83.
NIST Population Data (2014). http://www.cstl.nist.gov/strbase/NISTpop.htm. Accessed March 18, 2014.
I. DATA INTERPRETATION
Butler, John M.. <i>Advanced Topics in Forensic DNA Typing: Interpretation</i>, Elsevier Science & Technology, 2014. ProQuest Ebook Central,
http://ebookcentral.proquest.com/lib/uam-ebooks/detail.action?docID=1770238.
Created from uam-ebooks on 2019-08-16 06:11:08.