0% found this document useful (0 votes)

69 views15 pages

Structured Association Mapping Using STR

This document provides instructions for using STRUCTURE and TASSEL software to perform structured association mapping. It describes how to use STRUCTURE to determine population structure by analyzing genotype data with STRUCTURE. The optimal number of populations (K) can be identified by examining the log probability of the data. STRUCTURE then estimates the ancestral proportions (Q matrix) for each individual. This Q matrix is then used as a covariate in TASSEL to account for population structure in association mapping analyses.

Uploaded by

Alamgir Hossain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

69 views15 pages

Structured Association Mapping Using STR

Uploaded by

Alamgir Hossain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Structured Association Mapping using STRUCTURE and

TASSEL

K K Vinod
Indian Agricultural Research Institute

With advances in genotyping technology, including rapid increases in the number of genetic
markers available for QTL studies, association analysis is now a viable approach for the
dissection of complex genetic traits (Churchill et al. 2004). Association mapping involves
assessment of population structure and using this population information and kinship
information among individuals to assess marker – trait association. Two common software
packages widely used today for association mapping are STRUCTURE (Pritchard et al.
2010) and TASSEL (Buckler et al. 2009). STRUCTURE implements a model-based
clustering method for inferring population structure using genotype data consisting of
unlinked markers. This program can demonstrate the presence of population structure,
identify distinct genetic populations, assign individuals to populations, and identify migrants
and admixed individuals. Trait Analysis by Association, Evolution and Linkage, or TASSEL,
makes use of the most advanced statistical methods to maximize statistical power for finding
QTL. Both a structured association approach (Pritchard et al. 2000; Thornsberry et al. 2001)
and a unified mixed model method have been implemented to minimize the risk of false
positives by integrating population structure and family relatedness within populations (Yu et
al. 2006).
I. Determining population structure using Structure 3.2.2
A. Preparation of marker genotype data
Prepare a matrix of marker genotype data in Excel as given below, for microsatellite data:
A B C D E F G H I J K
1 SSR1 SSR2 SSR3 SSR4 SSR5 SSR6 SSR7 SSR8 SSR9 SSR10
2 GENO1 110 330 190 140 220 140 240 160 200 180
3 GENO1 110 330 190 140 220 140 240 160 200 180
4 GENO2 110 330 190 140 230 140 240 160 190 180
5 GENO2 110 330 190 140 230 140 240 160 190 180
6 GENO3 110 320 190 140 220 140 240 160 200 180
7 GENO3 110 320 190 140 220 140 240 160 200 180
8 GENO4 110 320 999 140 220 140 240 160 200 180
9 GENO4 110 320 999 140 220 140 240 160 200 180
10 GENO5 110 330 180 140 220 140 240 160 200 180
11 GENO5 110 330 180 140 220 140 240 160 200 180

SSR is the code for markers; GENO is for genotype; -999: missing data value
x Save the data file in Text (tab delimited) type with a suitable filename <genodata.txt>.
B. Download and install STRUCTURE. Latest version of STRUCTURE is available for
download at http://pritch.bsd.uchicago.edu/structure.html.

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan
20 - Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
1
Agricultural University, Coimbatore.
Structured Association Mapping using STRUCTURE and TASSEL

x Run the Structure 3.2.2 software, by double clicking the icon at desktop.

DataTreePane ResultPane

AnalysisPane

(1) Building a project

x Click on File > New Project

x Fill in these boxes: Name of project, Select directory and Choose data file
x Select the file saved in step A and Click Next

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan 20 -
2 Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
Agricultural University, Coimbatore.
K K Vinod

x Fill in these boxes: number of individuals, ploidy of data ('2' for diploid), number of
loci, and missing data value ('-999'). Click [Next]

x Since our data contains marker and genotype labels, check 'Row and marker names'.
Click [Next]

x Since the data file contains genotypes labels, check Individual ID for each
Individual. Click [Finish]
(2) When project is done, a parameter set needs to be configured. For this, in the
STRUCTURE Main window, click on Parameter Set > New

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan
20 - Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
3
Agricultural University, Coimbatore.
Structured Association Mapping using STRUCTURE and TASSEL

x Fill in these boxes: Length of Burnin Period: (100000), and Number of Markov chain
Monte Carlo (MCMC) Reps (simulations) after Burnin: (100000).
This number should be high, preferably more than 100000 to get reliable
convergence.
x Click [OK] button
x In the Ancestry Model tab, select Use Admixture Model (This is the default). Click
[OK] button.

x In the Allele Frequency Model tab, select Alleles Frequencies Correlated

x Click [OK] button.

x Name the newly created parameter set in the input dialogue (e.g. test1)
x Click [OK].
(3) Running Simulations
[If optimum population structure (K) is already known by some other means, then
skip this step and go to step (5)]

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan 20 -
4 Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
Agricultural University, Coimbatore.
K K Vinod

x In the STRUCTURE main window, click on Project > Start a job

x In the Scheduler dialogue, select the Parameter set you want to run (e.g. test1);
x Set K between a range say 1 to 10;
x Set the number of replications (Iterations) to run (e.g. 5)
x Click [Start]

x The program will run for a very long time (>72 hours) depending on the speed of the
computer, size of the data, and number of iterations and the replications defined in the
parameter set.

x STRUCTURE displays Job is Completed! dialogue after successful analysis.

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan
20 - Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
5
Agricultural University, Coimbatore.
Structured Association Mapping using STRUCTURE and TASSEL

(4) Determining the optimum population structure

x To determine optimum value for K, Click on the Simulation Summary in the Tree
Pane of STRUCTURE window

x In the Result pane, click File on the left hand top corner, to save the simulation
summary in a text file.
x Copy the values of K and Ln P(D) into a convenient data editor (Excel is the best
option) and calculate average of Ln P(D) against each K, across replications. The K at
which Ln P(D) plateaus is to be taken as optimum K.

(5) Once optimum population structure (K) is known, estimate Inferred ancestry (Q matrix)
of individuals,
x In the STRUCTURE main window, Select Parameter Set >Run

x Enter the value of K in the box and click [OK].

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan 20 -
6 Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
Agricultural University, Coimbatore.
K K Vinod

x When the job is complete, go to the result pane and select the latest run from the
results folder
x From the results on the right pane, select inferred ancestry of individuals
x Copy and paste it in notepad.
Alternately,
x Go to the directory in where you save the project, open the folder with project name,
and open the result folder. There will be several files with a "f" suffix.
x Open the file with later run number in Notepad. In this output file, copy the values of
"Inferred ancestry of individuals"
x Inferred ancestry of individuals (Q matrix) is used as covariate in TASSEL.
x A typical Q matrix will look like as follows:
1 GENO1 0: 0.212 0.020 0.131 0.043 0.593
2 GENO2 0: 0.173 0.201 0.538 0.062 0.027
3 GENO3 0: 0.200 0.270 0.131 0.189 0.211
4 GENO4 0: 0.092 0.506 0.155 0.142 0.105
5 GENO5 0: 0.124 0.329 0.046 0.427 0.074
6 GENO6 0: 0.339 0.053 0.096 0.450 0.062
7 GENO7 0: 0.343 0.039 0.246 0.120 0.251
8 GENO8 0: 0.376 0.208 0.201 0.059 0.155
9 GENO9 0: 0.172 0.044 0.590 0.137 0.058
10 GENO10 0: 0.172 0.163 0.131 0.445 0.089
11 GENO11 0: 0.093 0.470 0.101 0.165 0.171
12 GENO12 0: 0.313 0.156 0.237 0.108 0.187
13 GENO13 0: 0.184 0.371 0.299 0.030 0.117
14 GENO14 0: 0.078 0.159 0.036 0.675 0.052
15 GENO15 0: 0.705 0.076 0.065 0.078 0.077
x Save the Q matrix. This need to be formatted to be read in TASSEL
II. Association Analysis using TASSEL
Association mapping can produce spurious association between marker and phenotype;
therefore, the population structure is an important component in estimating marker – trait
associations. This is done by incorporating the Q matrix of inferred ancestry coefficients of
the individuals across the sub-populations as covariate in the association mapping analysis.
To refine the results, the kinship coefficients are also used in association analysis. Kinship
matrix (K matrix) can be estimated using software such as SPAGeDi (Hardy and Vekemans,
2002) or can be estimated within TASSEL itself.
Unlike that of STRUCTURE which is a complete program by itself, TASSEL stand-alone
version runs only under Java runtime environment (JRE) version 1.5 and above. JRE is freely
downloadable software from Sun Microsystems, http://java.sun.com/. Alternatively, online
versions of TASSEL are also available.
Note: Latest version of TASSEL 3.0 does not support microsatellite data anymore. So SSR
data analysis can be done only using TASSEL version 2.1.
Both TASSEL 2.1 and 3.0 are available for free download at the following website:

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan
20 - Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
7
Agricultural University, Coimbatore.
Structured Association Mapping using STRUCTURE and TASSEL

http://www.maizegenetics.net/index.php?option=com_content&task=view&id=89&Itemid=1
19.
x Once software platforms are ready, double clicking on the file sTASSEL.jar will run
TASSEL 2.1.
A. Preparation of data
TASSEL requires three types of data primarily for the analysis. (i) Marker segregation data
(ii) Phenotype data and (iii) Ancestry coefficient data (Q matrix)
x Prepare these data in Excel, and save as Text Tab delimited (*.txt) files.
(a) Genotype data
Genotype data using microsatellites uses the following format:
A B C D E F G H I J
1 40 96:2
2 SSR1 SSR2 SSR3 SSR4 SSR5 SSR6 SSR7 SSR8 SSR9
3 GENO1 110:110 330:330 190:190 140:140 220:220 140:140 240:240 160:160 200:200
4 GENO2 110:110 330:330 190:190 140:140 230:230 140:140 240:240 160:160 190:190
5 GENO3 110:110 320:320 190:190 140:140 220:220 140:140 240:240 160:160 200:200
6 GENO4 110:110 320:320 190:190 140:140 220:220 140:140 240:240 160:160 200:200
7 GENO5 110:110 330:330 180:180 140:140 220:220 140:140 240:240 160:160 200:200
8 GENO6 110:110 ?:? 180:180 140:140 230:230 140:140 240:240 160:160 190:190
9 GENO7 110:110 330:330 190:190 140:140 220:220 140:140 240:240 160:160 200:200
10 GENO8 110:110 320:320 180:180 140:140 220:220 140:140 240:240 160:160 200:200
11 GENO9 110:110 330:330 190:190 140:140 220:220 140:140 250:250 160:160 200:200
12 GENO10 120:120 320:320 180:180 140:140 220:220 140:140 240:240 160:160 200:200

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan 20 -
8 Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
Agricultural University, Coimbatore.
K K Vinod

Note: The number in the first row tell TASSEL, the number of individuals, followed by
number of markers, and (:2) indicate diploid nature of the individuals. ? is commonly used
for missing data. Don't put individual with missing data in the first row. Instead, move it into
another row.
x Save the data matrix in Text (Tab delimited) type with a suitable filename,
<Markername.txt>.
(b) Phenotype data
Phenotype data uses following format:
A B C D E F G
1 40 6 1
2 PHE1 PHE2 PHE3 PHE4 PHE5 PHE6
3 GENO1 12.72 21.64 121.23 88.30 2.40 12.72
4 GENO2 11.32 25.16 129.20 95.30 2.46 11.32
5 GENO3 12.38 25.32 139.10 92.35 2.72 12.38
6 GENO4 13.00 25.19 123.60 104.80 2.26 13.00
7 GENO5 12.67 24.19 129.70 97.50 2.95 12.67
8 GENO6 10.80 24.24 118.10 86.10 2.57 10.80
9 GENO7 9.62 27.92 129.60 94.85 1.94 9.62
10 GENO8 9.35 25.30 114.20 96.70 2.28 9.35
11 GENO9 9.68 25.41 83.70 99.70 1.65 9.68
12 GENO10 9.16 26.44 94.50 91.00 2.10 9.16
Note: The number in the first row tell TASSEL, the number of individuals, followed by
number of traits, and 1 indicate number of header rows. -999 is commonly used for missing
data.
x Save the phenotype data matrix in Text (Tab delimited) type with a suitable filename,
<traitname.txt>.
(c) Population structure data
Population structure data (Q matrix) uses following format:
A B C D E F
1 40 5 1
2 Q1 Q2 Q3 Q4 Q5
3 GENO1 0.000 0.003 0.037 0.003 0.956
4 GENO2 0.000 0.003 0.006 0.016 0.975
5 GENO3 0.000 0.001 0.001 0.001 0.996
6 GENO4 0.000 0.004 0.005 0.002 0.989
7 GENO5 0.000 0.001 0.001 0.001 0.996
8 GENO6 0.000 0.001 0.002 0.001 0.996
9 GENO7 0.001 0.003 0.629 0.004 0.362
10 GENO8 0.000 0.002 0.001 0.001 0.995
11 GENO9 0.000 0.021 0.004 0.125 0.850
12 GENO10 0.001 0.002 0.002 0.002 0.993
Note: The number in the first row tell TASSEL, the number of individuals, followed by
number of sub-populations (K=5), and 1 indicate number of header rows.

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan
20 - Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
9
Agricultural University, Coimbatore.
Structured Association Mapping using STRUCTURE and TASSEL

x Save the Q matrix in Text (Tab delimited) type with a suitable filename, <Q_matrix
name.txt>.
(d) Kinship data (K matrix)
Kinship data is an optional requirement for Association mapping. Structured association
analysis is done using a general linear model (GLM) algorithm, which does not require K
matrix. K matrix is however, essential for mixed linear model (MLM) analysis.
If the kinship output from SPAGeDi is used, it should be formatted to read in TASSEL as
given below. For this, (i) add a value of "2" for relative kinship between same individuals and
(ii) change the all negative values of relative kinship into "0".
A B C D E F
1 40
2 GENO1 2.000 0.595 1.688 1.688 0.506
3 GENO2 0.595 2.000 0.572 0.550 1.286
4 GENO3 1.688 0.572 2.000 1.465 0.483
5 GENO4 1.688 0.550 1.465 2.000 0.416
6 GENO5 0.506 1.286 0.483 0.416 2.000

Note: The number in the first row tell TASSEL, the number of individuals. No missing data
are permitted in K matrix.
x Save the K matrix in Text (Tab delimited) type with a suitable filename <kinship.txt>
B. Running Structured Association Mapping
(i) Loading data
x Double click on the file sTASSEL.jar in the TASSEL 2-1 directory. Following
window opens.

OptionsPanel

DatatreePanel

MainPanel

ReportPanel

x Click on the [Data] button from Options panel.

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan 20 -
10 Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
Agricultural University, Coimbatore.
K K Vinod

x From the buttons below click on [File] button

x Select the data type to load, if not sure about data, it is better to choose “I will make
my best guess and try”, this will allow TASSEL to select the data type by itself.
x Click [OK] and select the file containing marker name <markername.txt>.
x Repeat the step, and load phenotype data <traitname.txt> and Q-matrix
<Q_matrixname.txt> one after the other.
x Once the data are loaded, they appear in the Data tree panel.
(ii) Geneotype data processing
When diploid microsatellite data is used, convert the raw format to genetype state, to do this,
x Click on the button [A:a Genotype] to start Genetype Converter

x Select Create alignment based on genotypic state (eg. A:a > Aa) and click [OK]
x This will add another dataset named “GenoStates” in the Data tree panel
(iii) Joining marker, phenotype and population structure data
x In the Data tree Panel, select data "GenoStates", "<Traitname>" and
"<Q_Matrixname>" by clicking on them, while <Ctrl> key is pressed

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan
20 - Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
11
Agricultural University, Coimbatore.
Structured Association Mapping using STRUCTURE and TASSEL

x Click on the button [U Join] in the Options Panel

x A new Data set named “GenoStates+<Traitname>+<Q_matrixname>” appear on the
data tree panel
(iv) Loading relative kinship data (for MLM analysis only)
x If kinship information is available, load it by clicking [File] button on the Options
Panel.
x Select either "Load square numerical matrix (eg. kinship) (phylip)" or “I will make
my best guess and try” and click [OK]
x Select Kinship file and Open
x The kinship data appear under Matrix in the Data tree panel
x Alternately Kinship can be calculated within TASSEL, by selecting the “GenoStates”
and clicking on [Analysis] and then [Kinship].
Note: This is a simple kinship matrix generated from the distance matrix. In order to
use more robust Kinship estimates it is recommended to use SPAGeDi or SAS.
(v) Structured association analysis using least squares GLM
x Select “GenoStates+<Traitname>+<Q_matrixname>” from the Data tree panel, by
clicking on it while holding the <Ctrl> key pressed
x Click on the [Analysis] button and then click on [GLM]

x Select all phenotype as data, and Sub-population data as covariate, Exclude the last
sub-population
x Check “Analyse Each Data Column Separately”
x Click [OK]

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan 20 -
12 Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
Agricultural University, Coimbatore.
K K Vinod

x Click on [Define Output]

x In Define Ftests, we can set number of permutations, say 1000. Click [Run]
x A new data set appear under Result>Association in the Data tree Panel named
“GLM_ GenoStates+<Traitname>+<Q_matrixname>”
(vi) Viewing and saving results
x Click on the button [Results] from Options Panel
x Select the result data from Data tree Panel, by holding the <Ctrl> key pressed and
clicking on “GLM_ GenoStates+<Traitname>+<Q_matrixname>”
x Click [Table] button from the Options Panel

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan
20 - Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
13
Agricultural University, Coimbatore.
Structured Association Mapping using STRUCTURE and TASSEL

x By clicking on the [Print] results can now be printed, or exported to Tab delimited
text file or Comma separated values (CSV) text file by clicking on buttons [Export
(CSV)] and [Export (Tab)] respectively.
(vii) Understanding the result file
Chr_ df_ F_ p_ #perm_ pperm_ padj_ df_ df_ MS_ Rsq_ Rsq_
Trait Locus Site Chr pos Marker Marker Marker Marker Marker Marker Model Error Error Model Marker

SPY RM1 0 0 0 1 0.0187 0.8921 3000 0.8957 1 5 33 7.6772 0.1557 4.78E04
SPY RM101 0 0 0 1 0.0612 0.8061 3000 0.8137 1 5 33 7.6674 0.1568 0.0016
SPY RM107 0 0 0 1 8.1549 0.0074 3000 0.0117 0.0643 5 33 6.1595 0.3226 0.1674
SPY RM11 0 0 0 1 0.2542 0.6175 3000 0.6148 1 5 33 7.6229 0.1617 0.0065
SPY RM127 0 0 0 2 0.5874 0.5616 3000 0.5648 1 6 32 7.6411 0.1851 0.0299
SPY RM13 0 0 0 1 2.0689 0.1597 3000 0.1799 1 5 33 7.2284 0.2051 0.0498
SPY RM144 0 0 0 1 0.0401 0.8425 3000 0.8414 1 5 33 7.6723 0.1563 0.001
SPY RM152 0 0 0 1 1.2263 0.2761 3000 0.2832 1 5 33 7.4064 0.1855 0.0303
SPY RM153 0 0 0 2 0.2129 0.8094 3000 0.8177 1 6 32 7.8176 0.1663 0.0111
SPY RM154 0 0 0 2 1.3971 0.262 3000 0.2602 1 6 32 7.2855 0.2231 0.0678
SPY RM16 0 0 0 2 0.2159 0.807 3000 0.8117 1 6 32 7.8162 0.1665 0.0112

The result file, in addition to displaying the F-statistics and p-values for the requested F-tests,
also contains information about degrees of freedom, the error mean square for the model, R-
square of the model, and Rsquare for the marker. The model R-square is the portion of total
variation explained by the full model. The marker R-square is the portion of total variation
explained by the marker but not by the other terms in the model. When permutations are
requested, #perm_Marker is the number of permutations run, pperm_ Marker is a test of
individual markers, and p-adj_Marker is the marker p-value adjusted for multiple tests. The
p-adj_Marker value is a permutation test derived using a step-down MinP procedure (Ge et
al. 2003) and controls the family-wise error rate (FWER). For example, if only markers with
p-adj values of .05 or less are accepted as significant, then the probability of rejecting a single
true null hypothesis across the entire set of hypotheses is held to .05 or less. This test takes
dependence between hypotheses into account and does not assume that hypotheses are
independent as do other multiple test correction procedures.
Note:
Both STRUCTURE and TASSEL comes with well written tutorials. This document is no
substitution for those. For any clarification and in depth information please read these
tutorials carefully. Besides, there are online discussion forums available for these software

Advanced faculty training on "Impact of genomics in crop improvement: Perceived and achieved", Jan 20 -
14 Feb 9, 2011, Centre for Advanced Faculty Training in Genetics and Plant Breeding, Tamil Nadu
Agricultural University, Coimbatore.
K K Vinod

packages, in which users post their doubts and suggestions. These discussions are watched by
the developers of these software and they incorporate modifications/ fix bugs as and when
required.
To join these forums visit following sites,
STRUCTURE: https://groups.google.com/d/forum/structure-software
TASSEL : http://groups.google.com/d/forum/tassel
Major References :
Buckler E, Casstevens T, Bradbury P, Zhang Z (2009) Trait Analysis by aSSociation, Evolution and
Linkage (TASSEL): User Manual. Cornell University
http://www.maizegenetics.net/tassel/docs/TASSEL_help.pdf
Pritchard JK, Wena X, Falush D (2010) Documentation for structure software: Version 2.3.
Department of Human Genetics, University of Chicago.
http://pritch.bsd.uchicago.edu/structure_software/release_versions/v2.3.3/structure_doc.pdf
Other references:
Bradbury PJ, Zhang Z , Kroon DE , Casstevens TM, Ramdoss Y, Buckler ES (2007) TASSEL:
software for association mapping of complex traits in diverse samples. Bioinformatics 23:
2633-2635
Churchill G, Airey DC, Allayee H, Angel JM, Attie AD et al. (2004) The Collaborative Cross, a
community resource for the genetic analysis of complex traits. Nature Genet 36: 1133-1137
Hardy OJ, Vekemans X (2002) SPAGeDi: a versatile computer program to analyse spatial genetic
structure at the individual or population levels. Mol Ecol Notes 2: 618-620
Pritchard JK, Stephens M, Rosenberg NA, Donnelly P (2000) Association mapping in structured
populations. Am J Human Genet 67: 170-181.
Pritchard, J. K., Stephens, M., and Donnelly, P. (2000) Inference of population structure using
multilocus genotype data. Genetics, 155:945–959
Thornsberry JM, Goodman MM, Doebley J, Kresovich S, Nielsen D et al. (2001) Dwarf8
polymorphisms associate with variation in flowering time. Nature Genet 28: 286-289.
Yu JM, Pressoir G, Briggs WH, Bi IV, Yamasaki M et al. (2006) A unified mixed-model method for
association mapping that accounts for multiple levels of relatedness. Nature Genet 38:203-
208

Map, Mars, GWS, GS
No ratings yet
Map, Mars, GWS, GS
44 pages
Design, Analysis, and Interpretation of Genome Wide Association Scans ISBN 1461494427, 9781461494423 Scribd Download
No ratings yet
Design, Analysis, and Interpretation of Genome Wide Association Scans ISBN 1461494427, 9781461494423 Scribd Download
17 pages
Composing Software: An Exploration of Functional Programming and Object Composition in JavaScript
From Everand
Composing Software: An Exploration of Functional Programming and Object Composition in JavaScript
Eric Elliott
No ratings yet
How To Transmit SAP Purchase Order To Vendor Via E-Mail
100% (14)
How To Transmit SAP Purchase Order To Vendor Via E-Mail
16 pages
S2000 Ddec Iv 170708
100% (4)
S2000 Ddec Iv 170708
95 pages
Biostatistics Course
100% (1)
Biostatistics Course
100 pages
Agfa NX
50% (2)
Agfa NX
12 pages
LM2500 and PGT25 Gas Turbine Families: Updated Shutdown and Restart Procedures
100% (2)
LM2500 and PGT25 Gas Turbine Families: Updated Shutdown and Restart Procedures
15 pages
Unit 2 Complete
No ratings yet
Unit 2 Complete
91 pages
Tassel User Guide 3.0
No ratings yet
Tassel User Guide 3.0
73 pages
ADBT 3 Marker Assisted Breeding
No ratings yet
ADBT 3 Marker Assisted Breeding
48 pages
Computer SSC CGL 2022 Tier II Paper I - RBE - Compressed
No ratings yet
Computer SSC CGL 2022 Tier II Paper I - RBE - Compressed
17 pages
Association Mapping and Its Role in Plant Breeding: Mahendrakumar N. Chaudhari
100% (1)
Association Mapping and Its Role in Plant Breeding: Mahendrakumar N. Chaudhari
28 pages
Quantitative Trait Loci (QTL) Mapping: Gurbachan S. Miglani
No ratings yet
Quantitative Trait Loci (QTL) Mapping: Gurbachan S. Miglani
42 pages
Atlas - Histologie PDF
100% (1)
Atlas - Histologie PDF
133 pages
QTL Mapping
No ratings yet
QTL Mapping
34 pages
4.PopulationStructure 2
No ratings yet
4.PopulationStructure 2
63 pages
2.1. Capitulo 2 Genetic Mapping and Mas
No ratings yet
2.1. Capitulo 2 Genetic Mapping and Mas
15 pages
Slides Woods
No ratings yet
Slides Woods
156 pages
Master Seminar 123
No ratings yet
Master Seminar 123
50 pages
Pop Gen
No ratings yet
Pop Gen
38 pages
Collard Et Al, 2005
No ratings yet
Collard Et Al, 2005
29 pages
A High-Density Genetic Linkage Map For Chinese Perch (Siniperca Chuatsi) Using Genotyping-By-Sequencing (GBS)
No ratings yet
A High-Density Genetic Linkage Map For Chinese Perch (Siniperca Chuatsi) Using Genotyping-By-Sequencing (GBS)
21 pages
Assciation Mapping
No ratings yet
Assciation Mapping
40 pages
Types of Mapping Populations
No ratings yet
Types of Mapping Populations
32 pages
Apache Cassandra Developer Associate - Exam Practice Tests
From Everand
Apache Cassandra Developer Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
Population Formation by Hybridisation
No ratings yet
Population Formation by Hybridisation
50 pages
Hybrid autoencoder - tài liệu về autoencoder
No ratings yet
Hybrid autoencoder - tài liệu về autoencoder
14 pages
Network-Based Hierarchical Population Structure Analysis For Large Genomic Data Sets
No ratings yet
Network-Based Hierarchical Population Structure Analysis For Large Genomic Data Sets
14 pages
Co-So-Di-Truyen-Chon-Giong-Cay-Trong - 1705.06916 - (Cuuduongthancong - Com)
No ratings yet
Co-So-Di-Truyen-Chon-Giong-Cay-Trong - 1705.06916 - (Cuuduongthancong - Com)
28 pages
Co-So-Di-Truyen-Chon-Giong-Cay-Trong - Geneticmaps - (Cuuduongthancong - Com)
No ratings yet
Co-So-Di-Truyen-Chon-Giong-Cay-Trong - Geneticmaps - (Cuuduongthancong - Com)
41 pages
Review - AssociationMapping-plants-Zhu2008
No ratings yet
Review - AssociationMapping-plants-Zhu2008
16 pages
Statistical Analysis of Molecular Data in Diversity Studies
No ratings yet
Statistical Analysis of Molecular Data in Diversity Studies
11 pages
The Design of Early-Stage Plant Breeding Trials Using Genetic Relatedness
No ratings yet
The Design of Early-Stage Plant Breeding Trials Using Genetic Relatedness
26 pages
Deep Dive Aurora
No ratings yet
Deep Dive Aurora
55 pages
Nakaya2012 GenomicSelection
No ratings yet
Nakaya2012 GenomicSelection
14 pages
Genes 13 00618 v2
No ratings yet
Genes 13 00618 v2
14 pages
QTL Mapping and Genomic Breeding Approaches - 2023.2024 - 1
No ratings yet
QTL Mapping and Genomic Breeding Approaches - 2023.2024 - 1
23 pages
GP504-17 &18QTL Mapping-Final Notes
No ratings yet
GP504-17 &18QTL Mapping-Final Notes
13 pages
Nachimuthu Article AnalysisOfPopulationStructureA
No ratings yet
Nachimuthu Article AnalysisOfPopulationStructureA
25 pages
Review - BSA in Genetics, Genomics and Crop Improvement
No ratings yet
Review - BSA in Genetics, Genomics and Crop Improvement
15 pages
Genetic Analysis of Potato Breeding Collection Using
No ratings yet
Genetic Analysis of Potato Breeding Collection Using
17 pages
Structure3 HubiszEtAl09
No ratings yet
Structure3 HubiszEtAl09
11 pages
Lecture 10
No ratings yet
Lecture 10
9 pages
Review Kuhner de Métodos Coalescentes
No ratings yet
Review Kuhner de Métodos Coalescentes
8 pages
QTL Mapping in Crop Plants Principles and Applications
No ratings yet
QTL Mapping in Crop Plants Principles and Applications
7 pages
Molecular Ecology - 2005 - EVANNO - Detecting The Number of Clusters of Individuals Using The Software Structure A
No ratings yet
Molecular Ecology - 2005 - EVANNO - Detecting The Number of Clusters of Individuals Using The Software Structure A
10 pages
Straightforward Inference of Ancestry and Admixture
No ratings yet
Straightforward Inference of Ancestry and Admixture
10 pages
Detecting The Number of Clusters of Individuals Using The Software Structure
No ratings yet
Detecting The Number of Clusters of Individuals Using The Software Structure
10 pages
Genome-Wide Association Mapping: A Case Study in Bread Wheat (Triticum Aestivum L.)
No ratings yet
Genome-Wide Association Mapping: A Case Study in Bread Wheat (Triticum Aestivum L.)
22 pages
Advanced QTL Backcross
No ratings yet
Advanced QTL Backcross
13 pages
VIP Software
No ratings yet
VIP Software
18 pages
Genepop
No ratings yet
Genepop
51 pages
R GWAS Packages
No ratings yet
R GWAS Packages
18 pages
Population Genetics of Genomics Based CR
No ratings yet
Population Genetics of Genomics Based CR
9 pages
Ccs347 GD Unit1 QB
No ratings yet
Ccs347 GD Unit1 QB
1 page
Genetic Maps
No ratings yet
Genetic Maps
41 pages
Balancing Genomic Selection Efforts For Allogamous Plant Breeding Programs
No ratings yet
Balancing Genomic Selection Efforts For Allogamous Plant Breeding Programs
10 pages
LEA: An R Package For Landscape and Ecological Association Studies
No ratings yet
LEA: An R Package For Landscape and Ecological Association Studies
14 pages
2003 - Breeding by Design
No ratings yet
2003 - Breeding by Design
5 pages
Vekemans MolEcol2004 PDF
No ratings yet
Vekemans MolEcol2004 PDF
15 pages
QTL Mapping
No ratings yet
QTL Mapping
5 pages
CNM CH1
No ratings yet
CNM CH1
47 pages
Mauricio NRG 2001
No ratings yet
Mauricio NRG 2001
12 pages
Animal Breeding GEB 206
No ratings yet
Animal Breeding GEB 206
21 pages
Association Mapping in Plants-Lecture
No ratings yet
Association Mapping in Plants-Lecture
9 pages
Development and Mining Microsatellites For Crop Genome Resources
No ratings yet
Development and Mining Microsatellites For Crop Genome Resources
3 pages
Potential Applications of Molecular Markers in Plant: Mini Review
No ratings yet
Potential Applications of Molecular Markers in Plant: Mini Review
3 pages
Report On Genetic Distance and Heterosis
No ratings yet
Report On Genetic Distance and Heterosis
1 page
SGC 410 Product Sheet 4189341242 Uk
No ratings yet
SGC 410 Product Sheet 4189341242 Uk
2 pages
Biostatistics: Written by - Alomgir Hossain
No ratings yet
Biostatistics: Written by - Alomgir Hossain
7 pages
Pearson Statistics Homework Answers
100% (1)
Pearson Statistics Homework Answers
4 pages
Pqgen Manual
No ratings yet
Pqgen Manual
4 pages
Easergy MiCOM P63x Protection Relays - P63283
No ratings yet
Easergy MiCOM P63x Protection Relays - P63283
2 pages
Coefficient Variation - Anil Sir
No ratings yet
Coefficient Variation - Anil Sir
5 pages
Project Report Template 2023.docx-1
No ratings yet
Project Report Template 2023.docx-1
10 pages
A Geometrical Approach To Enhance Security Against Cyber Attacks in Digital Substations
No ratings yet
A Geometrical Approach To Enhance Security Against Cyber Attacks in Digital Substations
15 pages
File Management Explained
No ratings yet
File Management Explained
5 pages
OD2e L2 Word List
No ratings yet
OD2e L2 Word List
5 pages
Lecture 11
No ratings yet
Lecture 11
29 pages
Madhav Institute of Technology & Science, Gwalior
No ratings yet
Madhav Institute of Technology & Science, Gwalior
2 pages
Oral Presentation: Communicative English 2 Psas
No ratings yet
Oral Presentation: Communicative English 2 Psas
19 pages
Data and Variable
No ratings yet
Data and Variable
12 pages
SQL For Beginners
No ratings yet
SQL For Beginners
79 pages
Microbiology''', Is The Study of Microbes, Such As
No ratings yet
Microbiology''', Is The Study of Microbes, Such As
27 pages
DLT Registration Process For Videocon
No ratings yet
DLT Registration Process For Videocon
20 pages
Apertadeira 90º-48ea
No ratings yet
Apertadeira 90º-48ea
48 pages
Probuds t31
No ratings yet
Probuds t31
7 pages
SSR Primers
No ratings yet
SSR Primers
53 pages
Our Strategic Searching Lesson Plan:, Grade Level: 6-9
No ratings yet
Our Strategic Searching Lesson Plan:, Grade Level: 6-9
4 pages
MEDICI 4 Blockchain Use Cases
No ratings yet
MEDICI 4 Blockchain Use Cases
28 pages
OOPS Project Proposal-3
No ratings yet
OOPS Project Proposal-3
3 pages
Unit 6 Challenges
No ratings yet
Unit 6 Challenges
8 pages
Assignment GEB-203
No ratings yet
Assignment GEB-203
8 pages
Assignment On Mycobacteria
No ratings yet
Assignment On Mycobacteria
7 pages
Artefacts
No ratings yet
Artefacts
3 pages
LJF
No ratings yet
LJF
3 pages
Particle Filter: Exploring Particle Filters in Computer Vision
From Everand
Particle Filter: Exploring Particle Filters in Computer Vision
Fouad Sabry
No ratings yet
Chi-Square Test: What Do You Mean X Test?
No ratings yet
Chi-Square Test: What Do You Mean X Test?
5 pages
Google Deepmind Alphazero Chess, As Having
No ratings yet
Google Deepmind Alphazero Chess, As Having
1 page
Biostatistics Previous Question
No ratings yet
Biostatistics Previous Question
3 pages

Structured Association Mapping Using STR

Uploaded by

Structured Association Mapping Using STR

Uploaded by

Structured Association Mapping using STRUCTURE and

(1) Building a project

x In the Allele Frequency Model tab, select Alleles Frequencies Correlated

x Click [OK] button.

x In the STRUCTURE main window, click on Project > Start a job

x STRUCTURE displays Job is Completed! dialogue after successful analysis.

(4) Determining the optimum population structure

x Enter the value of K in the box and click [OK].

x Click on the [Data] button from Options panel.

x From the buttons below click on [File] button

x Click on the button [U Join] in the Options Panel

x Click on [Define Output]

You might also like