Technical Note: Guide To Probe Logarithmic Intensity Error (PLIER) Estimation
Technical Note: Guide To Probe Logarithmic Intensity Error (PLIER) Estimation
A F F Y M E T R I X P R O D U C T F A M I LY >
I N F O R M AT I C S >
Technical Note
This guide is a reference tool for biologists utilizing the new probe logarithmic intensity error (PLIER) method for
calculating signal. Additional detail
regarding the algorithm appropriate
for statisticians and bioinformaticists
is provided in the appendix.
Introduction
The probe logarithmic intensity error
(PLIER) method produces an improved
signal (a summary value for a probe set) by
accounting for experimentally observed
patterns for feature behavior and handling
error appropriately at low and high abundance. Resulting benefits include:
Higher reproducibility of signal (lower
coefficient of variation) without loss of
accuracy
Higher sensitivity to changes in
abundance for targets near background
Dynamic weighting of the most informative probes in an experiment to
determine signal
This method was developed by building
upon many of the concepts that have been
published recently within the field of
GeneChip microarray data analysis,
Figure 1: A probe set containing probes with varying feature response is helpful in detecting a
range of abundance, as illustrated by spiked-in concentrations. For example, strongly responsive
features are informative at the low end, but saturate at the high end. Weakly responsive features
are uninformative at the low end, but are informative at the high end of the abundance range.
15
14
13
Typically responsive feature
log(PM) intensity
II
II
12
Most responsive feature
11
10
9
8
7
6
5
0
0.25 0.5
1
2
4
8
16
32
64 128 256 512 1024
Concentration of PM labeled spikes (Probe set 36311_at in Latin Square experiment)
A F F Y M E T R I X P R O D U C T F A M I LY > A R R AY S
Figure 2: Once intensities are scaled using feature response, it becomes easier to detect
non-systematic differences among features, and down-weight low performance features. Only the
rescaled Perfect Match probes intensity is plotted here, and the actual algorithm incorporates
background information. Note that background plays a large effect in the least responsive feature,
and that the most responsive feature saturates first.
Same three features adjusted for responsiveness
12
10
8
Least responsive feature
6
Typically responsive feature
4
Most responsive feature
2
0
0
0.25 0.5
1
2
4
8
16
32
64
128 256 512 1024
Concentration of PM labeled spikes (Probe set 36311_at in Latin Square experiment)
Calculation of Signal
PROBE BEHAVIOR AND WEIGHTING
II
II
well as making it easy to identify poorly performing features with erratic hybridization
behavior. Feature responses are a measure of
how much the relative intensity of a feature
is due to the feature itself, as opposed to the
common target of a probe set. This relative
response is influenced by how likely a probe
is to bind to a complementary sequence
Figure 3: The information from all three feature pairs is combined to generate the signal estimate. Each feature pair votes for a particular target response value (lines 1-3). Each possible
value for the target response (plotted along the x-axis) has a penalty (lack of fit) when compared
to the intensity of a given feature pair. The three individual penalties are added up (line 4), and
the PLIER estimate of signal is the target response value with the lowest penalty (best fit to
data). Note that the feature with the lowest feature response is only useful in this instance for
excluding high signal values (because the feature reponse is so low that at a 4 pM spiked-in
concentration the perfect match intensity is mostly background). The remaining two features
estimate slightly different values individually due to noise. Note that the x-axis (possible target
response values) is logarithmically scaled, so that 5 is 25.
"Lack of fit" of target response to observed intensities
14
-1
0.09
0.08
0.07
0.06
0.05
0.04
0.03
0.02
0.01
0
1
5
7
9
Possible values for target response
11
13
15
INFORMATICS
A F F Y M E T R I X P R O D U C T F A M I LY > A R R AY S
Figure 4: The amount of uncertainty (or potential error in estimation) present is dependent
upon the total intensity, not just the background-adjusted intensity. The proportion of intensity
represented by background is dependent upon the abundance of the complementary target as
well as the feature response. Thus, the small PM-MM difference in the first feature pair has an
error bar comparable to that of the large PM-MM difference in the second feature pair, and so
is much less informative than might be expected. Small differences between relatively large
numbers contain very little information, and therefore are effectively down-weighted by PLIER.
2000
1800
1600
1400
Error is proportional to the full intensity, not the background adjusted intensity
PM-MM
MM (Background)
1200
1000
800
600
400
200
0
Feature pairs
tures. For example, a feature that is consistently twice as bright as the median feature
intensity is considered a high performance
probe (and strongly responsive). However,
a feature that is twice as bright only half
the time but is the same as the median feature intensity the other half of the time is
considered a low performance feature due
to inconsistent behavior. This type of
observation is often due to cross-hybridization effects creating erratic patterns.
Once the scaling factor is applied, the
intensity data from each feature can be
combined to generate a goodness of fit curve
for possible signal values (target responses,
see glossary). In this process, the most
informative features provide the strongest
contribution to the signal. At the low end
of target abundance, where background
dominates the intensity, these are most likely
the probes with the highest feature response.
Inconsistent features are down-weighted,
using the Geman-McClure function.
Through the scaling and down-weighting
process, the features that are most informative in each experiment contribute the
most to the final signal estimate (Figure 3).
ERROR MODEL
INFORMATICS
Input Data
FEATURE INTENSITY
MODEL FILES
II
II
A F F Y M E T R I X P R O D U C T F A M I LY > A R R AY S
Sensitivity
to Low
Expressors
Detection
Step-by-Step Selection of
Options
PM-MM
PM-B
Small
Differential
Change
between
Experiments
PM only
MM treated as PM
Low
High
Sample-to-Sample Variability
PM-MM
PM-B
Method
Assumptions
Benefits
Drawbacks
PM-B
Low noise
PM Only
Low noise
Approximately constant CV
MM treated
as additional
PM
PM-MM
II
II
INFORMATICS
A
AFFFFY
YM
MEETTR
RIIX
X PPR
RO
OD
DU
UC
CTT FFA
AM
MIILY
LY >> A
AR
RR
RAY
AYS
S
Method
Assumptions
Benefits
Drawbacks
PLIER
Computationally intensive
In cases where feature intensities
disagree, may have more than one
solution
Performance relative to amount of
model data provided
Lack of bias
Conservative
Smooth down-weighting of outliers
Positive output values
Minimal bias
dCHIP
RMA
MAS 5
INFORMATICS
Post-Processing
Once the PLIER algorithm has been run,
the appropriate scaling and normalization
options can be applied.
INTERPRETING OUTPUT
A F F Y M E T R I X P R O D U C T F A M I LY > A R R AY S
Comparison to Affymetrix
Microarray Suite 5.0
While Affymetrix Microarray Suite 5.0
(MAS 5) treats all features equally, PLIER
utilizes experimental evidence to 1) weight
features based on consistency, and then 2)
utilize the features that are most consistent
for that experiment to estimate signal.
Arrays analyzed using the MAS 5 algorithm
can be re-analyzed using the PLIER algorithm.
Table 3: Options
Options
Data Pre-Processing
Quantile normalization
Quantile normalization
Probe Type
Background
% Background (Uniform)
None
Optimization
Quick
Full
REFERENCES
Li, C. and W.H. Wong. Model based analysis of
oligonucleotide arrays: Expression index computation
and outlier detection. PNAS 98:31-36 (2001).
www.biostat.harvard.edu/complab/dchip
Conclusion
PLIER provides an advanced, flexible framework for primary analysis of GeneChip
microarray data incorporating recent
advances in expression analysis. These
advances include the use of feature response
terms to improve the interpretation of
intensities, a robust M-estimator to reject
outliers, and an improved model for intensity
level errors. The PLIER error model is the
recommended option, as it simultaneously
incorporates a smooth transition between
analysis of low and high expressors, rescaling
by feature responses, and resistance to outliers. However, options within the PLIER
framework allow simpler models to be used
that can replicate the features of other popular low-level analysis techniques, including
the analysis of perfect-match only arrays.
II
II
Glossary
M-Estimator An estimator obtained by minimizing a function. Similar to maximum-likelihood estimates.
Mean The familiar arithmetic mean of several numbers. Minimizes the sum of squares.
Median The middle rank value (or average of the two such values, if there are an even
number of data points). Minimizes the sum of absolute values.
Robustness Ability to produce reasonably accurate results in the presence of outliers.
Sensitivity The ability to detect an entity when it is present. Two types of sensitivity
are often discussed for microarrays. The first is the ability to detect a (specified) transcript
as being present in a sample. The second is the ability to detect a (specified) differential
change between two experiments when there is a change.
False positive When an entity is detected and it is not present. For microarrays, there
are two types often discussed. First, when an absent transcript is reflected as a significantly present output. Second, when a differential change is falsely indicated when there
is no change in an expression level.
False negative When an entity is not detected, and it is present. For microarrays, there
are two types of false negative rates. First, when a transcript is indicated as being absent
when it is expressed in the sample. Second, when a (specified) differential change is
falsely indicated as unchanged.
Specificity The ability to exclude an entity when it is not there. For microarrays, there
are two types of specificity. First, when an absent transcript is correctly indicated as absent.
Second, when an unchanged expression level is correctly indicated as unchanged.
Background Unwanted intensity observed on an array. Sources of background include
fluorescence of the glass, stray DNA, and many other sources. Varies with sequence of
DNA in the probe.
Residuals Real experiments have noise, and therefore any estimates of true values will
differ slightly from those observed in each data point. The difference between the
observed values and the predicted values for a particular estimate.
Error model A means of interpreting how well a model explains the observed data.
Signal A summary value for the observed intensities in a probe set reflecting a common transcript. In PLIER, estimated as the target response that best fits the data given
the feature responses.
Total response The intensity due solely to the target of interest interacting with a given
feature. Assumed in the PLIER model to consist of two components: feature response
and target response, multiplied together.
Feature response A relative measure within a probe set of the (multiplicative) difference in
intensity due to a given feature being different (in location, probe sequence, etc.) than another.
Assumed to be invariant across experiments for a given feature. Example: one PM-MM
difference is reliably twice as bright as another PM-MM difference across experiments.
The pairs differ in feature response. Feature responses cannot be compared across probe
sets due to their relative nature. Feature response is a dimensionless scaling factor.
Target response A measure within an experiment of the (multiplicative) difference in
intensity due to a given experiment having a different target abundance. Assumed to be
common to all features (background adjusted) within a probe set. Example: one PM-MM
difference is twice as bright in one experiment as another experiment. The experiments
differ in target response for this feature. Target responses cannot be compared across
probe sets due to the relative nature of feature responses.
INFORMATICS
A F F Y M E T R I X P R O D U C T F A M I LY > A R R AY S
II
II
A F F Y M E T R I X P R O D U C T F A M I LY > A R R AY S
Thus, a simple formula for goodness of fit has been derived which can be used to estimate signal and feature response by finding those
non-negative values of signal and feature response that best fit the observed data. This is the M-estimator called PLIER. Some performance
metrics for PLIER (PLIER+16 - with simple variance stabilization applied) can be found at the AffyComp web site run by Rafael Irizarry.
Other presentations on PLIER can be found at the Affymetrix web site, including the presentation at the Low-Level Analysis workshop,
discussing bias and variance issues (www.affymetrix.com/corporate/events/seminar/microarray_workshop.affx).
Calculation of Signal
PLIER drafts an initial estimate of signal (target response) and feature response for each experiment and background-adjusted feature using the
data provided. Please note that a reference table to variables used in the subsequent equations (Table 4) is provided at the end of this section.
PLIER operates by finding target responses (t(i)) for each experiment i and feature responses f(j) for each feature (pair) j that minimize the
function LL(t,f) = sum H(PM,MM, BKG, f(j),t(i)) over all i,j.
Define the following component functions:
y(i,j) = f(j)*t(i)
q = (y*y) + 4*PM*MM
e = (y + q)
(2*PM)
or (if not using mismatch)
e = (y + BKG)
PM
r = log(e)
h=
r*r
1+ r*zr
II
II
Description
Target response at each experiment i (current signal estimate)
Feature response for each feature (or feature pair) j
Intermediate value in likelihood computation
Residual on log-scale
Intensity of the j-th perfect match feature in the i-th experiment
Intensity of the j-th mismatch feature, in the i-th experiment
Total response estimate with no background
Background value (if not using MM)
German-McClure function for discounting residuals as applied to one probe pair
Extra penalty term added to log likelihood for differences in target response
Extra penalty term added to log likelihood for differences in feature response
Log-Likelihood approximation used by PLIER
INFORMATICS
A F F Y M E T R I X P R O D U C T F A M I LY > A R R AY S
where B is either BKG or MM, depending on which option has been selected by the user. L is an attenuation parameter ranging from 0.0
to 1.0 that controls how the data approach zero as PM approaches B (or becomes less than B).
Median polish is a procedure for fitting a simple additive model robustly by alternately taking the median of each column ((i,j)-log(f(j)) as
the estimate of log(t(i)), and then taking the median of each row ((i,j)-log(t(i)) as the estimate of log(f(j)), until stable estimates of each
value are reached.
This initial estimate of target response (t(i)) is output as signal if the user selects the quick option for PLIER. Note that these estimates
do not minimize the PLIER goodness of fit, and are simply provided for completeness.
Finding a Minimum
One way of finding (a) minimum of such a function is to use Newtons method. Starting with an assignment of values to target responses (t) and feature responses (f), given the rate at which the function is varying, find a good step size to take to find the next assignment
of values to target responses and feature responses. Newtons method turns out to be computationally intensive and requires inversion
of large matrices, and therefore an approximation to the method is used instead.
Identifiability Constraint
Since the method deals with two variables multiplied by each other, it is always the case that feature responses can be multiplied by
some constant and target responses can be divided by some constant and wind up with the same fit to the data. To resolve this ambiguity, it is required that sum(f(j)) = n, the number of features (not counting any used for bias removal). Note that feature responses are relative to the other features in a probe set and cannot be directly compared between probe sets.
Avoiding Local Minima
If good values for target and feature response have been found, so that there are no local improvements possible, it may be that the
method is trapped in a local minimum, and possible improvements to the estimates should be checked for. The natural method for finding an improved estimate is to examine the values a variable can plausibly take on to see if any of them improve the estimate.
In particular, in this case, each feature pair in each experiment provides a natural estimate for alternative values for target and feature
response that might be near to better minima. The reasoning is that a minimum should be near to the value that is perfect for some
feature in some experiment. These two estimates are:
t (i) = PM(i,j) MM(i,j)
f(j)
II
II
AFFYMETRIX, INC.
AFFYMETRIX UK Ltd
Mita NN Bldg., 16 F
4-1-23 Shiba, Minato-ku,
Tokyo 108-0014 Japan
Tel: +81-(0)3-5730-8200
Fax: +81-(0)3-5730-8201
[email protected]
[email protected]
www.affymetrix.com Please visit our web site for international distributor contact information.
For research use only. Not for use in diagnostic procedures.
Part No. 701903 Rev. 1
, GeneChip, HuSNP, GenFlex, Flying Objective, CustomExpress, CustomSeq , NetAffx, Tools To Take You As Far As Your Vision,
2005 Affymetrix, Inc. All rights reserved. Affymetrix,
The Way Ahead, Powered by Affymetrix, and GeneChip-compatible, are trademarks of Affymetrix, Inc. All other trademarks are the property of their respective owners. Software products may
be covered by one or more of the following patents: U.S. Patent Nos. 5,733,729; 5,795,716; 5,974,164; 6,066,454; 6,090,555, 6,185,561 6,188,783, 6,223,127; 6,228,593; 6,229,911; 6,242,180;
6,308,170; 6,361,937; 6,420,108; 6,484,183; 6,505,125; 6510,391; 6,532,462; 6,546,340; 6,687,692; 6,607,887; and other U.S. or foreign patents.