4 research outputs found
Physics-based analysis of Affymetrix microarray data
We analyze publicly available data on Affymetrix microarrays spike-in
experiments on the human HGU133 chipset in which sequences are added in
solution at known concentrations. The spike-in set contains sequences of
bacterial, human and artificial origin. Our analysis is based on a recently
introduced molecular-based model [E. Carlon and T. Heim, Physica A 362, 433
(2006)] which takes into account both probe-target hybridization and
target-target partial hybridization in solution. The hybridization free
energies are obtained from the nearest-neighbor model with experimentally
determined parameters. The molecular-based model suggests a rescaling that
should result in a "collapse" of the data at different concentrations into a
single universal curve. We indeed find such a collapse, with the same
parameters as obtained before for the older HGU95 chip set. The quality of the
collapse varies according to the probe set considered. Artificial sequences,
chosen by Affymetrix to be as different as possible from any other human genome
sequence, generally show a much better collapse and thus a better agreement
with the model than all other sequences. This suggests that the observed
deviations from the predicted collapse are related to the choice of probes or
have a biological origin, rather than being a problem with the proposed model.Comment: 11 pages, 10 figure
Effective affinities in microarray data
In the past couple of years several studies have shown that hybridization in
Affymetrix DNA microarrays can be rather well understood on the basis of simple
models of physical chemistry. In the majority of the cases a Langmuir isotherm
was used to fit experimental data. Although there is a general consensus about
this approach, some discrepancies between different studies are evident. For
instance, some authors have fitted the hybridization affinities from the
microarray fluorescent intensities, while others used affinities obtained from
melting experiments in solution. The former approach yields fitted affinities
that at first sight are only partially consistent with solution values. In this
paper we show that this discrepancy exists only superficially: a sufficiently
complete model provides effective affinities which are fully consistent with
those fitted to experimental data. This link provides new insight on the
relevant processes underlying the functioning of DNA microarrays.Comment: 8 pages, 6 figure
"Hook"-calibration of GeneChip-microarrays: Theory and algorithm
Abstract Background: The improvement of microarray calibration methods is an essential prerequisite for quantitative expression analysis. This issue requires the formulation of an appropriate model describing the basic relationship between the probe intensity and the specific transcript concentration in a complex environment of competing interactions, the estimation of the magnitude these effects and their correction using the intensity information of a given chip and, finally the development of practicable algorithms which judge the quality of a particular hybridization and estimate the expression degree from the intensity values. Results: We present the so-called hook-calibration method which co-processes the log-difference (delta) and -sum (sigma) of the perfect match (PM) and mismatch (MM) probe-intensities. The MM probes are utilized as an internal reference which is subjected to the same hybridization law as the PM, however with modified characteristics. After sequence-specific affinity correction the method fits the Langmuir-adsorption model to the smoothed delta-versus-sigma plot. The geometrical dimensions of this so-called hook-curve characterize the particular hybridization in terms of simple geometric parameters which provide information about the mean non-specific background intensity, the saturation value, the mean PM/MM-sensitivity gain and the fraction of absent probes. This graphical summary spans a metrics system for expression estimates in natural units such as the mean binding constants and the occupancy of the probe spots. The method is single-chip based, i.e. it separately uses the intensities for each selected chip. Conclusion: The hook-method corrects the raw intensities for the non-specific background hybridization in a sequence-specific manner, for the potential saturation of the probe-spots with bound transcripts and for the sequence-specific binding of specific transcripts. The obtained chip characteristics in combination with the sensitivity corrected probe-intensity values provide expression estimates scaled in natural units which are given by the binding constants of the particular hybridization.</p