Class TwoChannelMissingValuesImpl
java.lang.Object
ubic.gemma.core.analysis.preprocess.TwoChannelMissingValuesImpl
- All Implemented Interfaces:
TwoChannelMissingValues
@Component
public class TwoChannelMissingValuesImpl
extends Object
implements TwoChannelMissingValues
Computes a missing value matrix for ratiometric data sets.
Supported formats and special cases:
- Genepix: CH1B_MEDIAN etc; (various versions)
- Incyte GEMTools: RAW_DATA etc (no background values)
- Quantarray: CH1_BKD etc
- F635.Median / F532.Median (genepix as rendered in some data sets)
- CH1_SMTM (found in GPL230)
- Caltech (GPL260)
- Agilent (Ch2BkgMedian etc or CH2_SIG_MEAN etc)
- GSE3251 (ch1.Background etc)
- GPL560 (*_CY3 vs *CY5)
- GSE1501 (NormCH2)
The missing values are computed with the following considerations with respect to available data
- If the preferred quantitation type data is a missing value, then the data are considered missing (for consistency).
- We then do additional checks if there is 'signal' data available.
- If there are background values, they are used to compute signal-to-noise ratios
- If the signal values already contain missing data, these are still considered missing.
- If there are no background values, we try to compute a threshold based on a quantile of the signal
- Otherwise, values will be considered 'present' unless the signal values are zero or missing.
- Author:
- pavlidis
-
Field Summary
Fields inherited from interface ubic.gemma.core.analysis.preprocess.TwoChannelMissingValues
DEFAULT_SIGNAL_TO_NOISE_THRESHOLD -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptioncomputeMissingValues(ExpressionExperiment ee, double signalToNoiseThreshold, Collection<Double> extraMissingValueIndicators)
-
Constructor Details
-
TwoChannelMissingValuesImpl
public TwoChannelMissingValuesImpl()
-
-
Method Details
-
computeMissingValues
@Transactional public Collection<RawExpressionDataVector> computeMissingValues(ExpressionExperiment ee) - Specified by:
computeMissingValuesin interfaceTwoChannelMissingValues- Parameters:
ee- The expression experiment to analyze. The quantitation types to use are selected automatically. If you want more control use other computeMissingValues methods.- Returns:
- collection of raw expression data vectors
-
computeMissingValues
@Transactional public Collection<RawExpressionDataVector> computeMissingValues(ExpressionExperiment ee, double signalToNoiseThreshold, @Nullable Collection<Double> extraMissingValueIndicators) - Specified by:
computeMissingValuesin interfaceTwoChannelMissingValues- Parameters:
ee- The expression experiment to analyze. The quantitation types to use are selected automatically.signalToNoiseThreshold- A value such as 1.5 or 2.0; only spots for which at least ONE of the channel signal is more than signalToNoiseThreshold*background (and the preferred data are not missing) will be considered present.extraMissingValueIndicators- Values that should be considered missing. For example, some data sets use '0'. This can be null or empty and it will be ignored.- Returns:
- DesignElementDataVectors corresponding to a new PRESENTCALL quantitation type for the experiment.
-