Class ProbeMapperConfig
java.lang.Object
ubic.gemma.core.analysis.sequence.ProbeMapperConfig
Holds parameters for how mapping should be done.
- Author:
- paul
-
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final booleanstatic final booleanstatic final doubleSequence identity below which we throw hits away (expressed as a fraction)static final doubleFraction of bases which must overlap with an annotated exon.static final doubleBLAT score threshold below which we do not consider hits.static final booleanstatic final intstatic final intSequences which hybridize to this many or more sites in the genome are candidates to be considered non-specific.static final intSequences which hybridize to this many or more sites in the genome are candidates to be considered non-specific.static final doubleSequences which have more than this fraction accounted for by repeats (via repeatmasker) will not be examined if they produce multiple alignments to the genome, regardless of the alignment quality. -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptiondoubledoubledoubledoubledoubledoubleprotected intprotected voidbooleanbooleanbooleanbooleanbooleanbooleanbooleanbooleanvoidsetAllowPredictedGenes(boolean allowPredictedGenes) voidSet to use no tracks.voidSet to use all tracks, including ESTsvoidsetBlatScoreThreshold(double blatScoreThreshold) voidsetIdentityThreshold(double identityThreshold) voidsetMaximumRepeatFraction(double maximumRepeatFraction) voidsetMinimumExonOverlapFraction(double minimumExonOverlapFraction) voidsetNonRepeatNonSpecificSiteCountThreshold(double nonRepeatNonSpecificSiteCountThreshold) voidsetNonSpecificSiteCountThreshold(double nonSpecificSiteCountThreshold) voidsetTrimNonCanonicalChromosomeHits(boolean trimNonCanonicalChromosomeHits) voidsetUseEnsembl(boolean useEnsembl) voidsetUseEsts(boolean useEsts) voidsetUseKnownGene(boolean useKnownGene) voidsetUseMiRNA(boolean useMiRNA) voidsetUseMrnas(boolean useMrnas) voidsetUseRefGene(boolean useRefGene) toString()
-
Field Details
-
DEFAULT_TRIM_NONCANONICAL_CHROMOSOMES
public static final boolean DEFAULT_TRIM_NONCANONICAL_CHROMOSOMES- See Also:
-
DEFAULT_ALLOW_PARS
public static final boolean DEFAULT_ALLOW_PARS- See Also:
-
DEFAULT_ALLOW_PREDICTED
public static final boolean DEFAULT_ALLOW_PREDICTED- See Also:
-
MAX_WARNINGS
public static final int MAX_WARNINGS- See Also:
-
DEFAULT_IDENTITY_THRESHOLD
public static final double DEFAULT_IDENTITY_THRESHOLDSequence identity below which we throw hits away (expressed as a fraction)- See Also:
-
DEFAULT_MINIMUM_EXON_OVERLAP_FRACTION
public static final double DEFAULT_MINIMUM_EXON_OVERLAP_FRACTIONFraction of bases which must overlap with an annotated exon. This should probably be higher than zero, to avoid "pure intron" hits, but setting it too high can cause loss of sensitivity.- See Also:
-
DEFAULT_SCORE_THRESHOLD
public static final double DEFAULT_SCORE_THRESHOLDBLAT score threshold below which we do not consider hits. This reflects the fraction of aligned bases.- See Also:
-
NON_REPEAT_NON_SPECIFIC_SITE_THRESHOLD
public static final int NON_REPEAT_NON_SPECIFIC_SITE_THRESHOLDSequences which hybridize to this many or more sites in the genome are candidates to be considered non-specific. This is used even if the sequence does not contain a repeat.- See Also:
-
NON_SPECIFIC_SITE_THRESHOLD
public static final int NON_SPECIFIC_SITE_THRESHOLDSequences which hybridize to this many or more sites in the genome are candidates to be considered non-specific. This is used in combination with the REPEAT_FRACTION_MAXIMUM. Note that many sequences which contain repeats nonetheless only align to very few sites in the genome. Similarly, there are sequences that map to multiple sites which are _not_ repeats. This value is also not designed to care about whether the alignments are in known genes or not. Thus setting this too low could result in over-stringent filtering.- See Also:
-
REPEAT_FRACTION_MAXIMUM
public static final double REPEAT_FRACTION_MAXIMUMSequences which have more than this fraction accounted for by repeats (via repeatmasker) will not be examined if they produce multiple alignments to the genome, regardless of the alignment quality.- See Also:
-
-
Constructor Details
-
ProbeMapperConfig
public ProbeMapperConfig()
-
-
Method Details
-
getBlatScoreThreshold
public double getBlatScoreThreshold()- Returns:
- the blatScoreThreshold
-
setBlatScoreThreshold
public void setBlatScoreThreshold(double blatScoreThreshold) - Parameters:
blatScoreThreshold- the blatScoreThreshold to set
-
getIdentityThreshold
public double getIdentityThreshold()- Returns:
- the identityThreshold
-
setIdentityThreshold
public void setIdentityThreshold(double identityThreshold) - Parameters:
identityThreshold- the identityThreshold to set
-
getMaximumRepeatFraction
public double getMaximumRepeatFraction()- Returns:
- the maximumRepeatFraction
-
setMaximumRepeatFraction
public void setMaximumRepeatFraction(double maximumRepeatFraction) - Parameters:
maximumRepeatFraction- the maximumRepeatFraction to set
-
getMinimumExonOverlapFraction
public double getMinimumExonOverlapFraction() -
setMinimumExonOverlapFraction
public void setMinimumExonOverlapFraction(double minimumExonOverlapFraction) -
getNonRepeatNonSpecificSiteCountThreshold
public double getNonRepeatNonSpecificSiteCountThreshold() -
setNonRepeatNonSpecificSiteCountThreshold
public void setNonRepeatNonSpecificSiteCountThreshold(double nonRepeatNonSpecificSiteCountThreshold) -
getNonSpecificSiteCountThreshold
public double getNonSpecificSiteCountThreshold()- Returns:
- the nonSpecificSiteCountThreshold
-
setNonSpecificSiteCountThreshold
public void setNonSpecificSiteCountThreshold(double nonSpecificSiteCountThreshold) - Parameters:
nonSpecificSiteCountThreshold- the nonSpecificSiteCountThreshold to set
-
isAllowPredictedGenes
public boolean isAllowPredictedGenes() -
setAllowPredictedGenes
public void setAllowPredictedGenes(boolean allowPredictedGenes) -
isTrimNonCanonicalChromosomehits
public boolean isTrimNonCanonicalChromosomehits() -
isUseEnsembl
public boolean isUseEnsembl()- Returns:
- the useEnsembl
-
setUseEnsembl
public void setUseEnsembl(boolean useEnsembl) - Parameters:
useEnsembl- the useEnsembl to set
-
isUseEsts
public boolean isUseEsts()- Returns:
- the useEsts
-
setUseEsts
public void setUseEsts(boolean useEsts) - Parameters:
useEsts- the useEsts to set
-
isUseKnownGene
public boolean isUseKnownGene()- Returns:
- the useKnownGene
-
setUseKnownGene
public void setUseKnownGene(boolean useKnownGene) - Parameters:
useKnownGene- the useKnownGene to set
-
isUseMiRNA
public boolean isUseMiRNA()- Returns:
- the useMiRNA
-
setUseMiRNA
public void setUseMiRNA(boolean useMiRNA) - Parameters:
useMiRNA- the useMiRNA to set
-
isUseMrnas
public boolean isUseMrnas()- Returns:
- the useMrnas
-
setUseMrnas
public void setUseMrnas(boolean useMrnas) - Parameters:
useMrnas- the useMrnas to set
-
isUseRefGene
public boolean isUseRefGene()- Returns:
- the useRefGene
-
setUseRefGene
public void setUseRefGene(boolean useRefGene) - Parameters:
useRefGene- the useRefGene to set
-
setAllTracksOff
public void setAllTracksOff()Set to use no tracks. Obviously then nothing will be found, so it is wise to then switch some tracks on. -
setAllTracksOn
public void setAllTracksOn()Set to use all tracks, including ESTs -
setTrimNonCanonicalChromosomeHits
public void setTrimNonCanonicalChromosomeHits(boolean trimNonCanonicalChromosomeHits) -
toString
-
getWarnings
protected int getWarnings() -
incrementWarnings
protected void incrementWarnings()
-