Class SingleCellDataLoaderConfig
- Direct Known Subclasses:
AnnDataSingleCellDataLoaderConfig,MexSingleCellDataLoaderConfig
- Author:
- poirigui
- See Also:
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classSingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C extends SingleCellDataLoaderConfig,B extends SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C, B>> Nested classes/interfaces inherited from class ubic.gemma.core.loader.expression.sequencing.SequencingDataLoaderConfig
SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder<C extends SequencingDataLoaderConfig,B extends SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder<C, B>> Nested classes/interfaces inherited from class ubic.gemma.core.loader.expression.DataLoaderConfig
DataLoaderConfig.DataLoaderConfigBuilder<C extends DataLoaderConfig,B extends DataLoaderConfig.DataLoaderConfigBuilder<C, B>> -
Field Summary
FieldsModifier and TypeFieldDescriptionExecutor service to use to transform single-cell data. -
Constructor Summary
ConstructorsModifierConstructorDescriptionprotected -
Method Summary
Modifier and TypeMethodDescriptionbuilder()A description to use for the cell type assignment.A location where cell type can be found in a format covered byGenericMetadataSingleCellDataLoader.A name to use for the cell type assignment.A protocol to use for the cell type assignment.Console to use for reporting progress.Discard (or keep) empty cells.Default values to use for the cell-level characteristics.Default value URIs to use for the cell-level characteristics.A location where additional cell-level characteristics can be loaded.Name to use for the cell-level characteristics.Name of the CTA to mark as preferred, or null to ignore.Executor service to use to transform single-cell data.booleanWhen re-creating, ignore a compatible cell type factor that may already exist.booleanSkip single-cell data vectors if possible, or at least skip transformation that would normally be applied to them to make them usable.booleanIf true, ignore existing cell type assignment with the same name.booleanIf true, ignore existing other CLCs with the same names.booleanIgnore individual samples that lack or have incomplete data.booleanWhen parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, ignore cell IDs that cannot be matched to a sample.booleanWhen parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, use the overlap between the cell IDs from the file and those from theSingleCellDimensionto infer sample associations.booleanIf only one CTA is present, mark it as preferred.booleanPrefer single-precision for storage, even if the data is available with double-precision.booleanRe-create the cell type factor if necessary.booleanIf there is already a cell type assignment with the same name, replace it with the new one.booleanIf there are already other CLCs with the same names, replace them with the new ones.booleanWhen parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, allow for a missingsample_idcolumn, in which case barcodes are used to infer the sample a cell belongs to.Methods inherited from class ubic.gemma.core.loader.expression.sequencing.SequencingDataLoaderConfig
getDefaultSequencingMetadata, getSequencingMetadataFileMethods inherited from class ubic.gemma.core.loader.expression.DataLoaderConfig
getDataPath, getQuantitationTypeName, getQuantitationTypeNewName, getQuantitationTypeNewScaleType, getQuantitationTypeNewType, getRenamingFile, isMarkQuantitationTypeAsPreferred, isMarkQuantitationTypeAsRecomputedFromRawData, isReplaceExistingQuantitationType
-
Field Details
-
transformExecutor
Executor service to use to transform single-cell data.
-
-
Constructor Details
-
SingleCellDataLoaderConfig
protected SingleCellDataLoaderConfig(SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<?, ?> b)
-
-
Method Details
-
builder
-
isIgnoreSamplesLackingData
public boolean isIgnoreSamplesLackingData()Ignore individual samples that lack or have incomplete data.This is only implemented for MEX.
-
getCellTypeAssignmentFile
A location where cell type can be found in a format covered byGenericMetadataSingleCellDataLoader.If null, no cell type assignment will be imported. However, the loader might have other provisions for loading cell types. For example,
AnnDataSingleCellDataLoadercan import cell types using a factor name. -
getCellTypeAssignmentName
A name to use for the cell type assignment.If null, it is left to the specific loader to decide the name.
-
getCellTypeAssignmentDescription
A description to use for the cell type assignment. -
getCellTypeAssignmentProtocol
A protocol to use for the cell type assignment.If non-null, this must be persistent.
-
isReplaceExistingCellTypeAssignment
public boolean isReplaceExistingCellTypeAssignment()If there is already a cell type assignment with the same name, replace it with the new one.Note that CTAs with a
nullname cannot be replaced. -
isIgnoreExistingCellTypeAssignment
public boolean isIgnoreExistingCellTypeAssignment()If true, ignore existing cell type assignment with the same name. -
getOtherCellLevelCharacteristicsFile
A location where additional cell-level characteristics can be loaded. -
getOtherCellLevelCharacteristicsNames
Name to use for the cell-level characteristics.Must match the number and order of CLCs in
otherCellLevelCharacteristicsFile. -
getOtherCellLevelCharacteristicsDefaultValues
Default values to use for the cell-level characteristics.Must match the number and order of CLCs in
otherCellLevelCharacteristicsFile. -
getOtherCellLevelCharacteristicsDefaultValueUris
Default value URIs to use for the cell-level characteristics.Must match the number and order of CLCs in
otherCellLevelCharacteristicsFile. -
isReplaceExistingOtherCellLevelCharacteristics
public boolean isReplaceExistingOtherCellLevelCharacteristics()If there are already other CLCs with the same names, replace them with the new ones.Note that other CLCs with a
nullname cannot be replaced. -
isIgnoreExistingOtherCellLevelCharacteristics
public boolean isIgnoreExistingOtherCellLevelCharacteristics()If true, ignore existing other CLCs with the same names. -
isInferSamplesFromCellIdsOverlap
public boolean isInferSamplesFromCellIdsOverlap()When parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, use the overlap between the cell IDs from the file and those from theSingleCellDimensionto infer sample associations.When this option is set, the sample ID column must be supplied for this strategy to be applied.
-
isUseCellIdsIfSampleNameIsMissing
public boolean isUseCellIdsIfSampleNameIsMissing()When parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, allow for a missingsample_idcolumn, in which case barcodes are used to infer the sample a cell belongs to. This strategy will not work in the case of a barcode collision. -
isIgnoreUnmatchedCellIds
public boolean isIgnoreUnmatchedCellIds()When parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, ignore cell IDs that cannot be matched to a sample. -
isMarkSingleCellTypeAssignmentAsPreferred
public boolean isMarkSingleCellTypeAssignmentAsPreferred()If only one CTA is present, mark it as preferred. -
getPreferredCellTypeAssignmentName
Name of the CTA to mark as preferred, or null to ignore.This setting overrides
markSingleCellTypeAssignmentAsPreferred. -
isRecreateCellTypeFactorIfNecessary
public boolean isRecreateCellTypeFactorIfNecessary()Re-create the cell type factor if necessary. -
isIgnoreCompatibleCellTypeFactor
public boolean isIgnoreCompatibleCellTypeFactor()When re-creating, ignore a compatible cell type factor that may already exist.Requires
recreateCellTypeFactorIfNecessaryto be set. -
isPreferSinglePrecision
public boolean isPreferSinglePrecision()Prefer single-precision for storage, even if the data is available with double-precision.This reduces the size of vectors and thus the storage requirement.
-
isIgnoreDataVectors
public boolean isIgnoreDataVectors()Skip single-cell data vectors if possible, or at least skip transformation that would normally be applied to them to make them usable.When this option is set,
SingleCellDataLoader.loadVectors(Collection, SingleCellDimension, QuantitationType)will throw anUnsupportedOperationException. -
getDiscardEmptyCells
Discard (or keep) empty cells.The default is to discard if empty cells are found. Setting this to false will disable the detection entirely.
- See Also:
-
getTransformExecutor
Executor service to use to transform single-cell data. -
getConsole
Console to use for reporting progress.
-