Class SingleCellDataLoaderConfig
- Direct Known Subclasses:
AnnDataSingleCellDataLoaderConfig,MexSingleCellDataLoaderConfig
- Author:
- poirigui
- See Also:
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classSingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C extends SingleCellDataLoaderConfig,B extends SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C, B>> Nested classes/interfaces inherited from class ubic.gemma.core.loader.expression.sequencing.SequencingDataLoaderConfig
SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder<C extends SequencingDataLoaderConfig,B extends SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder<C, B>> Nested classes/interfaces inherited from class ubic.gemma.core.loader.expression.DataLoaderConfig
DataLoaderConfig.DataLoaderConfigBuilder<C extends DataLoaderConfig,B extends DataLoaderConfig.DataLoaderConfigBuilder<C, B>> -
Field Summary
FieldsModifier and TypeFieldDescriptionExecutor service to use to transform single-cell data. -
Constructor Summary
ConstructorsModifierConstructorDescriptionprotected -
Method Summary
Modifier and TypeMethodDescriptionbuilder()A description to use for the cell type assignment.A location where cell type can be found in a format covered byGenericMetadataSingleCellDataLoader.A name to use for the cell type assignment.A protocol to use for the cell type assignment.A location where additional cell-level characteristics can be loaded.Name to use for the cell-level characteristics.Name of the CTA to mark as preferred, or null to ignore.Executor service to use to transform single-cell data.booleanIgnore individual samples that lack or have incomplete data.booleanWhen parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, ignore cell IDs that cannot be matched to a sample.booleanWhen parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, use the overlap between the cell IDs from the file and those from theSingleCellDimensionto infer sample associations.booleanIf only one CTA is present, mark it as preferred.booleanPrefer single-precision for storage, even if the data is available with double-precision.booleanIf there is already a cell type assignment with the same name, replace it with the new one.booleanIf there are already other CLCs with the same names, replace them with the new ones.booleanSkip single-cell data transformations, unless they are absolutely necessary.booleanWhen parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, allow for a missingsample_idcolumn, in which case barcodes are used to infer the sample a cell belongs to.Methods inherited from class ubic.gemma.core.loader.expression.sequencing.SequencingDataLoaderConfig
getDefaultSequencingMetadata, getSequencingMetadataFileMethods inherited from class ubic.gemma.core.loader.expression.DataLoaderConfig
getDataPath, getQuantitationTypeName, getQuantitationTypeNewName, getQuantitationTypeNewScaleType, getQuantitationTypeNewType, getRenamingFile, isMarkQuantitationTypeAsPreferred, isMarkQuantitationTypeAsRecomputedFromRawData, isReplaceExistingQuantitationType
-
Field Details
-
transformExecutor
Executor service to use to transform single-cell data.
-
-
Constructor Details
-
SingleCellDataLoaderConfig
protected SingleCellDataLoaderConfig(SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<?, ?> b)
-
-
Method Details
-
builder
-
isIgnoreSamplesLackingData
public boolean isIgnoreSamplesLackingData()Ignore individual samples that lack or have incomplete data.This is only implemented for MEX.
-
getCellTypeAssignmentFile
A location where cell type can be found in a format covered byGenericMetadataSingleCellDataLoader.If null, no cell type assignment will be imported. However, the loader might have other provisions for loading cell types. For example,
AnnDataSingleCellDataLoadercan import cell types using a factor name. -
getCellTypeAssignmentName
A name to use for the cell type assignment.If null, it is left to the specific loader to decide the name.
-
getCellTypeAssignmentDescription
A description to use for the cell type assignment. -
getCellTypeAssignmentProtocol
A protocol to use for the cell type assignment.If non-null, this must be persistent.
-
isReplaceExistingCellTypeAssignment
public boolean isReplaceExistingCellTypeAssignment()If there is already a cell type assignment with the same name, replace it with the new one.Note that CTAs with a
nullname cannot be replaced. -
getOtherCellLevelCharacteristicsFile
A location where additional cell-level characteristics can be loaded. -
getOtherCellLevelCharacteristicsNames
Name to use for the cell-level characteristics.Must match the number and order of CLCs in
otherCellLevelCharacteristicsFile. -
isReplaceExistingOtherCellLevelCharacteristics
public boolean isReplaceExistingOtherCellLevelCharacteristics()If there are already other CLCs with the same names, replace them with the new ones.Note that other CLCs with a
nullname cannot be replaced. -
isInferSamplesFromCellIdsOverlap
public boolean isInferSamplesFromCellIdsOverlap()When parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, use the overlap between the cell IDs from the file and those from theSingleCellDimensionto infer sample associations.When this option is set, the sample ID column must be supplied for this strategy to be applied.
-
isUseCellIdsIfSampleNameIsMissing
public boolean isUseCellIdsIfSampleNameIsMissing()When parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, allow for a missingsample_idcolumn, in which case barcodes are used to infer the sample a cell belongs to. This strategy will not work in the case of a barcode collision. -
isIgnoreUnmatchedCellIds
public boolean isIgnoreUnmatchedCellIds()When parsingcellTypeAssignmentFileandotherCellLevelCharacteristicsFile, ignore cell IDs that cannot be matched to a sample. -
isMarkSingleCellTypeAssignmentAsPreferred
public boolean isMarkSingleCellTypeAssignmentAsPreferred()If only one CTA is present, mark it as preferred. -
getPreferredCellTypeAssignmentName
Name of the CTA to mark as preferred, or null to ignore.This setting overrides
markSingleCellTypeAssignmentAsPreferred. -
isPreferSinglePrecision
public boolean isPreferSinglePrecision()Prefer single-precision for storage, even if the data is available with double-precision.This reduces the size of vectors and thus the storage requirement.
-
isSkipTransformations
public boolean isSkipTransformations()Skip single-cell data transformations, unless they are absolutely necessary. -
getTransformExecutor
Executor service to use to transform single-cell data.
-