Class SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C extends SingleCellDataLoaderConfig,B extends SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C,B>>
- Direct Known Subclasses:
AnnDataSingleCellDataLoaderConfig.AnnDataSingleCellDataLoaderConfigBuilder, MexSingleCellDataLoaderConfig.MexSingleCellDataLoaderConfigBuilder
- Enclosing class:
SingleCellDataLoaderConfig
-
Constructor Summary
Constructors -
Method Summary
Modifier and Type / Method / Description
abstract C build()
cellTypeAssignmentDescription(String cellTypeAssignmentDescription)
A description to use for the cell type assignment.
cellTypeAssignmentFile(Path cellTypeAssignmentFile)
A location where cell type can be found in a format covered by GenericMetadataSingleCellDataLoader.
cellTypeAssignmentName(String cellTypeAssignmentName)
A name to use for the cell type assignment.
cellTypeAssignmentProtocol(Protocol cellTypeAssignmentProtocol)
A protocol to use for the cell type assignment.
console
Console to use for reporting progress.
discardEmptyCells(Boolean discardEmptyCells)
Discard (or keep) empty cells.
ignoreCompatibleCellTypeFactor(boolean ignoreCompatibleCellTypeFactor)
When re-creating, ignore a compatible cell type factor that may already exist.
ignoreDataVectors(boolean ignoreDataVectors)
Skip single-cell data vectors if possible, or at least skip transformation that would normally be applied to them to make them usable.
ignoreExistingCellTypeAssignment(boolean ignoreExistingCellTypeAssignment)
If true, ignore existing cell type assignment with the same name.
ignoreExistingOtherCellLevelCharacteristics(boolean ignoreExistingOtherCellLevelCharacteristics)
If true, ignore existing other CLCs with the same names.
ignoreSamplesLackingData(boolean ignoreSamplesLackingData)
Ignore individual samples that lack or have incomplete data.
ignoreUnmatchedCellIds(boolean ignoreUnmatchedCellIds)
When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, ignore cell IDs that cannot be matched to a sample.
inferSamplesFromCellIdsOverlap(boolean inferSamplesFromCellIdsOverlap)
When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, use the overlap between the cell IDs from the file and those from the SingleCellDimension to infer sample associations.
markSingleCellTypeAssignmentAsPreferred(boolean markSingleCellTypeAssignmentAsPreferred)
If only one CTA is present, mark it as preferred.
otherCellLevelCharacteristicsDefaultValues(List<String> otherCellLevelCharacteristicsDefaultValues)
Default values to use for the cell-level characteristics.
otherCellLevelCharacteristicsDefaultValueUris(List<String> otherCellLevelCharacteristicsDefaultValueUris)
Default value URIs to use for the cell-level characteristics.
otherCellLevelCharacteristicsFile(Path otherCellLevelCharacteristicsFile)
A location where additional cell-level characteristics can be loaded.
otherCellLevelCharacteristicsNames(List<String> otherCellLevelCharacteristicsNames)
Name to use for the cell-level characteristics.
preferredCellTypeAssignmentName(String preferredCellTypeAssignmentName)
Name of the CTA to mark as preferred, or null to ignore.
preferSinglePrecision(boolean preferSinglePrecision)
Prefer single-precision for storage, even if the data is available with double-precision.
recreateCellTypeFactorIfNecessary(boolean recreateCellTypeFactorIfNecessary)
Re-create the cell type factor if necessary.
replaceExistingCellTypeAssignment(boolean replaceExistingCellTypeAssignment)
If there is already a cell type assignment with the same name, replace it with the new one.
replaceExistingOtherCellLevelCharacteristics(boolean replaceExistingOtherCellLevelCharacteristics)
If there are already other CLCs with the same names, replace them with the new ones.
protected abstract B self()
toString()
transformExecutor(ExecutorService transformExecutor)
Executor service to use to transform single-cell data.
useCellIdsIfSampleNameIsMissing(boolean useCellIdsIfSampleNameIsMissing)
When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, allow for a missing sample_id column, in which case barcodes are used to infer the sample a cell belongs to.
Methods inherited from class ubic.gemma.core.loader.expression.sequencing.SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder
defaultSequencingMetadata, sequencingMetadataFile
Methods inherited from class ubic.gemma.core.loader.expression.DataLoaderConfig.DataLoaderConfigBuilder
dataPath, markQuantitationTypeAsPreferred, markQuantitationTypeAsRecomputedFromRawData, quantitationTypeName, quantitationTypeNewName, quantitationTypeNewScaleType, quantitationTypeNewType, renamingFile, replaceExistingQuantitationType
-
Constructor Details
-
SingleCellDataLoaderConfigBuilder
public SingleCellDataLoaderConfigBuilder()
-
-
Method Details
-
ignoreSamplesLackingData
Ignore individual samples that lack data or have incomplete data. This is only implemented for MEX.
- Returns:
this.
-
cellTypeAssignmentFile
A location where cell types can be found in a format covered by GenericMetadataSingleCellDataLoader. If null, no cell type assignment will be imported. However, the loader might have other provisions for loading cell types. For example, AnnDataSingleCellDataLoader can import cell types using a factor name.
- Returns:
this.
-
cellTypeAssignmentName
A name to use for the cell type assignment. If null, it is left to the specific loader to decide the name.
- Returns:
this.
-
cellTypeAssignmentDescription
A description to use for the cell type assignment.
- Returns:
this.
-
cellTypeAssignmentProtocol
A protocol to use for the cell type assignment. If non-null, this must be persistent.
- Returns:
this.
-
replaceExistingCellTypeAssignment
If there is already a cell type assignment with the same name, replace it with the new one. Note that CTAs with a null name cannot be replaced.
- Returns:
this.
-
ignoreExistingCellTypeAssignment
If true, ignore an existing cell type assignment with the same name.
- Returns:
this.
-
otherCellLevelCharacteristicsFile
A location from which additional cell-level characteristics can be loaded.
- Returns:
this.
-
otherCellLevelCharacteristicsNames
public B otherCellLevelCharacteristicsNames(@Nullable List<String> otherCellLevelCharacteristicsNames)
Names to use for the cell-level characteristics. Must match the number and order of CLCs in otherCellLevelCharacteristicsFile.
- Returns:
this.
-
otherCellLevelCharacteristicsDefaultValues
public B otherCellLevelCharacteristicsDefaultValues(@Nullable List<String> otherCellLevelCharacteristicsDefaultValues)
Default values to use for the cell-level characteristics. Must match the number and order of CLCs in otherCellLevelCharacteristicsFile.
- Returns:
this.
-
otherCellLevelCharacteristicsDefaultValueUris
public B otherCellLevelCharacteristicsDefaultValueUris(@Nullable List<String> otherCellLevelCharacteristicsDefaultValueUris)
Default value URIs to use for the cell-level characteristics. Must match the number and order of CLCs in otherCellLevelCharacteristicsFile.
- Returns:
this.
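The three list options above share one constraint: each list must have one entry per CLC in otherCellLevelCharacteristicsFile, in the same order. A minimal sketch of such a check follows. The helper checkAligned and the column names are hypothetical and not part of Gemma, and treating a null list as "not supplied" is an assumption based on the @Nullable parameters.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not a Gemma class: it only illustrates the
// "same number and order" constraint on the per-CLC lists.
final class ClcListCheck {

    /** Fails if a supplied list does not have exactly one entry per CLC column. */
    static void checkAligned(String optionName, List<String> values, int numberOfClcs) {
        // Assumption: a null list means the option was not supplied at all.
        if (values != null && values.size() != numberOfClcs) {
            throw new IllegalArgumentException(optionName + " has " + values.size()
                    + " entries, but the file declares " + numberOfClcs + " CLCs.");
        }
    }

    public static void main(String[] args) {
        // Hypothetical CLC columns, in the order they appear in the file.
        List<String> clcColumns = Arrays.asList("treatment", "batch");
        List<String> names = Arrays.asList("Treatment", "Batch");
        List<String> defaultValues = Arrays.asList("untreated", "batch 1");

        checkAligned("otherCellLevelCharacteristicsNames", names, clcColumns.size());
        checkAligned("otherCellLevelCharacteristicsDefaultValues", defaultValues, clcColumns.size());
        checkAligned("otherCellLevelCharacteristicsDefaultValueUris", null, clcColumns.size());
        System.out.println("All supplied lists line up with the CLC columns.");
    }
}
```

Only the lengths can be checked this way; a consistent order remains the caller's responsibility.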
-
replaceExistingOtherCellLevelCharacteristics
public B replaceExistingOtherCellLevelCharacteristics(boolean replaceExistingOtherCellLevelCharacteristics)
If there are already other CLCs with the same names, replace them with the new ones. Note that other CLCs with a null name cannot be replaced.
- Returns:
this.
-
ignoreExistingOtherCellLevelCharacteristics
public B ignoreExistingOtherCellLevelCharacteristics(boolean ignoreExistingOtherCellLevelCharacteristics)
If true, ignore existing other CLCs with the same names.
- Returns:
this.
-
inferSamplesFromCellIdsOverlap
When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, use the overlap between the cell IDs from the file and those from the SingleCellDimension to infer sample associations. When this option is set, the sample ID column must be supplied for this strategy to be applied.
- Returns:
this.
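One way to picture overlap-based inference is to pick, for the cell IDs listed under one sample ID in the file, the dataset sample that shares the most cell IDs with them. The sketch below does only that. The class, the method name and the "largest overlap wins" rule are assumptions for illustration; the actual matching rule used by the loader is not described here.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;
import java.util.Set;

// Hypothetical sketch, not the Gemma implementation.
final class CellIdOverlapSketch {

    /**
     * Returns the sample whose cell IDs overlap the most with the given cell IDs,
     * or empty if no sample shares any cell ID. On ties, the first sample in
     * iteration order is kept.
     */
    static Optional<String> sampleWithLargestOverlap(Set<String> cellIdsFromFile,
            Map<String, Set<String>> cellIdsBySample) {
        String best = null;
        int bestOverlap = 0;
        for (Map.Entry<String, Set<String>> entry : cellIdsBySample.entrySet()) {
            Set<String> shared = new HashSet<>(cellIdsFromFile);
            shared.retainAll(entry.getValue()); // keep only IDs present in both sets
            if (shared.size() > bestOverlap) {
                best = entry.getKey();
                bestOverlap = shared.size();
            }
        }
        return Optional.ofNullable(best);
    }

    public static void main(String[] args) {
        Map<String, Set<String>> cellIdsBySample = new LinkedHashMap<>();
        cellIdsBySample.put("sampleA", new HashSet<>(Arrays.asList("AAAC", "AAAG", "AACT")));
        cellIdsBySample.put("sampleB", new HashSet<>(Arrays.asList("AACT", "TTTG")));
        Set<String> fromFile = new HashSet<>(Arrays.asList("AAAG", "AACT"));
        System.out.println(sampleWithLargestOverlap(fromFile, cellIdsBySample));
    }
}
```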
-
useCellIdsIfSampleNameIsMissing
When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, allow for a missing sample_id column, in which case barcodes are used to infer the sample a cell belongs to. This strategy will not work in the case of a barcode collision.
- Returns:
this.
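The collision caveat is easier to see with a small example: a barcode identifies a sample only if it occurs in exactly one sample. The sketch below is hypothetical and not the Gemma implementation. In particular, throwing on a collision and returning an empty result for an unknown barcode are assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;
import java.util.Set;

// Hypothetical sketch, not the Gemma implementation.
final class BarcodeSampleInference {

    /**
     * Finds the single sample that contains the barcode. Returns empty if no
     * sample contains it; throws if more than one does (a barcode collision).
     */
    static Optional<String> inferSample(String barcode, Map<String, Set<String>> barcodesBySample) {
        String match = null;
        for (Map.Entry<String, Set<String>> entry : barcodesBySample.entrySet()) {
            if (entry.getValue().contains(barcode)) {
                if (match != null) {
                    throw new IllegalStateException("Barcode " + barcode + " occurs in both "
                            + match + " and " + entry.getKey() + ".");
                }
                match = entry.getKey();
            }
        }
        return Optional.ofNullable(match);
    }

    public static void main(String[] args) {
        Map<String, Set<String>> barcodesBySample = new LinkedHashMap<>();
        barcodesBySample.put("sampleA", new HashSet<>(Arrays.asList("AAAC", "AAAG")));
        barcodesBySample.put("sampleB", new HashSet<>(Arrays.asList("TTTG", "AAAG")));
        System.out.println(inferSample("AAAC", barcodesBySample)); // found in sampleA only
        System.out.println(inferSample("GGGG", barcodesBySample)); // not found in any sample
    }
}
```

In this sketch, an empty result corresponds to a cell ID that cannot be matched to a sample, which is the case ignoreUnmatchedCellIds is described as tolerating.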
-
ignoreUnmatchedCellIds
When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, ignore cell IDs that cannot be matched to a sample.
- Returns:
this.
-
markSingleCellTypeAssignmentAsPreferred
If only one CTA is present, mark it as preferred.
- Returns:
this.
-
preferredCellTypeAssignmentName
Name of the CTA to mark as preferred, or null to ignore. This setting overrides markSingleCellTypeAssignmentAsPreferred.
- Returns:
this.
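The precedence between preferredCellTypeAssignmentName and markSingleCellTypeAssignmentAsPreferred can be sketched as a small decision function. This is a hypothetical illustration, not Gemma code. It assumes all CTA names are non-null, and it assumes that a requested name that matches no CTA simply results in no preferred CTA; the real loader may react differently.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Optional;

// Hypothetical sketch, not the Gemma implementation.
final class PreferredCtaSketch {

    /** Returns the name of the CTA to mark as preferred, or empty if none should be marked. */
    static Optional<String> choosePreferred(List<String> ctaNames, String preferredName,
            boolean markSingleAsPreferred) {
        if (preferredName != null) {
            // An explicit name takes precedence over the single-CTA rule.
            return ctaNames.contains(preferredName) ? Optional.of(preferredName) : Optional.empty();
        }
        if (markSingleAsPreferred && ctaNames.size() == 1) {
            return Optional.of(ctaNames.get(0));
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        List<String> two = Arrays.asList("author annotations", "reference mapping");
        List<String> one = Collections.singletonList("author annotations");
        System.out.println(choosePreferred(two, "reference mapping", true)); // explicit name wins
        System.out.println(choosePreferred(one, null, true));                // the only CTA is chosen
        System.out.println(choosePreferred(two, null, true));                // nothing is chosen
    }
}
```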
-
recreateCellTypeFactorIfNecessary
Re-create the cell type factor if necessary.
- Returns:
this.
-
ignoreCompatibleCellTypeFactor
When re-creating, ignore a compatible cell type factor that may already exist. Requires recreateCellTypeFactorIfNecessary(boolean) to be set.
- Returns:
this.
-
preferSinglePrecision
Prefer single-precision for storage, even if the data is available with double-precision. This reduces the size of vectors and thus the storage requirement.
- Returns:
this.
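The storage saving comes from the element width: a Java double takes 8 bytes and a float takes 4. The sketch below narrows a vector to single precision. The class and method names are hypothetical, and how Gemma actually encodes its vectors is not covered here.

```java
// Hypothetical sketch, not the Gemma implementation.
final class PrecisionSketch {

    /** Narrows each value to single precision; values a float cannot represent exactly are rounded. */
    static float[] toSinglePrecision(double[] values) {
        float[] narrowed = new float[values.length];
        for (int i = 0; i < values.length; i++) {
            narrowed[i] = (float) values[i];
        }
        return narrowed;
    }

    public static void main(String[] args) {
        double[] counts = { 0.0, 1.5, 3.25 };
        float[] narrowed = toSinglePrecision(counts);
        System.out.println(counts.length * Double.BYTES + " bytes as double, "
                + narrowed.length * Float.BYTES + " bytes as float");
    }
}
```

The trade-off is precision: a float keeps roughly 7 significant decimal digits, against roughly 15 to 16 for a double.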
-
ignoreDataVectors
Skip single-cell data vectors if possible, or at least skip the transformations that would normally be applied to them to make them usable. When this option is set, SingleCellDataLoader.loadVectors(Collection, SingleCellDimension, QuantitationType) will throw an UnsupportedOperationException.
- Returns:
this.
-
discardEmptyCells
Discard (or keep) empty cells. The default is to discard if empty cells are found. Setting this to false will disable the detection entirely.
- Returns:
this.
-
transformExecutor
Executor service to use to transform single-cell data.
- Returns:
this.
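For readers unfamiliar with ExecutorService, the sketch below shows the kind of object a caller might create and pass in: a fixed-size thread pool that runs independent tasks and is shut down once the work is done. The per-sample transform method is a hypothetical stand-in. Whether the loader or the caller is responsible for shutting the executor down is not stated here, so this sketch assumes the caller owns it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch, not the Gemma implementation.
final class TransformExecutorSketch {

    /** Hypothetical stand-in for an expensive per-sample transformation. */
    static int transform(int sampleIndex) {
        return sampleIndex * 2;
    }

    /** Submits one task per sample and collects the results in submission order. */
    static List<Integer> transformAll(int numberOfSamples, ExecutorService executor) {
        List<Future<Integer>> futures = new ArrayList<>();
        for (int i = 0; i < numberOfSamples; i++) {
            final int sampleIndex = i;
            Callable<Integer> task = () -> transform(sampleIndex);
            futures.add(executor.submit(task));
        }
        List<Integer> results = new ArrayList<>();
        try {
            for (Future<Integer> future : futures) {
                results.add(future.get()); // blocks until that task has finished
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting for a transformation.", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("A transformation failed.", e.getCause());
        }
        return results;
    }

    public static void main(String[] args) {
        ExecutorService executor = Executors.newFixedThreadPool(4);
        try {
            System.out.println(transformAll(3, executor));
        } finally {
            executor.shutdown(); // assumption: the caller owns the executor
        }
    }
}
```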
-
console
Console to use for reporting progress.
- Returns:
this.
-
self
- Specified by:
self in class SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder<C extends SingleCellDataLoaderConfig, B extends SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C, B>>
-
build
- Specified by:
build in class SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder<C extends SingleCellDataLoaderConfig, B extends SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C, B>>
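The type parameters C and B, together with the abstract self() and build() methods, have the shape of a self-typed builder: setters declared on a parent builder return B, so a chain started on a subclass builder keeps the subclass type. The sketch below reproduces that shape with hypothetical stand-in classes (BaseConfig, MexConfig and MexBuilder are not the Gemma classes, and only two or three options are modelled).

```java
// Hypothetical stand-ins that mimic the generic shape of the builder; NOT the Gemma classes.
class BaseConfig {
    final String cellTypeAssignmentName;
    final boolean preferSinglePrecision;

    BaseConfig(Builder<?, ?> builder) {
        this.cellTypeAssignmentName = builder.cellTypeAssignmentName;
        this.preferSinglePrecision = builder.preferSinglePrecision;
    }

    abstract static class Builder<C extends BaseConfig, B extends Builder<C, B>> {
        String cellTypeAssignmentName;
        boolean preferSinglePrecision;

        B cellTypeAssignmentName(String cellTypeAssignmentName) {
            this.cellTypeAssignmentName = cellTypeAssignmentName;
            return self(); // returns the concrete builder type, not the parent type
        }

        B preferSinglePrecision(boolean preferSinglePrecision) {
            this.preferSinglePrecision = preferSinglePrecision;
            return self();
        }

        protected abstract B self();

        abstract C build();
    }
}

class MexConfig extends BaseConfig {
    final boolean ignoreSamplesLackingData;

    MexConfig(MexBuilder builder) {
        super(builder);
        this.ignoreSamplesLackingData = builder.ignoreSamplesLackingData;
    }

    static final class MexBuilder extends BaseConfig.Builder<MexConfig, MexBuilder> {
        boolean ignoreSamplesLackingData;

        MexBuilder ignoreSamplesLackingData(boolean ignoreSamplesLackingData) {
            this.ignoreSamplesLackingData = ignoreSamplesLackingData;
            return this;
        }

        @Override
        protected MexBuilder self() {
            return this;
        }

        @Override
        MexConfig build() {
            return new MexConfig(this);
        }
    }

    public static void main(String[] args) {
        // The parent setter returns MexBuilder, so the subclass setter can follow it in the chain.
        MexConfig config = new MexBuilder()
                .cellTypeAssignmentName("author annotations")
                .ignoreSamplesLackingData(true)
                .preferSinglePrecision(true)
                .build();
        System.out.println(config.cellTypeAssignmentName + ", " + config.ignoreSamplesLackingData);
    }
}
```

Without the self type, cellTypeAssignmentName would return the parent builder and the call to ignoreSamplesLackingData would not compile.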
-
toString
- Overrides:
toString in class SequencingDataLoaderConfig.SequencingDataLoaderConfigBuilder<C extends SingleCellDataLoaderConfig, B extends SingleCellDataLoaderConfig.SingleCellDataLoaderConfigBuilder<C, B>>
-