Class SingleCellDataLoaderConfig

Direct Known Subclasses:
AnnDataSingleCellDataLoaderConfig, MexSingleCellDataLoaderConfig

public class SingleCellDataLoaderConfig extends SequencingDataLoaderConfig
Basic configuration for loading single-cell data.
Author:
poirigui
  • Field Details

    • transformExecutor

      @Nullable public ExecutorService transformExecutor
      Executor service to use to transform single-cell data.
  • Constructor Details

  • Method Details

    • builder

    • isIgnoreSamplesLackingData

      public boolean isIgnoreSamplesLackingData()
      Ignore individual samples that lack data or have incomplete data.

      This is only implemented for MEX.

    • getCellTypeAssignmentFile

      @Nullable public Path getCellTypeAssignmentFile()
      A location where cell type assignments can be found, in a format supported by GenericMetadataSingleCellDataLoader.

      If null, no cell type assignment will be imported. However, the loader might have other provisions for loading cell types. For example, AnnDataSingleCellDataLoader can import cell types using a factor name.

    • getCellTypeAssignmentName

      @Nullable public String getCellTypeAssignmentName()
      A name to use for the cell type assignment.

      If null, it is left to the specific loader to decide the name.

    • getCellTypeAssignmentDescription

      @Nullable public String getCellTypeAssignmentDescription()
      A description to use for the cell type assignment.
    • getCellTypeAssignmentProtocol

      @Nullable public Protocol getCellTypeAssignmentProtocol()
      A protocol to use for the cell type assignment.

      If non-null, this must be persistent.

    • isReplaceExistingCellTypeAssignment

      public boolean isReplaceExistingCellTypeAssignment()
      If there is already a cell type assignment with the same name, replace it with the new one.

      Note that cell type assignments (CTAs) with a null name cannot be replaced.

    • getOtherCellLevelCharacteristicsFile

      @Nullable public Path getOtherCellLevelCharacteristicsFile()
      A location where additional cell-level characteristics can be loaded.
    • getOtherCellLevelCharacteristicsNames

      @Nullable public List<String> getOtherCellLevelCharacteristicsNames()
      Names to use for the cell-level characteristics.

      Must match the number and order of cell-level characteristics (CLCs) in otherCellLevelCharacteristicsFile.
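
      The number-and-order requirement can be illustrated with a minimal sketch. It is not the loader's actual validation: the class ClcNameCheck, the method pairNames, and the column labels are hypothetical names chosen for illustration. It only shows that the name at position i is applied to the characteristic at position i, and that lists of different lengths are rejected.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of the documented API.
final class ClcNameCheck {

    /**
     * Pairs each supplied name with the characteristic found at the same
     * position in the file, rejecting lists of different lengths.
     */
    static List<String> pairNames(List<String> names, List<String> columnsInFile) {
        if (names.size() != columnsInFile.size()) {
            throw new IllegalArgumentException("Expected " + columnsInFile.size()
                    + " names, but got " + names.size() + ".");
        }
        List<String> pairs = new ArrayList<>();
        for (int i = 0; i < names.size(); i++) {
            // The name at position i applies to the characteristic at position i.
            pairs.add(columnsInFile.get(i) + " -> " + names.get(i));
        }
        return pairs;
    }

    public static void main(String[] args) {
        System.out.println(pairNames(
                Arrays.asList("treatment", "batch"),
                Arrays.asList("col_a", "col_b")));
    }
}
```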

    • isReplaceExistingOtherCellLevelCharacteristics

      public boolean isReplaceExistingOtherCellLevelCharacteristics()
      If there are already other CLCs with the same names, replace them with the new ones.

      Note that other CLCs with a null name cannot be replaced.

    • isInferSamplesFromCellIdsOverlap

      public boolean isInferSamplesFromCellIdsOverlap()
      When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, use the overlap between the cell IDs from the file and those from the SingleCellDimension to infer sample associations.

      For this strategy to be applied, the file must supply a sample ID column.
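
      One plausible reading of overlap-based inference is sketched below. This is an assumption, not the loader's documented algorithm: the cell IDs that the file lists under one of its sample IDs are compared with the cell IDs of each known sample, and the sample sharing the most IDs is chosen. SampleOverlapSketch and bestMatch are hypothetical names, and ties are not handled.

```java
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Arrays;
import java.util.Map;
import java.util.Optional;
import java.util.Set;

// Hypothetical sketch, not part of the documented API.
final class SampleOverlapSketch {

    /**
     * Returns the known sample that shares the most cell IDs with the given
     * set, or an empty Optional if no sample shares any cell ID.
     */
    static Optional<String> bestMatch(Set<String> cellIdsInFile,
                                      Map<String, Set<String>> cellIdsBySample) {
        String best = null;
        int bestOverlap = 0;
        for (Map.Entry<String, Set<String>> entry : cellIdsBySample.entrySet()) {
            // Intersect on a copy so that the inputs are left untouched.
            Set<String> shared = new HashSet<>(cellIdsInFile);
            shared.retainAll(entry.getValue());
            if (shared.size() > bestOverlap) {
                bestOverlap = shared.size();
                best = entry.getKey();
            }
        }
        return Optional.ofNullable(best);
    }

    public static void main(String[] args) {
        Map<String, Set<String>> known = new LinkedHashMap<>();
        known.put("sampleA", new HashSet<>(Arrays.asList("AAAC", "AAAG")));
        known.put("sampleB", new HashSet<>(Arrays.asList("CCCA", "CCCT", "CCCG")));
        Set<String> fromFile = new HashSet<>(Arrays.asList("CCCA", "CCCT", "AAAC"));
        System.out.println(bestMatch(fromFile, known));
    }
}
```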

    • isUseCellIdsIfSampleNameIsMissing

      public boolean isUseCellIdsIfSampleNameIsMissing()
      When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, allow for a missing sample_id column, in which case barcodes are used to infer the sample a cell belongs to. This strategy will not work in the case of a barcode collision.
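
      Why a barcode collision defeats this strategy can be seen in a small sketch. It assumes a plain barcode-to-sample lookup, which may differ from what the loader does internally; BarcodeLookupSketch and sampleByBarcode are hypothetical names. A barcode that occurs in more than one sample cannot identify a single sample, so the sketch drops it from the lookup.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch, not part of the documented API.
final class BarcodeLookupSketch {

    /**
     * Builds a barcode-to-sample lookup, leaving out every barcode that
     * occurs in more than one sample since it is ambiguous.
     */
    static Map<String, String> sampleByBarcode(Map<String, List<String>> barcodesBySample) {
        Map<String, String> index = new HashMap<>();
        Set<String> ambiguous = new HashSet<>();
        for (Map.Entry<String, List<String>> entry : barcodesBySample.entrySet()) {
            for (String barcode : entry.getValue()) {
                String previous = index.putIfAbsent(barcode, entry.getKey());
                if (previous != null && !previous.equals(entry.getKey())) {
                    // Collision: the same barcode is used by two samples.
                    ambiguous.add(barcode);
                }
            }
        }
        index.keySet().removeAll(ambiguous);
        return index;
    }

    public static void main(String[] args) {
        Map<String, List<String>> barcodes = new HashMap<>();
        barcodes.put("s1", Arrays.asList("AAAC", "AAAG"));
        barcodes.put("s2", Arrays.asList("AAAC", "TTTG"));
        // "AAAC" collides between s1 and s2, so it is absent from the result.
        System.out.println(sampleByBarcode(barcodes));
    }
}
```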
    • isIgnoreUnmatchedCellIds

      public boolean isIgnoreUnmatchedCellIds()
      When parsing cellTypeAssignmentFile and otherCellLevelCharacteristicsFile, ignore cell IDs that cannot be matched to a sample.
    • isMarkSingleCellTypeAssignmentAsPreferred

      public boolean isMarkSingleCellTypeAssignmentAsPreferred()
      If only one CTA is present, mark it as preferred.
    • getPreferredCellTypeAssignmentName

      @Nullable public String getPreferredCellTypeAssignmentName()
      Name of the CTA to mark as preferred, or null to ignore.

      This setting overrides markSingleCellTypeAssignmentAsPreferred.
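
      The precedence between the two settings can be sketched as follows. PreferredCtaSketch and preferredIndex are hypothetical names, and returning -1 when the requested name is absent is an assumption of this sketch; the real loader may report an error instead.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch, not part of the documented API.
final class PreferredCtaSketch {

    /**
     * Returns the index of the CTA to mark as preferred, or -1 if none.
     * An explicit name takes precedence over the single-CTA rule.
     */
    static int preferredIndex(List<String> ctaNames, String preferredName,
                              boolean markSingleAsPreferred) {
        if (preferredName != null) {
            // Explicit choice: the single-CTA rule is not consulted at all.
            return ctaNames.indexOf(preferredName);
        }
        if (markSingleAsPreferred && ctaNames.size() == 1) {
            return 0;
        }
        return -1;
    }

    public static void main(String[] args) {
        System.out.println(preferredIndex(Arrays.asList("manual", "automated"), "automated", true));
        System.out.println(preferredIndex(Arrays.asList("manual"), null, true));
    }
}
```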

    • isPreferSinglePrecision

      public boolean isPreferSinglePrecision()
      Prefer single precision for storage, even if the data is available in double precision.

      This reduces the size of vectors and thus the storage requirement.
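
      The storage saving follows from the width of the two types: a Java float takes 4 bytes and a double takes 8. The sketch below is a generic illustration rather than the loader's storage code; PrecisionSketch and its methods are hypothetical names, and encoding vectors with ByteBuffer is an assumption made for the example. Note that narrowing to float can lose precision.

```java
import java.nio.ByteBuffer;

// Hypothetical sketch, not part of the documented API.
final class PrecisionSketch {

    /** Encodes the vector using 8 bytes per value. */
    static byte[] asDoubles(double[] vector) {
        ByteBuffer buffer = ByteBuffer.allocate(vector.length * Double.BYTES);
        for (double value : vector) {
            buffer.putDouble(value);
        }
        return buffer.array();
    }

    /** Encodes the vector using 4 bytes per value, narrowing each value to float. */
    static byte[] asFloats(double[] vector) {
        ByteBuffer buffer = ByteBuffer.allocate(vector.length * Float.BYTES);
        for (double value : vector) {
            buffer.putFloat((float) value);
        }
        return buffer.array();
    }

    public static void main(String[] args) {
        double[] vector = { 1.0, 2.0, 3.0 };
        System.out.println(asDoubles(vector).length + " bytes as double");
        System.out.println(asFloats(vector).length + " bytes as float");
    }
}
```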

    • isSkipTransformations

      public boolean isSkipTransformations()
      Skip single-cell data transformations, unless they are absolutely necessary.
    • getTransformExecutor

      @Nullable public ExecutorService getTransformExecutor()
      Executor service to use to transform single-cell data.
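
      How an ExecutorService might be used to run transformations in parallel is sketched below with standard java.util.concurrent calls only. TransformExecutorSketch, transformAll, and the log1p transformation are hypothetical, and falling back to the calling thread when the executor is null is an assumption of this sketch; the documentation above does not say what the loader does in that case.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch, not part of the documented API.
final class TransformExecutorSketch {

    /** Example transformation: log(1 + x) applied to every value. */
    static double[] log1p(double[] vector) {
        double[] result = new double[vector.length];
        for (int i = 0; i < vector.length; i++) {
            result[i] = Math.log1p(vector[i]);
        }
        return result;
    }

    /**
     * Transforms every vector, in input order. Uses the executor if one is
     * given, otherwise the calling thread (an assumption of this sketch).
     */
    static List<double[]> transformAll(List<double[]> vectors, ExecutorService executor) {
        List<double[]> results = new ArrayList<>();
        if (executor == null) {
            for (double[] vector : vectors) {
                results.add(log1p(vector));
            }
            return results;
        }
        List<Future<double[]>> futures = new ArrayList<>();
        for (double[] vector : vectors) {
            Callable<double[]> task = () -> log1p(vector);
            futures.add(executor.submit(task));
        }
        try {
            for (Future<double[]> future : futures) {
                results.add(future.get());
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while transforming.", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("A transformation failed.", e.getCause());
        }
        return results;
    }

    public static void main(String[] args) {
        ExecutorService executor = Executors.newFixedThreadPool(2);
        try {
            List<double[]> out = transformAll(
                    Arrays.asList(new double[] { 0.0, 1.0 }, new double[] { 3.0 }), executor);
            System.out.println(Arrays.toString(out.get(0)));
        } finally {
            // The caller owns the executor and is responsible for shutting it down.
            executor.shutdown();
        }
    }
}
```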