Configuration Options
Last Updated: January 27, 2025
Reference guide to all workflow parameters and configuration options available in the AI Workspace.
Workflow Configuration
Basic Settings
Workflow Name
- Purpose: Identify your analysis for later reference
- Default: Auto-generated timestamp-based name
- Character limit: 50 characters
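The naming rules above can be sketched as a small helper. This is only an illustration: the function name `resolve_workflow_name` and the `workflow-YYYYMMDD-HHMMSS` pattern are hypothetical, since this guide does not specify the exact format of the auto-generated name.

```python
from datetime import datetime

MAX_NAME_LENGTH = 50  # character limit stated in this guide


def resolve_workflow_name(name=None, now=None):
    """Return a usable workflow name, falling back to a timestamp-based default."""
    if name is None or not name.strip():
        now = now or datetime.now()
        # Hypothetical default pattern; the real auto-generated format may differ.
        return now.strftime("workflow-%Y%m%d-%H%M%S")
    name = name.strip()
    if len(name) > MAX_NAME_LENGTH:
        raise ValueError(
            f"Workflow name has {len(name)} characters; the limit is {MAX_NAME_LENGTH}."
        )
    return name
```

Calling `resolve_workflow_name("PBMC run")` returns the name unchanged, while a name longer than 50 characters raises a `ValueError`.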
Workflow Type
- Standard: Single-dataset analysis without reference integration
- Comparison: Integration with CELLxGENE Census reference data
- Requirements: Comparison workflows require the scVI or TranscriptFormer model
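As a rough sketch of the workflow type requirement, the check below accepts any model for a Standard workflow and only scVI or TranscriptFormer for a Comparison workflow. The function name and the string identifiers are assumptions made for illustration, not the Workspace's actual API.

```python
# Models accepted by a Comparison workflow, per the requirement above.
COMPARISON_MODELS = {"scVI", "TranscriptFormer"}


def check_workflow_model(workflow_type, model):
    """Return True if `model` can be used with `workflow_type` (hypothetical helper)."""
    if workflow_type == "Standard":
        return True  # no model restriction is stated for Standard workflows
    if workflow_type == "Comparison":
        return model in COMPARISON_MODELS
    raise ValueError(f"Unknown workflow type: {workflow_type!r}")
```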
Model Configuration
Model Selection
Choose the computational method for dimensionality reduction:
- PCA (Principal Component Analysis)
- scVI (Single-cell Variational Inference)
- TranscriptFormer (Transcript-level embedding model)
TranscriptFormer Variants
Sapiens
- Organism: Human only
- Training: Human-specific pre-training
Exemplar
- Organism: Human and mouse
- Training: Multi-organism pre-training
Metazoa
- Organism: Broad taxonomic support
- Training: Large-scale multi-species
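The variant descriptions can be read as a lookup table from variant to supported organisms. The sketch below is hypothetical: the organism identifiers are illustrative, and because this guide does not list the species covered by Metazoa, the helper returns `None` ("not determined here") for that variant rather than guessing.

```python
from typing import Optional

# None marks a variant whose species list is not spelled out in this guide.
VARIANT_ORGANISMS = {
    "Sapiens": {"human"},
    "Exemplar": {"human", "mouse"},
    "Metazoa": None,  # "broad taxonomic support"; exact species are not listed here
}


def variant_supports(variant: str, organism: str) -> Optional[bool]:
    """Return True/False when the guide is explicit, None when it is not."""
    if variant not in VARIANT_ORGANISMS:
        raise KeyError(f"Unknown TranscriptFormer variant: {variant!r}")
    organisms = VARIANT_ORGANISMS[variant]
    if organisms is None:
        return None
    return organism in organisms
```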
Classification Settings
Cell Type Prediction
Automatic cell type annotation using trained classifiers:
Availability
- Models: scVI and TranscriptFormer only
- Organisms: Human and mouse only
Configuration
- Enable/Disable: Toggle cell type prediction
- Default: Disabled (opt-in feature)
- Processing impact: Adds 5-10 minutes to workflow time
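The availability rules and the opt-in default can be combined into one check. This is a minimal sketch under assumed names (`resolve_cell_type_prediction` and the lowercase organism identifiers are hypothetical), not the Workspace's own validation code.

```python
CLASSIFIER_MODELS = {"scVI", "TranscriptFormer"}
CLASSIFIER_ORGANISMS = {"human", "mouse"}


def resolve_cell_type_prediction(model, organism, requested=False):
    """Return True if prediction should run, False if it stays off."""
    if not requested:
        return False  # disabled by default; the user must opt in
    if model not in CLASSIFIER_MODELS:
        raise ValueError(f"Cell type prediction is not available for model {model!r}.")
    if organism not in CLASSIFIER_ORGANISMS:
        raise ValueError(f"Cell type prediction is not available for organism {organism!r}.")
    return True
```

When the function returns `True`, the workflow can be expected to take roughly 5-10 minutes longer, as noted above.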
Comparison Workflow Settings
Reference Data Selection
Similarity Search
Controls how reference cells are selected from CELLxGENE Census:
Enabled
- Method: Nearest neighbor search
- Selection: Up to 1 million of the most similar reference cells
- Processing time: Longer due to similarity computation
Disabled
- Method: Random sampling
- Selection: Up to 250,000 randomly sampled reference cells
- Processing time: Shorter, since no similarity computation is needed
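The difference between the two modes can be illustrated with a toy example in pure Python. It is a sketch under stated assumptions: cells are represented as coordinate tuples in an embedding space, similarity is Euclidean distance to the closest query cell, and the function name and `seed` parameter are hypothetical. The actual search against CELLxGENE Census may rank and select cells differently.

```python
import math
import random

SIMILARITY_LIMIT = 1_000_000  # cap when similarity search is enabled
RANDOM_LIMIT = 250_000        # cap when similarity search is disabled


def select_reference_cells(query, reference, similarity_search, limit=None, seed=0):
    """Return indices into `reference` chosen by the selected mode."""
    indices = range(len(reference))
    if similarity_search:
        limit = SIMILARITY_LIMIT if limit is None else limit
        # Rank each reference cell by its distance to the closest query cell.
        ranked = sorted(
            indices,
            key=lambda i: min(math.dist(reference[i], q) for q in query),
        )
        return ranked[:limit]
    limit = RANDOM_LIMIT if limit is None else limit
    rng = random.Random(seed)  # seeded so the sample is reproducible
    return sorted(rng.sample(indices, min(limit, len(reference))))
```

The brute-force ranking compares every reference cell with every query cell, which hints at why the enabled mode takes longer; a production system would more likely use an approximate nearest neighbor index.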
Tissue Filters
Available Filters: All tissues available in CELLxGENE Census.
Multiple Selection
- Select multiple tissue types simultaneously
- Logical OR relationship (cells from any selected tissue)
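The OR relationship means a reference cell is kept if its tissue matches any one of the selected tissues. A minimal sketch, assuming cells are dictionaries with a `tissue` key (a hypothetical representation) and that an empty selection applies no filter:

```python
def filter_by_tissue(cells, selected_tissues):
    """Keep cells whose tissue matches ANY of the selected tissues (logical OR)."""
    selected = set(selected_tissues)
    if not selected:
        return list(cells)  # assumption: no selection means no filtering
    return [cell for cell in cells if cell["tissue"] in selected]
```

For example, selecting `["lung", "blood"]` keeps lung cells and blood cells alike; a cell does not need to match both.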
Job Limits
- Concurrent jobs: Maximum 3 jobs per user
- Result retention: 24 hours after completion
- File size: Maximum 5 GB per input file
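A client could check these limits before submitting a job. The sketch below uses hypothetical function names and assumes decimal gigabytes (5 × 10^9 bytes), since this guide does not say whether the limit is decimal or binary.

```python
from datetime import datetime, timedelta

MAX_CONCURRENT_JOBS = 3
RESULT_RETENTION = timedelta(hours=24)
MAX_INPUT_BYTES = 5 * 1000**3  # assumes decimal GB; the guide does not specify


def submission_problems(running_jobs, input_bytes):
    """Return a list of limit violations; an empty list means the job can be submitted."""
    problems = []
    if running_jobs >= MAX_CONCURRENT_JOBS:
        problems.append("concurrent job limit reached")
    if input_bytes > MAX_INPUT_BYTES:
        problems.append("input file exceeds size limit")
    return problems


def results_expire_at(completed_at):
    """Return the time after which results are no longer retained."""
    return completed_at + RESULT_RETENTION
```

Because results are kept for only 24 hours after completion, `results_expire_at` gives the deadline by which they should be downloaded.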
Parameter Validation
Compatibility Checks
The system automatically validates:
- Model-organism compatibility: Ensures supported combinations
- Workflow-model compatibility: Verifies comparison workflow requirements
- Classification requirements: Checks model and organism support