API reference
Top-level
Configuration dataclass for the CAPELLINI pipeline. |
|
Top-level pipeline orchestrator. |
Stages
Pre-flight stage: folder initialization and optional fresh-start cleanup. |
|
DADA2 stage: run DADA2_Pipe.R and move the generated FASTA. |
|
NCBI mapping stage: download taxonomy names and assign real NCBI taxids. |
|
MMSeqs2 stage: 16S reference, easy-search, and 3-layer NCBI/GCA assignment. |
|
SpacePHARER stage: spacer extraction, DB creation, prediction, and statistics. |
|
ProCs stage: bacterial/viral protein extraction, clustering, and PA matrix. |
|
Network stage: common abundance, shrinkage, raw/smoothed CRISPR, residual X*. |
Utilities
I/O helpers: file reading, writing, downloading, subprocess execution. |
|
Taxonomy helpers: NCBI name lookup, index sanitization, bacteria taxonomy cleaning. |
|
Numerical transformations: CLR, row normalization, shrinkage correlation. |
|
Network-level utilities: residual message passing, CRISPR smoothing, taxonomy kernels, abundance helpers. |