API reference
Top-level
Configuration dataclass for the CAPELLINI pipeline. |
|
Top-level pipeline orchestrator. |
Stages
Pre-flight stage: folder initialization and optional fresh-start cleanup. |
|
DADA2 stage: run DADA2_Pipe.R and move the generated FASTA. |
|
NCBI mapping stage: download taxonomy names and assign real NCBI taxids. |
|
MMSeqs2 stage: 16S reference, easy-search, and 3-layer NCBI/GCA assignment. |
|
SpacePHARER stage: spacer extraction, DB creation, prediction, and statistics. |
|
ProCs stage: bacterial/viral protein extraction, clustering, and PA matrix. |
|
Network stage: build common-abundance, shrinkage, CRISPR, smoothed, and X* outputs. |
Utilities
I/O helpers: file reading, writing, subprocess execution. |
|
Taxonomy helpers: NCBI name lookup, index sanitization, bacteria taxonomy cleaning. |
|
Numerical transformations: CLR, closure, message-passing, shrinkage. |
|
Network-level utilities: message passing, CRISPR smoothing, taxonomy kernels, abundance helpers. |