OpenPipelines.bio v2.1.0
BREAKING CHANGES
-
Deprecation of
metadata/duplicate_obsandmetadata/duplicate_varcomponents (PR #952). -
Deprecation of
workflows/annotation/scgpt_integration_knncomponent (PR #952). -
annotate/scanvi: Remove scarches functionality from this component, as it is already covered inintegrate/scarches(PR #986).
NEW FUNCTIONALITY
-
dataflow/concatenate_h5mu: addmodalityparameter (PR #977). -
filter_with_scrublet: addexpected_doublet_rate,stdev_doublet_rate,n_neighborsandsim_doublet_ratioarguments (PR #974). -
feature_annotation/aling_query_reference: Added a component to align a query and reference dataset (PR #948, #958, #972). -
workflows/qc/qcworkflow: Added ribosomal gene detection (PR #961). -
workflows/rna/rna_singlesample,workflows/multiomics/process_samplesworkflows: Added ribosomal gene detection (PR #968). -
scanvi: enable CUDA acceleration (PR #969). -
workflows/annotation/scvi_knnworkflow: Cell-type annotation based on scVI integration followed by KNN label transfer (PR #954). -
convert/from_h5ad_to_seurat: Add component to convert from h5ad to Seurat (PR #980). -
workflows/annotation/scanvi_scarchesworkflow: Cell-type annotation based on scANVI integration and annotation with scArches for reference mapping (PR #898). -
integrate/scarches: Implemented functionality to align the query dataset with the model registry and extend functionality to predict labels for scANVI models (PR #898). -
workflows/annotation/harmony_knnworkflow: Cell-type annotation based on harmony integration with KNN label transfer (PR #836). -
from_cellranger_multi_to_h5mu: add support forcustommodality (PR #982). -
integrate/scvi: Enable passing any .var field for gene name information instead of .var index, using the--var_gene_namesparameter (PR #986).
MAJOR CHANGES
-
Several components: when a component processes a single modality, only that modality is read into memory (PR #944)
-
The
transfer/publishcomponent is deprecated and will be removed in a future major release (PR #941).
MINOR CHANGES
-
Bump viash to
0.9.3(PR #995). -
Several workflows: refactor neighbors, leiden and UMAP in a separate subworkflow (PR #942 and PR #949).
-
grep_annotation_columnandsubset_obsp: Fix compatibility for SciPy (PR #945). -
popv: Pin numpy<2 after new release of scvi-tools (PR #946). -
Various components (
scgptandannotate): Add resource labels (PR #947, PR #950). -
feature_annotation/highly_variable_features_scanpy: Enable calculation of HVG on a subset of genes (PR #957, PR #959). -
integrate/scvi,integrate/totalviandintegrate/scarches: update base image to nvcr.io/nvidia/pytorch:24.12-py3, pin scvi-tools version to 1.1.5, unpin jax and jaxlib version (PR #970). -
annotate/celltypist: Enable passing any layer with log normalized counts, enforce checking whether counts are log normalized (PR #971). -
process_10xh5/filter_10xh5: update container base to ubuntu 24.04 (PR #983).
BUG FIXES
-
Fix
-stubruns (PR #1000). -
cluster/leiden: Fix an issue where insufficient shared memory (size of/dev/shm) causes the processing to hang. -
utils/subset_vars: Convert .var column used for subsetting of dtype "boolean" to dtype "bool" when it doesn't contain NaN values (PR #959). -
resources_test_scripts/annotation_test_data.sh: Add a layer to the annotation reference dataset with log normalized counts (PR #960). -
annotate/celltypist: Fix missing values in annotation column caused by index misalignment (PR #976). -
workflows/annotation/scgpt_annotationandworkflows/integrate/scgpt_leiden: Parameterization of HVG flavor with default methodcell_rangerinstead ofseurat_v3(PR #979). -
dataflow/merge: Resolved an issue where merging two MuData objects with overlappingvarorobscolumns sometimes resulted in an unsupported nullable dtype (PR #990), for instance when mergingpd.IntegerDtypeandpd.FloatDtype. These columns are now correctly cast to their native numpy dtypes before writing. -
workflows/annotation/harmony_knn: Only process RNA modality in the workflow (PR #988). -
Documentation CI: Fix building the documentation using CI (PR #1003).