snapatac2.pp.add_frip
- snapatac2.pp.add_frip(adata, regions, *, inplace=True, n_jobs=8)
Add fraction of reads in peaks (FRiP) to the AnnData object.
import_data must be run first in order to use this function.
- Parameters:
adata (AnnData | list[AnnData]) – The (annotated) data matrix of shape n_obs x n_vars. Rows correspond to cells and columns to regions. adata could also be a list of AnnData objects. In this case, the function will be applied to each AnnData object in parallel.
regions (dict[str, Path | list[str]]) – A dictionary containing the peak sets to compute FRiP. The keys are peak set names and the values are either a bed file name or a list of strings representing genomic regions. For example, {"promoter_frac": "promoter.bed", "enhancer_frac": ["chr1:100-200", "chr2:300-400"]}.
inplace (bool) – Whether to add the results to adata.obs or return it as a dictionary.
n_jobs (int) – Number of jobs to run in parallel when adata is a list. If n_jobs=-1, all CPUs will be used.
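To illustrate the two value forms that regions accepts, the sketch below builds a dictionary like the one in the example above and parses the "chrom:start-end" strings into tuples. The Region type and the parse_region helper are hypothetical names introduced here for illustration; they are not part of SnapATAC2, and the library may validate region strings differently.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not a SnapATAC2 class)."""
    chrom: str
    start: int
    end: int


def parse_region(text: str) -> Region:
    """Parse a 'chrom:start-end' string such as 'chr1:100-200'."""
    # Split on the last ':' so the chromosome name is everything before it.
    chrom, _, span = text.rpartition(":")
    start_str, _, end_str = span.partition("-")
    if not chrom or not start_str or not end_str:
        raise ValueError(f"not a 'chrom:start-end' region: {text!r}")
    start, end = int(start_str), int(end_str)
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(chrom, start, end)


# Same shape as the dictionary in the parameter description:
# a BED file name for one peak set, a list of region strings for another.
regions = {
    "promoter_frac": "promoter.bed",
    "enhancer_frac": ["chr1:100-200", "chr2:300-400"],
}

# Only the list-valued entries hold region strings; file names are left alone.
parsed = {
    name: [parse_region(r) for r in value]
    for name, value in regions.items()
    if isinstance(value, list)
}
```

Checking region strings this way before calling add_frip can surface malformed entries early, although the function presumably performs its own validation.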
- Returns:
If inplace=True, the results are added directly to adata.obs. Otherwise, a dictionary containing the results is returned.
- Return type:
dict[str, list[float]] | list[dict[str, list[float]]] | None
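With inplace=False and a single AnnData object, the return type above is a dictionary that maps each peak set name to a list of floats. The sketch below shows one way such a dictionary might be used to flag low-FRiP cells. It runs on a hand-written stand-in dictionary rather than a real add_frip result, and it assumes the values are ordered like the cells in adata.obs_names, which the text above does not state explicitly. The helper name, the barcodes, and the 0.2 threshold are all hypothetical.

```python
def low_frip_cells(barcodes, frip_by_set, peak_set, threshold):
    """Return the barcodes whose FRiP for `peak_set` is below `threshold`."""
    values = frip_by_set[peak_set]
    if len(values) != len(barcodes):
        raise ValueError("expected one FRiP value per cell")
    return [bc for bc, value in zip(barcodes, values) if value < threshold]


# Stand-in for the dictionary returned when inplace=False (made-up values).
frip_by_set = {"peaks_frac": [0.71, 0.12, 0.69, 0.05]}

# Stand-in for adata.obs_names (made-up barcodes, assumed to be in the same order).
barcodes = ["cell_a", "cell_b", "cell_c", "cell_d"]

flagged = low_frip_cells(barcodes, frip_by_set, "peaks_frac", threshold=0.2)
print(flagged)  # ['cell_b', 'cell_d']
```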
Examples
>>> import snapatac2 as snap
>>> data = snap.read(snap.datasets.pbmc5k(type='h5ad'), backed=None)
>>> snap.pp.add_frip(data, {"peaks_frac": snap.datasets.cre_HEA()})
>>> print(data.obs['peaks_frac'].head())
index
AAACGAAAGACGTCAG-1    0.708841
AAACGAAAGATTGACA-1    0.731711
AAACGAAAGGGTCCCT-1    0.692434
AAACGAACAATTGTGC-1    0.694849
AAACGAACACTCGTGG-1    0.687787
Name: peaks_frac, dtype: float64
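For intuition about the values printed above, FRiP for a cell is the number of that cell's reads that overlap a peak set divided by the cell's total number of reads. The following is a minimal pure-Python sketch of that arithmetic on toy data. It is not SnapATAC2's implementation: the function names are hypothetical, intervals are assumed to be half-open, and the library may count insertion sites rather than whole fragments or use a different overlap rule.

```python
def overlaps(read, peak):
    """True if two (chrom, start, end) half-open intervals share a base."""
    r_chrom, r_start, r_end = read
    p_chrom, p_start, p_end = peak
    return r_chrom == p_chrom and r_start < p_end and p_start < r_end


def fraction_in_peaks(reads, peaks):
    """Fraction of `reads` that overlap at least one interval in `peaks`."""
    if not reads:
        return 0.0  # convention chosen for this sketch when a cell has no reads
    in_peaks = sum(1 for read in reads if any(overlaps(read, p) for p in peaks))
    return in_peaks / len(reads)


# Toy peak set and toy reads for a single cell (made-up coordinates).
peaks = [("chr1", 100, 200), ("chr2", 300, 400)]
reads = [
    ("chr1", 150, 180),  # inside the first peak
    ("chr1", 190, 260),  # partially overlaps the first peak
    ("chr1", 500, 550),  # no overlap
    ("chr2", 50, 90),    # no overlap
]

print(fraction_in_peaks(reads, peaks))  # 0.5
```

The linear scan over peaks keeps the sketch short; for real peak sets an interval index or sorted sweep would be the usual choice.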