scirpy.tl.repertoire_overlap
- scirpy.tl.repertoire_overlap(adata, groupby, *, target_col='clone_id', overlap_measure='jaccard', overlap_threshold=None, fraction=False, inplace=True, added_key='repertoire_overlap', airr_mod='airr')
Compute distance between cell groups based on clonotype overlap.
Adds pairwise overlaps, distance matrix and linkage to uns.
Warning
This function is experimental and is likely to change in the future.
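To illustrate what a clonotype-overlap distance between groups means, here is a minimal sketch in plain Python. It is not scirpy's implementation: the toy data, the group names and the helper `jaccard_distance` are hypothetical, and each group is simply treated as the set of clone IDs observed in it.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Jaccard distance between two sets: 1 - |a & b| / |a | b|."""
    union = a | b
    if not union:
        # Convention chosen for this sketch: two empty sets have distance 0.
        return 0.0
    return 1.0 - len(a & b) / len(union)


# Hypothetical toy data: one (group label, clone id) pair per cell.
cells = [
    ("sample_A", "c1"), ("sample_A", "c2"), ("sample_A", "c3"),
    ("sample_B", "c2"), ("sample_B", "c3"), ("sample_B", "c4"),
    ("sample_C", "c5"),
]

# Collect the set of clone ids seen in each group.
clonotypes_by_group = {}
for group, clone_id in cells:
    clonotypes_by_group.setdefault(group, set()).add(clone_id)

# Pairwise distances between all groups.
distances = {
    (g1, g2): jaccard_distance(clonotypes_by_group[g1], clonotypes_by_group[g2])
    for g1, g2 in combinations(sorted(clonotypes_by_group), 2)
}

for pair, dist in distances.items():
    print(pair, dist)
```

Groups that share many clonotypes end up close together (small distance), while groups with no shared clonotypes are at distance 1.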
- Parameters
- adata :
AnnData | MuData | DataHandler AnnData or MuData object that contains AIRR information.
- groupby :
str Column with group labels (e.g. samples, tissue source, diagnosis, etc.).
- target_col :
str (default: 'clone_id') Category that overlaps among groups (clone_id by default, but can in principle be any group or cluster).
- overlap_measure :
str (default: 'jaccard') Any distance measure accepted by scipy.spatial.distance; by default it is jaccard.
- overlap_threshold :
float | None (default: None) The minimum required weight to accept presence.
- fraction :
str | bool (default: False) If True, compute fractions of abundances relative to the groupby column rather than reporting absolute numbers. Alternatively, a column name can be provided according to which the values will be normalized, or an iterable providing cell weights directly. Setting it to False assigns equal weight to all cells.
- inplace :
bool (default: True) Whether results should be added to uns or returned directly.
- added_key :
str (default: 'repertoire_overlap') Results will be added to uns under this key.
- airr_mod :
str (default: 'airr') Name of the modality in which AIRR information is stored in the MuData object. If an AnnData object is passed to the function, this parameter is ignored.
- Return type
None | Tuple[DataFrame, ndarray, ndarray]
- Returns
A DataFrame used by the pairwise scatterplot, distance matrix and linkage.
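The interplay of fraction and overlap_threshold described above can be pictured with a small sketch in plain Python. This is an illustration under stated assumptions, not the library's code: the helper `presence_table` and the toy data are hypothetical, and the strictly-greater-than comparison against the threshold is an assumption rather than documented behavior.

```python
from collections import Counter, defaultdict


def presence_table(cells, fraction=False, overlap_threshold=0.0):
    """Return, per group, the set of clone ids whose weight exceeds the threshold.

    Hypothetical helper for illustration only.
    """
    # Count cells per (group, clone id).
    counts = defaultdict(Counter)
    for group, clone_id in cells:
        counts[group][clone_id] += 1

    presence = {}
    for group, counter in counts.items():
        total = sum(counter.values())
        # With fraction=True, weights are abundances relative to the group size;
        # otherwise each cell contributes a weight of 1 (absolute counts).
        weights = {
            clone_id: (n / total if fraction else n)
            for clone_id, n in counter.items()
        }
        # Assumption: a clonotype counts as present if weight > threshold.
        presence[group] = {
            clone_id for clone_id, w in weights.items() if w > overlap_threshold
        }
    return presence


# Hypothetical toy data: one (group label, clone id) pair per cell.
cells = [
    ("sample_A", "c1"), ("sample_A", "c1"), ("sample_A", "c1"), ("sample_A", "c2"),
    ("sample_B", "c2"), ("sample_B", "c2"), ("sample_B", "c3"), ("sample_B", "c3"),
]

absolute = presence_table(cells)
relative = presence_table(cells, fraction=True, overlap_threshold=0.3)
print(absolute)
print(relative)
```

In this toy case, clone c2 makes up only a quarter of sample_A, so it no longer counts as present there once a fractional threshold of 0.3 is applied.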