scirpy.io.read_airr

scirpy.io.read_airr(path)

Read AIRR-compliant data.

Reads data organized in the AIRR rearrangement schema.

The following columns are required:
  • cell_id

  • productive

  • locus

  • consensus_count

  • at least one of junction_aa or junction.
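The column requirements above can be checked before a file is loaded. The following is a minimal sketch using only the standard library; the helper name `missing_airr_columns` is hypothetical and not part of Scirpy, and it assumes the file is tab-separated with a header row.

```python
import csv
import io

# Columns listed as required above; the junction columns are alternatives.
REQUIRED = ("cell_id", "productive", "locus", "consensus_count")
JUNCTION_ALTERNATIVES = ("junction_aa", "junction")


def missing_airr_columns(handle):
    """Return the required columns missing from the header of a TSV stream.

    Hypothetical helper for illustration; not part of the Scirpy API.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    header = set(reader.fieldnames or [])
    problems = [col for col in REQUIRED if col not in header]
    # At least one of the junction columns must be present.
    if not any(col in header for col in JUNCTION_ALTERNATIVES):
        problems.append("junction_aa or junction")
    return problems


# In-memory example: consensus_count is absent from the header.
example = io.StringIO("cell_id\tproductive\tlocus\tjunction_aa\nc1\tT\tTRA\tCAVF\n")
print(missing_airr_columns(example))  # ['consensus_count']
```

For a file on disk, the same function could be called on a handle opened with `open(path, newline="")`.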

Currently, reading data into Scirpy has the following limitations:
  • Only alpha and beta TCR chains are supported. Other chains are ignored.

  • Non-productive chains are removed.

  • Each cell can contain up to two alpha and two beta chains (Dual TCR). Excess chains (those with the lowest read count/UMI count) are removed, and the cell is flagged as Multichain-cell.
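The filtering rules above can be sketched in plain Python. This illustrates the described behaviour and is not Scirpy's actual implementation. It assumes that rows are already parsed into dictionaries, that alpha and beta chains carry the AIRR locus values `TRA` and `TRB`, and that `consensus_count` is the value used to rank chains. The function name `select_chains` is hypothetical.

```python
from collections import defaultdict

MAX_CHAINS_PER_LOCUS = 2          # up to two alpha and two beta chains per cell
SUPPORTED_LOCI = ("TRA", "TRB")   # assumed AIRR locus values for alpha/beta


def select_chains(rows):
    """Apply the limitations listed above to parsed rearrangement rows.

    rows: iterable of dicts with keys cell_id, locus, productive (bool)
    and consensus_count (int). Hypothetical helper, not Scirpy code.
    """
    per_cell = defaultdict(lambda: defaultdict(list))
    for row in rows:
        if row["locus"] not in SUPPORTED_LOCI:
            continue  # other chain types are ignored
        if not row["productive"]:
            continue  # non-productive chains are removed
        per_cell[row["cell_id"]][row["locus"]].append(row)

    result = {}
    for cell_id, by_locus in per_cell.items():
        kept, multichain = [], False
        for locus in SUPPORTED_LOCI:
            # Highest counts first, so the lowest-count chains are dropped.
            chains = sorted(
                by_locus.get(locus, []),
                key=lambda r: r["consensus_count"],
                reverse=True,
            )
            if len(chains) > MAX_CHAINS_PER_LOCUS:
                multichain = True
            kept.extend(chains[:MAX_CHAINS_PER_LOCUS])
        result[cell_id] = {"chains": kept, "multichain": multichain}
    return result
```

In this sketch, a cell whose chains are all unsupported or non-productive does not appear in the result at all; how Scirpy represents such cells is not stated here.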

For more information, see T-cell receptor model.

Parameters
path : Union[str, Sequence[str], Path, Sequence[Path]]

Path to the AIRR rearrangement TSV file. If different chains are split across multiple files, pass them as a list, e.g. ["path/to/tcr_alpha.tsv", "path/to/tcr_beta.tsv"].

Return type

AnnData

Returns

AnnData object with TCR data in obs for each cell. For more details see Data structure.
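
A possible way to call the function is sketched below. The helpers `as_path_list` and `load_tcr_data` are hypothetical wrappers added for illustration; only `scirpy.io.read_airr` itself comes from this page. The sketch assumes that the accepted `path` types are as documented above and that checking for missing files up front is desirable.

```python
from pathlib import Path


def as_path_list(path):
    """Normalize a single path or a sequence of paths into a list of Path objects.

    Hypothetical helper; not part of the Scirpy API.
    """
    if isinstance(path, (str, Path)):
        return [Path(path)]
    return [Path(p) for p in path]


def load_tcr_data(path):
    """Check that all files exist, then pass them to scirpy.io.read_airr."""
    paths = as_path_list(path)
    missing = [p for p in paths if not p.is_file()]
    if missing:
        raise FileNotFoundError(f"AIRR file(s) not found: {missing}")
    # Imported lazily so that as_path_list can be used without Scirpy installed.
    import scirpy

    return scirpy.io.read_airr(paths)
```

With files split by chain, a call might look like `adata = load_tcr_data(["path/to/tcr_alpha.tsv", "path/to/tcr_beta.tsv"])`, after which the TCR annotations would be available in `adata.obs`.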