Skip to content

Output files

rustar-aligner produces the same set of output files as STAR. All files are written into the directory implied by --outFileNamePrefix, with names that match STAR’s exactly.

For the examples below, assume --outFileNamePrefix sample_. Replace sample_ with whatever you’ve configured.

Default output. SAM-format alignment file with header and records. Written when --outSAMtype SAM (the default).

Unsorted BAM equivalent of Aligned.out.sam. Written when --outSAMtype BAM Unsorted.

Coordinate-sorted BAM. Written when --outSAMtype BAM SortedByCoordinate. The sort happens in-memory by default; cap the RAM with --limitBAMsortRAM if needed.

Transcriptome-coordinate BAM. Written when --quantMode TranscriptomeSAM is set. Each record’s reference is a transcript ID rather than a chromosome; one record is emitted per transcript that the read aligns within.

The summary statistics file — the most useful output for monitoring runs and feeding into MultiQC. Contains counts and rates for: input reads, uniquely mapped reads, reads mapped to multiple loci, reads mapped to too many loci, unmapped reads (with reasons), splice junctions, mismatch rates, deletion/insertion rates, and run timing.

Format is identical to STAR’s, so MultiQC parses it without modification.

The verbose run log. Contains parameter values used, genome loading info, per-thread progress, and warnings. Useful when something looks wrong — search for WARNING or ERROR.

Per-chunk progress lines emitted during alignment. Useful for monitoring long runs.

Tab-separated table of splice junctions discovered during alignment. STAR-compatible 9-column format:

#ColumnMeaning
1chrChromosome
2startIntron start (1-based)
3endIntron end (1-based)
4strand0 = undefined, 1 = +, 2 = -
5motif0 = non-canonical, 1 = GT/AG, 2 = CT/AC, 3 = GC/AG, 4 = CT/GC, 5 = AT/AC, 6 = GT/AT
6annotated0 = novel, 1 = present in GTF / sjdb
7unique_readsNumber of uniquely-mapping reads supporting the junction
8multi_readsNumber of multi-mapping reads supporting the junction
9max_overhangMax overhang of any supporting read

Filtered using the --outSJfilter* parameters (see CLI reference).

Written when --chimSegmentMin > 0. Format depends on --chimOutType.

14-column tab-separated table when --chimOutType includes Junctions (default). One row per chimeric junction. Format is STAR-compatible. See the chimeric guide for the full column breakdown.

When --chimOutType includes WithinBAM, the chimeric segments are embedded as supplementary alignment records (FLAG 0x800) in the main BAM output, with SA tags linking the donor and acceptor halves. Tools like Arriba and STAR-Fusion know how to read either format.

Per-gene read counts. Written when --quantMode GeneCounts is set. Four-column format:

gene_id unstranded forward_stranded reverse_stranded

The first four rows are summary categories: N_unmapped, N_multimapping, N_noFeature, N_ambiguous. Subsequent rows are per-gene counts. Pick the column matching your library’s strandedness — see the quantification guide.

sample_Unmapped.out.mate1 / sample_Unmapped.out.mate2

Section titled “sample_Unmapped.out.mate1 / sample_Unmapped.out.mate2”

Written when --outReadsUnmapped Fastx is set. FASTQ output of every read that didn’t map (and reads that mapped to too many loci). Mate 2 is only present for paired-end input. For PE half-mapped pairs (one mate mapped, one not), both mates are written so the pair can be re-aligned later.

In two-pass mode (--twopassMode Basic), pass 1 produces an internal _STARpass1/SJ.out.tab that’s used to seed pass 2. The final sample_SJ.out.tab reflects pass 2’s discoveries; pass 1’s intermediate file is not retained by default.

When --outStd is set to SAM, BAM_Unsorted, or BAM_SortedByCoordinate, the corresponding alignment output is sent to stdout instead of a file. Useful for piping directly into samtools, sambamba, or any other downstream tool:

Terminal window
rustar-aligner --outStd BAM_Unsorted ... | samtools sort -@ 4 -o sample.sorted.bam

The other output files (Log.final.out, SJ.out.tab, etc.) are still written to disk normally.