Single-end alignment
Single-end (SE) alignment maps each read in a single FASTQ file independently — there’s no mate pair to constrain the alignment.
Basic command
Section titled “Basic command”rustar-aligner \ --genomeDir /path/to/genome_index \ --readFilesIn reads.fq \ --outSAMtype SAM \ --outFileNamePrefix sample_Strand information
Section titled “Strand information”For stranded RNA-seq libraries, add the strand field to the output so that downstream tools (Cufflinks, StringTie, etc.) can pick up the strand assignment from junction motifs:
--outSAMstrandField intronMotifGzipped input
Section titled “Gzipped input”Use --readFilesCommand zcat to pipe through decompression:
rustar-aligner \ --genomeDir /path/to/genome_index \ --readFilesIn reads.fq.gz \ --readFilesCommand zcat \ --outSAMtype BAM SortedByCoordinate \ --outFileNamePrefix sample_On macOS, use gzcat instead of zcat.
Threading
Section titled “Threading”Use --runThreadN to parallelise the alignment:
--runThreadN 16The alignment phase scales nearly linearly with thread count up to the IO limit of the storage holding the input/output files.
Filtering
Section titled “Filtering”The defaults match STAR. Two parameters are particularly useful to know:
--outFilterMultimapNmax(default10) — max number of alignments per read. A read mapping to more loci is discarded asMultiMapTooMany.--outFilterMismatchNoverLmax(default0.3) — max ratio of mismatches to mapped length. Tightening this to0.04is a common choice for human RNA-seq.--outFilterScoreMinOverLread(default0.66) — min alignment score relative to read length. Reads scoring below this threshold are unmapped.
See the CLI parameters reference for the full list.
Output limits
Section titled “Output limits”Multi-mapping reads emit one SAM record per locus by default (up to --outFilterMultimapNmax). To cap the number of secondary alignments written:
--outSAMmultNmax 5Reproducibility
Section titled “Reproducibility”When two alignments tie on score, rustar-aligner uses a seeded RNG (StdRng) to pick the primary. Set the seed for reproducible runs:
--runRNGseed 42The default seed is 777.
What’s next
Section titled “What’s next”- Paired-end alignment for paired libraries
- Two-pass mode for novel-junction recovery
- Output files reference for what each output file contains