You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am using STAR version 2.7.11a to map RNA-seq reads onto a reference genome, and I am encountering an issue with the --quantMode TranscriptomeSAM option. Specifically, some reads that span two CDS regions (interrupted by an intron) are missing from the transcriptome BAM file, even though they are present in the genome BAM file.
Previously, I mapped these RNA-seq reads with minimap2 directly onto the CDS sequences, and those reads aligned perfectly. However, with STAR, these reads are missing from the transcriptome BAM file, which I discovered when some expected SNPs were absent in downstream analyses.
Here is an example of one such read, as found in the genome BAM file: (i used saamtools view to have reads info)
I made a simple sketch to better visualize the read mapping:
Here is the command I used :
STAR --genomeDir $GenomeIndexDirectory --readFilesIn $FastqInputFolder/$forward $FastqInputFolder/$reverse --readFilesCommand gunzip -c --runThreadN 16 --outSAMtype BAM SortedByCoordinate --quantMode TranscriptomeSAM --outMultimapperOrder Random --outFileNamePrefix $WorkingDirectory/AlignedReads/$ind
I expected this read to appear in the transcriptome BAM file, as it maps to two CDS regions separated by an intron.
I saw issue #315 and checked my GFF file, but there don’t seem to be any apparent problems with it.
What criteria does STAR use to include reads in the transcriptome BAM file?
Could the issue be related to splicing annotations (e.g., incomplete or incorrect intron definitions)?
Are there specific parameters I can adjust to include such reads in the transcriptome BAM file?
Thank you for your help!
The text was updated successfully, but these errors were encountered:
Hi everyone,
I am using STAR version 2.7.11a to map RNA-seq reads onto a reference genome, and I am encountering an issue with the --quantMode TranscriptomeSAM option. Specifically, some reads that span two CDS regions (interrupted by an intron) are missing from the transcriptome BAM file, even though they are present in the genome BAM file.
Previously, I mapped these RNA-seq reads with minimap2 directly onto the CDS sequences, and those reads aligned perfectly. However, with STAR, these reads are missing from the transcriptome BAM file, which I discovered when some expected SNPs were absent in downstream analyses.
Here is an example of one such read, as found in the genome BAM file: (i used saamtools view to have reads info)
I made a simple sketch to better visualize the read mapping:
Here is the command I used :
STAR --genomeDir $GenomeIndexDirectory --readFilesIn $FastqInputFolder/$forward $FastqInputFolder/$reverse --readFilesCommand gunzip -c --runThreadN 16 --outSAMtype BAM SortedByCoordinate --quantMode TranscriptomeSAM --outMultimapperOrder Random --outFileNamePrefix $WorkingDirectory/AlignedReads/$ind
I expected this read to appear in the transcriptome BAM file, as it maps to two CDS regions separated by an intron.
I saw issue #315 and checked my GFF file, but there don’t seem to be any apparent problems with it.
What criteria does STAR use to include reads in the transcriptome BAM file?
Could the issue be related to splicing annotations (e.g., incomplete or incorrect intron definitions)?
Are there specific parameters I can adjust to include such reads in the transcriptome BAM file?
Thank you for your help!
The text was updated successfully, but these errors were encountered: