Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RNASeq expression worker #853

Open
ihodes opened this issue Oct 8, 2015 · 9 comments
Open

RNASeq expression worker #853

ihodes opened this issue Oct 8, 2015 · 9 comments

Comments

@ihodes
Copy link
Member

ihodes commented Oct 8, 2015

@iskandr:

PGV relies on allele-specific quantification of expression from RNAseq, how can that data be shown in Cycledash?

from #852

@ihodes ihodes added this to the Ready for PGV milestone Oct 8, 2015
@arahuja
Copy link
Contributor

arahuja commented Oct 9, 2015

Some near-term ways to address this might be:

  • allow an RNA BAM to be specified on run submission
  • allow a third BAM (the RNA BAM) to be displayed in the pileups

@ihodes
Copy link
Member Author

ihodes commented Oct 9, 2015

cf. #790 for multiple BAMs in pileup

allow an RNA BAM to be specified on run submission

Right now you can submit as many BAMs as you'd like; would you want a tag on a BAM designating it as RNA data?

@arahuja
Copy link
Contributor

arahuja commented Oct 9, 2015

Yes, you can submit any number of BAMs, but you can't submit a run with multiple tumor BAMs:

image

But assuming the "multiple BAMs in a pileup" feature allows you choose any other BAM from the project, I guess it doesn't matter too much. But, I could also see you wanting to tie a run to a specific RNA BAM to get the allele specific stats. Tagging a BAM as RNA also makes sense.

@ihodes
Copy link
Member Author

ihodes commented Oct 9, 2015

That makes sense; thanks for the explanation!

@armish
Copy link
Member

armish commented Nov 3, 2015

@iskandr: what is the summary statistics that you would like to show if we have the RNA-bam feature implemented? Some measure of within-sample normalized expression for a variant? Or simply number of reads supporting that particular variant?

Let me know if this needs an off-line discussion for a through discussion.

@iskandr
Copy link
Member

iskandr commented Nov 3, 2015

I think that (1) the number of reads that contain a variant (2) the number of reads overlapping the locus and (3) the normalized expression value for that variant would all be good to see. What do you think?

@armish
Copy link
Member

armish commented Nov 3, 2015

  1. Definitely a go-go;
  2. We have to define what a locus is for our purposes. Do you mean counting all reads falling into the gene locus for the corresponding transcript?
  3. Would be great to have that, but I first have to read on normalizing expression within a single sample. Did your manual PGV pipeline have this already?

@armish
Copy link
Member

armish commented Nov 6, 2015

@ihodes: OK for me to start working on this? I am planning to add an RNASeq BAM file field to runs as a first step just to quickly get it done. I think we can ideally have a more flexible BAM submission for users where they can attach any number of BAMs with a controlled label (e.g. Normal, Tumor, RNASeq, etc.) but that can be our next goal. What do you think?

@ihodes
Copy link
Member Author

ihodes commented Nov 6, 2015

@armish I'd defer to @iskandr as to what he needs—I had the impression we're waiting on a new tool he's developing next week, and that existing tools wouldn't get us what we need.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants