-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comparing metagenomes help? #294
Comments
Partially done in #419, "a practical guide." |
I think this has been addressed now. |
Helle @ctb and sourmash Team! I have been through the tutorials and Practical Guide which have all been extremely helpful. However, I have reached a snag I was hoping to get some guidance on. I am wanting to use sourmash to compare 40 metagenomic environmental water samples: 20 sampling sites over 2 seasons. I have assembled reads from each sample using metaSPAdes and computed signatures for each assembly using combos of k= 21, 31, 51 and scaled= 10, 100, 1000, 10000:
It is known from 16S DNAseq data that some sites should be clustering, especially the same site sampled at different time periods. However, my matrix plot is barely clustering any sites regardless of the k-mer + scaled combination. Is it possible that I'm computing the assembly signatures wrong? I was unsure if I should compute signatures on the reads or the contigs.
Please let me know if more information is needed. I appreciate your assistance in using sourmash for my project. Best Regards, |
hi @ReneKat sorry for delay in responding - in the future, just file a new issue, that way it pops up when I'm going through the issue tracker :) |
a few thoughts --
I would suggest sticking with k=21 and a scaled of 1000, and applying that to the reads rather than the assembly. If you e-mail me the .sig files at [email protected] I can poke at them a bit and see if I can find something you're missing. |
I have data from a single water sample that was filtered through a big filter followed by a small filter to capture attached vs. free-living microbes. We assembled the sequences from both sets of filters together as well as separately and our assemblies are different.
The text was updated successfully, but these errors were encountered: