10x run leaks memory like crazy #559
Comments
I have no real knowledge here :). What about building signatures for all barcodes simultaneously?
Haven't tested it yet, but at a glance, those PRs address the There may also need to be a filter that only allows barcodes with at least N reads, to remove the 'bad' barcodes and reduce the total memory used.
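The read-count filter suggested here could be sketched roughly as a two-pass approach. This is only an illustration, not code from sourmash or the PRs mentioned: the `(barcode, sequence)` tuples are hypothetical stand-ins for bam alignments, and the function names and the `min_reads` parameter are made up for the example.

```python
from collections import Counter


def count_reads_per_barcode(records):
    """First pass: count reads per barcode, storing only one integer each."""
    return Counter(barcode for barcode, _seq in records)


def keep_barcodes(counts, min_reads):
    """Return the set of barcodes that have at least `min_reads` reads."""
    return {barcode for barcode, n in counts.items() if n >= min_reads}


def filter_records(records, allowed):
    """Second pass: lazily yield only records whose barcode passed the filter."""
    for barcode, seq in records:
        if barcode in allowed:
            yield barcode, seq


# Hypothetical toy input standing in for alignments read from a bam file.
records = [
    ("AAAC", "ACGT"),
    ("AAAC", "GGTA"),
    ("TTTG", "CCAT"),
    ("AAAC", "TTGA"),
]

counts = count_reads_per_barcode(records)
allowed = keep_barcodes(counts, min_reads=2)
kept = list(filter_records(records, allowed))
```

The first pass keeps only a counter per barcode, so sequences for low-count barcodes never need to be held in memory. The cost is reading the input twice.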
Was this fixed with bam2fasta, @olgabot?
Yes, thank you!
---
Olga Botvinnik, PhD
olgabotvinnik.com <http://www.olgabotvinnik.com>
On Thu, Jan 2, 2020 at 8:59 PM Luiz Irber wrote:
was this fixed with bam2fasta, @olgabot <https://github.com/olgabot>?
TL;DR: Running `sourmash compute` on larger 10X bam files crashed our 2TB RAM machine (!!!)
I've been trying to run `sourmash compute` on a few 10x bam files with 3458 and 610 barcodes; previously I had tested files with 150 and 625 barcodes with no problem. But because the bam file is sorted by coordinate, the code iterates over each alignment, checks whether the barcode associated with that alignment has already been added, adds it if not, then adds the sequence. This ends up taking a LOT of memory, since it's unknown a priori which sequences correspond to which barcodes. I crashed our 2TB RAM machine running `sourmash compute` on these two files 😱
The options I see are:
What do you think?
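For reference, the access pattern described above looks roughly like the sketch below. It is a simplified illustration and not the actual sourmash code: alignments are modelled as hypothetical `(barcode, sequence)` tuples in coordinate order, and `group_sequences_by_barcode` is a made-up name.

```python
def group_sequences_by_barcode(alignments):
    """Collect every sequence under its barcode in a single pass.

    Because the input is ordered by coordinate rather than by barcode,
    no barcode is known to be complete until the last alignment is read,
    so all sequences stay in memory at once.
    """
    sequences = {}
    for barcode, seq in alignments:
        # Check whether this barcode has been seen; add it if not.
        if barcode not in sequences:
            sequences[barcode] = []
        # Then add the sequence for this alignment.
        sequences[barcode].append(seq)
    return sequences


# Hypothetical toy input: barcodes are interleaved, as in a coordinate-sorted file.
alignments = [
    ("AAAC", "ACGT"),
    ("TTTG", "CCAT"),
    ("AAAC", "GGTA"),
    ("TTTG", "TTGA"),
]

grouped = group_sequences_by_barcode(alignments)
held = sum(len(seqs) for seqs in grouped.values())
```

Here `held` equals the total number of alignments, which is why memory grows with the size of the file rather than with the size of any one barcode.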