-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
merge devel into master #143
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Codecov Report
@@ Coverage Diff @@
## master #143 +/- ##
==========================================
+ Coverage 96.60% 96.80% +0.20%
==========================================
Files 12 14 +2
Lines 1268 1630 +362
==========================================
+ Hits 1225 1578 +353
- Misses 43 52 +9
Continue to review full report at Codecov.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
General
compile
command. See below for more information. (OSError: [Errno 8] Exec format error #139)v0.48.0
.v0.41.0
.kb compile
is suggested.compile
kallisto
and/orbustools
binary from source. At the most basic level, it downloads the latest release source distributions from the respective GitHub repositories, compiles them, and places them wherekb
can automatically detect them.target
positional argument specifies which binary (or both) to compile. Possible values arekallisto
,bustools
andall
.--url
optional argument may be provided with a URL to a remote archive that will be used instead of the latest GitHub release. When this option is used,target
may not beall
.--ref
optional argument may be provided with a commit hash or git tag. When this option is used,target
may not beall
.-o
optional argument may be used to place the compiled binaries in a different directory. Note that if this option is used,--kallisto
and--bustools
options will have to be set appropriately when runningref
orcount
.--view
option may be used to simply view what binaries (their locations and versions) will be used bykb
.--remove
option may be used to remove existing compiled binaries.--overwrite
option may be used to overwrite existing compiled binaries.kallisto
compilation follows https://pachterlab.github.io/kallisto/source and has the same dependencies.bustools
compilation follows https://bustools.github.io/source and has the same dependencies.--cmake-arguments
argument may be used to pass in a string of additional arguments to pass directly to thecmake
command. For instance, to manually specify additional include directories,--cmake-arguments "-DCMAKE_CXX_FLAGS='-I /usr/include'"
ref
--include-attribute
and--exclude-attribute
options which can be used to include/exclude specific GTF entries based on their attributes. The argument to these options must be in the form of akey:value
pair, wherekey
is a GTF attribute name andvalue
is the value of the aforementioned attribute to include/exclude. Only one of these two options may be specified, and each option may be specified more than once. When multiple--include-attribute
are provided, GTF entries that have any one of the attributes will be processed. When multiple--exclude-attribute
are provided, GTF entries that have any one of the attributes will not be processed.count
--filter-threshold
option to specify the barcode filter threshold. This option may only be used when also providing--filter bustools
and indicates the minimum number of times a barcode must appear to be retained from filtering. (How to manually change the threshold of UMI filtering in Kb count wrapper #142)--strand
option to override automatic strandedness setting bykallisto bus
. Available options areunstranded
,forward
, andreverse
.transcript_ids
column to be a semicolon-delimited string instead of a list (only applicable when--tcc
is provided) as a workaround for an issue with writing lists to h5ad withh5py>=3
. error in writing transcript_ids for tcc h5ad #141BULK
andSMARTSEQ2
technologies. The two technologies behave identically. The FASTQs may be provided either directly via command-line (only for multiplexed samples), in which casekb
will perform demultiplexing, or as a single batch definition text file (only for demultiplexed samples). See https://pachterlab.github.io/kallisto/manual section aboutbatch.txt
for formatting. This batch textfile may also contain remote urls to FASTQ files, which will be streamed for supported operating systems. Additionally, added--parity
,--fragment-l
and--fragment-s
options, which may only be provided for these technologies. The first must always be provided, indicating the parity of the reads (single
,paired
), and the latter two may only be provided when--parity single
is also provided, specifying the mean length of the fragments and standard deviation of the fragment lengths.SMARTSEQ
technology has been deprecated and will be removed in the next release. Instead,SMARTSEQ2
should be used. See previous point for more information.SMARTSEQ3
technology.--dry-run
instead of an alias.--umi-gene
option, which deduplicates UMIs by gene. Can not be used with smartseq or bulk technologies.--em
option, which estimated gene abundances using the EM algorithm. Can not be used with smartseq or bulk technologies, or with--tcc
.-o
option tobustools count
already exists, but as a directory. For instance,counts_unfiltered/cells_x_genes
. Such folders are removed before running the command.--gene-names
option, which may only be used with--h5ad
or-loom
and not--tcc
. By specifying this option, the output h5ad or loom matrix will be aggregated by gene names instead of IDs.BDWTA
(BD Rhapsody),SPLIT-SEQ
,Visium
(10x).