Bot releases are hidden (Show)
--memory-limit
/-M
option.--mutual-cover
to cluster sequences by mutual coverage percentage of the cluster representative and member sequence.--symmetric
for computing greedy vertex cover with symmetric edges.--approx-id
option and the approx_pident
output field not to work correctly when using the --anchored-swipe
option.--no-reassign
to prevent reassignment to closest representative for the greedy vertex cover and clustering workflows.--connected-component-depth
to activate clustering of connected components at a given maximum depth for the greedy vertex cover and the clustering workflows.--shapes-30x10
with sensitivity approximately equivalent to --mid-sensitive
.--round-coverage
and --round-approx-id
to set per round cutoffs for cascaded clustering.-DKEEP_TARGET_ID
is now obsolete and the corresponding function is always available.--include-lineage
to the taxonomic classification format to include taxonomic lineage in the output.Published by bbuchfink over 1 year ago
-f json-flat
).--sam-query-len
to output query length in SAM format.Published by bbuchfink over 1 year ago
makedb
workflow.--target-indexed
option.--tmpdir
option for the makedb
workflow.makedb
workflow.--no-parse-seqids
to disable parsing of sequence accessions.--ignore-warnings
option could not be used for the makedb
workflow.Published by bbuchfink over 1 year ago
-DX86=OFF
.Published by bbuchfink over 1 year ago
Database file is not a BLAST database
error message for the prepdb
workflow.greedy-vertex-cover
workflow without the --out
and --centroid-out
options.greedy-vertex-cover
workflow to only produce a trivial clustering.Published by bbuchfink over 1 year ago
\t
in sequence titles, and a warning message is produced.N/A
, and a warning message is produced.Traceback error
in certain cases.qlen
and score
output fields not to be reported correctly for the realign
workflow.realign
workflow.Missing fields in input line
error when clustering.linclust
workflow.Published by bbuchfink over 1 year ago
Published by bbuchfink over 1 year ago
--iterate
) now uses a linear-time feature as the first search round.linclust
command to cluster using only a single linear-time search round.view
workflow.merge-daa
workflow to merge DAA files.--max-target-seqs/-k
option for the DAA view
workflow.--ignore-warnings
option for the cluster
and deepclust
workflows.--lin-stage1
option can now be used without compilation using the -DEXTRA=ON
cmake option._lin
suffix for sensitivity keywords for the --iterate
option to activate linear-time feature.--linsearch
to activate linear-time feature for the search workflows.ppos
and positive
output fields not to work for the realign
workflow.Published by bbuchfink over 1 year ago
recluster
workflow.invalid varint encoding
error when using the DAA format.view
workflow.Published by bbuchfink over 1 year ago
cluster
workflow to cluster protein sequences.realign
workflow to generate clustering output.recluster
workflow to correct errors in clusterings.reassign
workflow to reassign cluster members to their closest centroid.-M/--memory-limit
to set a memory limit for clustering workflows.--approx-id
option to filter alignments by approximate sequence identity and to set an approximate sequence identity threshold for clustering.--member-cover
option to set the coverage threshold of the cluster member sequence.--cluster-steps
option to set steps for cascaded clustering.--clusters
option to specify clustering input file.blastx
mode will now mask any open reading frame below the minimum required length as specified by --min-orf
.blastx
mode will only count unmasked letters towards the block size.--soft-masking
with possible values 0
and tantan
to permit soft-masking using the tantan algorithm.inflate error
in multiprocessing mode.--swipe
to compute full Smith Waterman alignments of all queries against all targets.--faster
.approx_pident
and corrected_bitscore
to the tabular format.--lin-stage1
option to linearize comparisons in the seeding stage by only considering hits against the longest query sequence for identical seeds (only supported when compiled with -DEXTRA=ON
).--kmer-ranking
option to rank sequences when --lin-stage1
is used (only supported when compiled with -DKEEP_TARGET_ID=ON
).--no-block-size-limit
to deactivate upper limits for the block size when the --memory-limit
option is used.greedy-vertex-cover
workflow to compute clustering based on alignments.--edge-format
option to set edge format for greedy vertex cover.--edges
option to set input file for greedy vertex cover.--centroid-out
option to output centroid sequences for greedy vertex cover.--unaligned-targets
option to generate an output file of unaligned targets.--header
option can now be used with the parameter simple
to enable simple headers for the tabular format, or without a parameter to enable headers for the clustering format.--mp-self
to optimize self-alignment in multiprocessing mode.--query-or-subject-cover
to report alignments if the query or the subject cover (or both) are above the given threshold.--comp-based-stats 2
option (now equivalent to --comp-based-stats 3
).--anchored-swipe
to activate anchored SWIPE extension.Published by bbuchfink over 2 years ago
diamond view
workflow to report a zero bit score for all alignments.Published by bbuchfink almost 3 years ago
_SC_LEVEL3_CACHE_SIZE
.--unal 1
with the cigar
output field.illegal instruction
error on systems that did not support AVX2.Published by bbuchfink almost 3 years ago
Published by bbuchfink about 3 years ago
--max-hsps
will now recompute a full-matrix Smith Waterman alignment with the ranges of the known HSPs masked in the target.--max-hsps
can now be used together with --ext full
.--iterate
option.--seed-cut
to set a complexity cutoff for indexed seeds.--freq-masking
to enable masking seeds based on frequency.--motif-masking (0,1)
to enable or disable motif masking.--masking seg
to enable SEG masking of target sequences (BLAST default) instead of tantan masking.full_sseq
output field to contain invalid information or to produce an error when using a BLAST database.--custom-matrix
option not to function correctly.Published by bbuchfink over 3 years ago
--masking 0
combined with the global ranking mode.Published by bbuchfink over 3 years ago
prepdb
. The command line is: diamond prepdb -d /path/to/database
. This call runs quickly and will write some small auxiliary files into the database directory.--iterate
) to search the query dataset with increasing sensitivity, only searching queries at the target sensitivity that do not produce a significant alignment at a lower sensitivity search. For example, using --sensitive --iterate
will first search the query file at default sensitivity, and search all query sequences again in --sensitive
mode that fail to align in the first round.-g
) to set a limit on the number of Smith Waterman extensions per query, with the target sequences ranked by their ungapped extension scores.--fast
sensitivity mode that is faster and less sensitive than the default mode.--algo ctg
) to improve performance for small query files.--comp-based-stats (3,4)
in combination with --ext full
.Traceback error
when using --comp-based-stats (3,4)
in rare cases.full_sseq
output field to always contain unmasked sequences.--compress zstd
) (requires compilation using cmake -DWITH_ZSTD=ON
).--ignore-warnings
).Published by bbuchfink over 3 years ago
blastdb_aliastool
).--algo 1
).--skip-missing-seqids
to ignore cases of missing sequences--seqidlist
option.--min-orf
parameter now defaults to 1 in frameshift alignment mode.Published by bbuchfink over 3 years ago
.dmnd
database files. This feature is not yet available through all release channels. It can currently be accessed by downloading the GitHub release version or by compiling from source. Taxonomy features are not yet supported for BLAST databases.
--seqidlist
to filter the database by sequence accession (only supported for BLAST databases).--dbsize
option not to function correctly.makeidx
and the option --target-indexed
that provide an optimisation specialized for small databases (<10 Mb). (see: https://github.com/bbuchfink/diamond/wiki/5.-Advanced-topics#small-database-optimization)--mp-recover
to recover aborted runs in multiprocessing mode.Published by bbuchfink over 3 years ago
--ext full
).prot.accession2taxid.FULL.gz
taxonomy mapping file from NCBI.--gapped-filter-evalue
to set the e-value threshold of the gapped filter heuristic.qseq_translated
to print the translation of the aligned part of the query sequence.--query/-q
when running alignment in blastx mode.full_qseq_mate
to print the sequence of the query's mate (enabled when using two query files in blastx mode).Published by bbuchfink almost 4 years ago
--custom-matrix
).--comp-based-stats (2,3,4)
(the feature is not enabled by default and does not support translated searches at the moment).diamond view
.--top
parameter is used.--no-unlink
to prevent unlinking of temporary files.