Skip to content

databases: accept local FASTA paths for easy nucleotide DB creation#4

Open
antonvnv wants to merge 1 commit into
pskvins:masterfrom
antonvnv:databases-local-index
Open

databases: accept local FASTA paths for easy nucleotide DB creation#4
antonvnv wants to merge 1 commit into
pskvins:masterfrom
antonvnv:databases-local-index

Conversation

@antonvnv
Copy link
Copy Markdown

Building a nucleotide database from a FASTA file previously required manually chaining createdb, splitsequence, makepaddedseqdb, and createindex with the right flags. Now a single command does it:

mmseqs databases ./input.fasta.gz outdb tmp

Both relative (./...) and absolute (/...) paths work — any argument containing '/' that isn't a known database name is treated as a local file. Protein inputs are rejected with a clear error since the indexing pipeline is nucleotide-specific.

This keeps databases as the single entry point to maintain indexing requirements, and makes it suitable for reindexing external or already manually downloaded databases.

Building a nucleotide database from a FASTA file previously
required manually chaining createdb, splitsequence, makepaddedseqdb,
and createindex with the right flags. Now a single command does it:

  mmseqs databases ./input.fasta.gz outdb tmp

Both relative (./...) and absolute (/...) paths work — any argument
containing '/' that isn't a known database name is treated as a local
file. Protein inputs are rejected with a clear error since the
indexing pipeline is nucleotide-specific.

This keeps `databases` as the single entry point to maintain indexing
requirements, and makes it suitable for reindexing external or already
manually downloaded databases.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant