Fast download BLAST databases from NCBI

Database files (volumes) are downloaded in parallel: number of threads to use is determined automatically. MD5 checksum is verified and the database volume extracted upon download. Database volumes are not downloaded in a particular order. The volumes are updated if a newer version is available on the server, or re-downloaded if corrupt. Aborted downloads are safely resumed.

ncbi-blast-dbs is faster than NCBI's update_blastdb.pl. But unlike update_blastdb.pl, which is a pure Perl script, ncbi-blast-dbs delegates download and checksum verification to wget and md5sum / md5 and is thus not as universal.

Installation

gem install ncbi-blast-dbs

Usage

List available BLAST databases

ncbi-blast-dbs

Download all volumes of a BLAST database

ncbi-blast-dbs nt nr

Databases are downloaded one after the other. Volumes of each database are downloaded in parallel. Downloads are placed in the current directory.

NCBI expects users to submit their email address when downloading data from their FTP server. To comply with that, download as:

email="my email address here" ncbi-blast-dbs nr