Publicly Hosted MetaScope Databases and Bowtie 2 Indices
Sean Lu
2025-01-30
Source:vignettes/vignette_docs/publicly_hosted_indices_and_accessions.Rmd
publicly_hosted_indices_and_accessions.RmdIntroduction
To mitigate the effort of downloading reference genomes, taxonomy identifier databases, and constructing Bowtie 2 indices, we provide pre-compiled versions as described below. Current versions of all of these files can be constructed with MetaScope’s functions if users choose not to download the files here.
Link to Box Drive
All databases are hosted centrally on a Box drive for download. A brief explanation of available files follows:
Taxonomy Annotation Database (2024_Accession_Taxa)
This file is a zip compressed folder containing a database of
taxonomy information prepared with download_accessions() as
part of MetaScope’s MetaRef module.
It is highly recommended that all users download this
database if they are using NCBI or SILVA databases. Otherwise, users
will need to run MetaScope::download_accessions() to obtain
this database.
MetaScope Bowtie2 Indices
(metascope_bowtie2_indices)
Users of the MetaScope pipeline can easily and efficiently obtain
reference genomes of interest using the download_refseq()
function. However, the additional step of creating Bowtie 2 indices from
these genomes to align them with sample data can add hours to these
initial steps.
This folder contains various pre-built bowtie2 indices that can be
used with the the align_target_bowtie() and
filter_host_bowtie functions. The user’s local
path to these indices should be supplied to the lib_dir
argument of those functions.
The following indices are available in Box drive:
-
2024_ncbi_16SThe NCBI 16S ribosomal RNA database -
Greenegenes 13_8The Greengenes 13.8 16S ribosomal RNA database -
MetaScope RefSeq 2023The entire bacteria, fungi, homo_sapiens, human_T2T, mus_musculus, and viruses refseq nucleotide database downloaded viadownload_refseq()in 2023 -
MetaScope RefSeq 2022The entire bacteria, fungi, human, mouse, and viruses refseq nucleotide database downloaded viadownload_refseq()in 2022 -
MetaScope RefSeq 2020The entire bacteria, fungi, human/mouse, Phi X 174, and viral refseq nucleotide database downloaded viadownload_refseq()in 2020 -
PathoScope RefSeq 2018The entire bacteria, mouse, and viral refseq nucleotide database downloaded via PathoScope 2.0 software in 2018 -
PathoScope RefSeq 2015The entire bacteria, fungi, human/mouse, phix174, and viral refseq nucleotide database downloaded via PathoScope 2.0 software in 2015 -
SILVA 138.1The SILVA 138.1 16S ribosomal RNA database