Phage Genomes – Sep 2022

See our publication in PHAGE to read about how this dataset is produced and some of our analyses of it. Please consider citing this paper if you are using this database of information on this webpage. You can also generate an up-to-date version of the database, with useful files for vConTACT2, MASH, and IToL using our Perl script available on Github. Updates to the script this month include a new column to the tsv outputs which include anything identified as “host” or “lab_host” within the original Genbank files. However, these values may be inconsistent or downright bizarre (so please use them with caution).

We also recently added annotations using PHROGs (more details available here), and you can download the updated annotations from HERE (please note that we won’t be re-uploading the updated annotations on a monthly basis, as the file is huge. Having the first ~19,000 already annotated will save users a lot of time when using the Perl script themselves).

If you don’t want to run the script yourself, please download all of the files ready-made from below:

1Oct2022_data_excluding_refseq.tsv
1Oct2022_data.tsv
1Oct2022_genomes.db
1Oct2022_genomes_excluding_refseq.fa
1Oct2022_genomes.fa
1Oct2022_genomes.fa.msh
1Oct2022_itol_family_annotations.txt
1Oct2022_itol_genus_annotations.txt
1Oct2022_itol_host_annotations.txt
1Oct2022_itol_length_annotations.txt
1Oct2022_itol_lowest_taxa_annotations.txt
1Oct2022_itol_node_label_annotations.txt
1Oct2022_itol_subfamily_annotations.txt
1Oct2022_phages_downloaded_from_genbank.gb
1Oct2022_refseq_genomes.fa
1Oct2022_vConTACT2_family_annotations.tsv
1Oct2022_vConTACT2_gene_to_genome.csv
1Oct2022_vConTACT2_genus_annotations.tsv
1Oct2022_vConTACT2_host_annotations.tsv
1Oct2022_vConTACT2_lowest_taxa_annotations.tsv
1Oct2022_vConTACT2_proteins.faa
1Oct2022_vConTACT2_subfamily_annotations.tsv
PHROGs HMMs for consistent annotation of genomes (see the new –PHROG optional flag)
GenomesDB Directory (please note that this doesn’t get updated each month, it’s just here as a time-saver if you run the script yourself. This version is for 13/Dec/2021)


Oct2022