HPIDB3.0

Host-Pathogen Interaction Database

About

Identification and analysis of host–pathogen interactions (HPI) is essential to study infectious diseases. However, HPI data are sparse in existing molecular interaction databases, especially for agricultural host–pathogen systems. Therefore, resources that annotate, predict and display the HPI that underpin infectious diseases are critical for developing novel intervention strategies. HPIDB 3.0 is a resource for HPI data, and contains 45, 238 manually curated entries in the current release.

Citing HPIDB

Statistics

Database Construction and Data Source

External Database Citations

Citing HPIDB

Ammari, M.G., Gresham, C.R., McCarthy, F.M. and Nanduri, B., 2016. HPIDB 2.0: a curated database for host–pathogen interactions. Database, 2016. [DOI]
Kumar, R. and Nanduri, B., 2010. HPIDB-a unified resource for host-pathogen interactions BMC bioinformatics, 11(6), p.S16. [DOI]

Statistics

Currently HPIDB contains 55,505 unique protein interactions between 55 host and 523 pathogen species. Unique protein interactions are defined by unique values in the following columns: host protein id, pathogen protein id, pmid, interaction type and detection method.

Download the HPIDB (excluding predicted interologs) database in PSI-MITAB(2.5) file format.
Download the HPIDB interolog predictions database in PSI-MITAB(2.5) file format.

Distributions within HPIDB

To see pie charts for the following: distribution of HPIs from different sources, distribution of the host and pathogen species in HPIDB, distribution of abundant host and pathogen species in HPIDB, please visit our old site.

Most abundant pathogen species in HPIDB

Pathogen Species	Host Species	Interactions
Influenza	Multiple Hosts	9,957
Herpes viruses	Multiple Hosts	8,174
Saccharomyces cerevisiae	Multiple Hosts	6,862
Papillomaviruses	Multiple Hosts	6,515
Human immunodeficiency virus	Multiple Hosts	4,366
Yersinia	Multiple Hosts	4,026
Bacillus	Multiple Hosts	3,069
Hepatitis C virus	Multiple Hosts	2,617
Francisella tularensis	Multiple Hosts	1,371
Measles virus	Multiple Hosts	1,030

HPIDB File Formats

The database is organized as a 26 column table. The first 15 columns are the standard PSI-MI TAB Format/MITAB25 format columns available for most PPI databases. The 11 additional columns specific to HPIDB along with the standard 15 columns are described below. Each row in the database represents one protein-protein interaction pair.

Description of first 15 standard PSI-MI Tab format columns

protein_xref_1 (identifier for the first protein)
protein_xref_2 (identifier for the second protein)
alternative_identifiers_1 (alternative identifiers for the first protein)
alternative_identifiers_2 (alternative identifiers for the second protein)
rotein_alias_1 (alias name for the first protein)
protein_alias_2 (alias name for the second protein)
detection_method (detection method used)
author_name (author’s name)
pmid (publication id)
protein_taxid_1 (taxonomy id for the first protein)
protein_taxid_2 (taxonomy id for the second protein)
interaction_type (interaction type)
source_database_id (source database)
database_identifier (identifier for the source database)
confidence (confidence score for interaction)

Description of 10 columns unique to HPIDB

protein_xref_1_unique (unique id for the first protein)
protein_xref_2_unique(unique id for the second protein)
protein_taxid_1_cat (taxonomy category for the first protein)
protein_taxid_2_cat (taxonomy category for the second protein)
protein_taxid_1_name (taxonomy name for the first protein)
protein_taxid_2_name (taxonomy name for the second protein)
protein_seq1 (sequence of the first protein)
protein_seq2 (sequence of the second protein)
source_database (source database for the derived PPI)
protein_xref_1_display_id (display identifier for first protein)
protein_xref_2_display_id (display identifier for second protein)

External Database Citations

Aranda, B. et al. (2011) "PSICQUIC and PSISCORE: accessing and scoring molecular interactions." Nat Meth 8 2010; 528-529. doi:10.1038/nmeth.1637. [Nature] [PSICQUIC Registry]
Aranda B, Achuthan P, Alam-Faruque Y. et al. (2010) "The IntAct molecular interaction database in 2010." Nucleic Acids Res. 2010; 38:D525-D531. doi: 10.1093/nar/gkp878. [PubMed] [DOI] [www.ebi.ac.uk/intact]
Stark C, Breitkreutz BJ, Chatr-Aryamontri A, Boucher L, Oughtred R, Livstone MS, Nixon J, Van Auken K, Wang X, Shi X, Reguly T, Rust JM, Winter A, Dolinski K, Tyers M (2011) "The BioGRID Interaction Database: 2011 update." Nucleic Acids Res. 2011 Jan; 39(Database issue):D698-704. [PubMed] [DOI] [www.thebiogrid.org]
Ceol A, Chatr Aryamontri A, Licata L, Peluso D, Briganti L, Perfetto L, Castagnoli L, Cesareni G (2010) "MINT, the molecular interaction database: 2009 update." Nucleic Acids Res. 2010 Jan; 38(Database issue):D532-9. [PubMed] [DOI] [mint.bio.uniroma2.it/mint]
Bader GD, Betel D, Hogue CW (2003) "BIND: the Biomolecular Interaction Network Database." Nucleic Acids Res. 2003; 31:248-250. [PubMed] [bond.unleashedinformatics.com]
Navratil1 V., de Chassey1 B., Meyniel1 L., Delmotte1 S., Gautier1i C., AndrÃ©3 P.,, Lotteau1 V., and Rabourdin-Combe1 C. (2009) "VirHostNet: a knowledgebase for the management and the analysis of proteome wide virus-host interaction networks." Nucleic Acids Res. 2009 Jan;37(Database issue):D661-8. [PubMed] [DOI] [pbildb1.univ-lyon1.fr/virhostnet]
The UniProt Consortium (2015) "UniProt: a hub for protein information." Nucleic Acids Res. 43: D204-D212. [DOI] UniProtKB
Xenarios, I; Salwinski, L; Duan, XJ; Higney, P; Kim, SM; Eisenberg, D (2002) "DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions." Nucleic Acids Research 30 (1): 303-5. [DOI] [PMC] [DIP]
Goll, J., Rajagopala, S. V., et al. (2008) "MPIDB: The microbial protein interaction database." Bioinformatics Volume 24,Issue 15: 1743-1744. [Bioinformatics] [MPIDB]
Brown K.R. and Jurisica I (2005) "Online Predicted Human Interaction Database." Bioinformatics Volume 21,Issue 9: 2076-2082. [Bioinformatics] [I2D]
Breuer et al (2013) "InnateDB: systems biology of innate immunity and beyond - recent updates and continuing curation." Nucl. Acids Res. 2013; 41 (D1). [Nucl. Acid Res.] [InnateDB]

Search Database By

HPIDB3.0

About

Citing HPIDB

Statistics

Database Construction and Data Source

External Database Citations

Citing HPIDB

Statistics

Distributions within HPIDB

Most abundant pathogen species in HPIDB

HPIDB File Formats

Description of first 15 standard PSI-MI Tab format columns

Description of 10 columns unique to HPIDB

External Database Citations