The Topology Data Bank of Transmembrane Proteins (TOPDB) is the most complete and comprehensive collection of transmembrane protein datasets containing experimentally derived topology information. It contains records for 5678 transmembrane proteins with information gathered from the literature and from public databases available on the World Wide Web.
The database collects the details of various experiments carried out to learn about the topology of particular transmembrane proteins. The experimental techniques include fusion with reporter enzymes, glycolysation studies, protease accessibility, immunolocalisation, etc. In addition to literature-derived data, an extensive collection of structural data was also compiled from Protein Data Bank (PDB) and from Protein Data Bank of Transmembrane Proteins (PDBTM) by utilising the TMDET algorithm and literature data to determine localisation of the two sides of the structures.
While literature-derived data can not be collected automatically, data based on 3D structures provides semi-automatic and continuously updated information for the database. Structural data is the most reliable information about transmembrane topologies, but the topology information is often incomplete. Therefore, for each protein in the database the most probable topology consistent with the collected experimental constraints was also calculated using CCTOP and HMMTOP transmembrane topology prediction algorithms for α-helical and β-barrel transmembrane proteins, respectively.
Each record in TOPDB also contains the indispensable information about the given protein such as its sequence, name, organism and cross references to various databases (PDB, PDBTM, UniProt and literature references from PubMed).
The TOPDB is designed to address the broad gap between the large number of transmembrane proteins in sequence databases and the publicly available topology information of experimentally or computationally studied transmembrane proteins.
From August of 2023, TOPDB, PDBTM and TOPDOM database are integrated and they use a common SQL based database.