DeepLoc 2.1: multi-label membrane protein type prediction using protein language models
Abstract
DeepLoc 2.0 is a popular web server for the prediction of protein subcellular localization and sorting signals. Here, we introduce DeepLoc 2.1, which additionally classifies input proteins into the membrane protein types Transmembrane, Peripheral, Lipid-anchored and Soluble. Leveraging pre-trained transformer-based protein language models, the server uses a three-stage architecture for sequence-based, multi-label predictions. In comparative evaluations against other established tools on a test set of 4933 eukaryotic protein sequences, constructed following stringent homology partitioning, DeepLoc 2.1 achieves state-of-the-art performance, with the larger ProtT5 model exhibiting a marginal advantage over the ESM-1B model. The web server is available at https://services.healthtech.dtu.dk/services/DeepLoc-2.1.
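To illustrate what "multi-label" means for the four membrane protein types named above, the following is a minimal sketch of a generic multi-label decision rule: each label gets its own independent sigmoid probability, and every label at or above a threshold is reported. It does not reproduce DeepLoc 2.1's three-stage architecture. The function name `multilabel_predict`, the example logits, and the 0.5 threshold are assumptions made for illustration and are not taken from the paper.

```python
import math

# Label names come from the abstract; everything else is illustrative.
MEMBRANE_TYPES = ["Transmembrane", "Peripheral", "Lipid-anchored", "Soluble"]


def sigmoid(x: float) -> float:
    """Map a raw score (logit) to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def multilabel_predict(logits, threshold=0.5):
    """Return (predicted_labels, per_label_probabilities).

    Each label is scored independently, so a protein can receive
    zero, one, or several labels. The threshold is a hypothetical
    default, not a value reported for DeepLoc 2.1.
    """
    if len(logits) != len(MEMBRANE_TYPES):
        raise ValueError("expected one logit per membrane type")
    probs = {label: sigmoid(z) for label, z in zip(MEMBRANE_TYPES, logits)}
    predicted = [label for label, p in probs.items() if p >= threshold]
    return predicted, probs


# Made-up logits standing in for the output of a classifier head.
example_logits = [2.0, -1.5, 0.3, -3.0]
predicted, probs = multilabel_predict(example_logits)
print(predicted)
```

Because the labels are scored independently rather than through a softmax, the probabilities need not sum to one, which is what allows a single sequence to be assigned more than one type.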