Protein Language Models for drug discovery

From ISLAB/CAISR
Revision as of 10:58, 25 September 2024 by Islab (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Title Protein Language Models for drug discovery
Summary Leveraging the sequence-based transformer protein language model for improving potential drug targets identification
Keywords Transformer, Protein interaction, Language model, drug target identification
TimeFrame Fall 2024
References Chen, J., Gu, Z., Xu, Y., Deng, M., Lai, L. and Pei, J., 2023. QuoteTarget: A sequence‐based transformer protein language model to identify potentially druggable protein targets. Protein Science, 32(2), p.e4555.

Chen, L., Fan, Z., Chang, J., Yang, R., Hou, H., Guo, H., Zhang, Y., Yang, T., Zhou, C., Sui, Q. and Chen, Z., 2023. Sequence-based drug design as a concept in computational drug design. Nature Communications, 14(1), p.4217.

Chen, D., Liu, J. and Wei, G.W., 2024. Multiscale topology-enabled structure-to-sequence transformer for protein–ligand interaction predictions. Nature Machine Intelligence, 6(7), pp.799-810.

Jiang, J., Chen, L., Ke, L., Dou, B., Zhang, C., Feng, H., Zhu, Y., Qiu, H., Zhang, B. and Wei, G., 2024. A review of transformers in drug discovery and beyond. Journal of Pharmaceutical Analysis, p.101081.

Prerequisites
Author
Supervisor Prayag Tiwari, Ali Amirahmadi
Level Master
Status Open


Protein Language Models for drug discovery

Traditional drug target identification is mainly carried out through biological experiments or high-throughput screening techniques, which often require a lot of time and money costs, so the number of drug molecules that can be screened is very limited. With the rapid development of science and technology, various DL and ML models, such as transformers, have emerged one after another, providing a new way for drug target identification. This greatly saves the cost of drug target identification and accelerates the process of drug research and development.

The sequence-based transformer protein language model is an innovative tool for identifying potential drug targets. It can independently extract features from amino acid sequences for drug target protein identification, which has potential significance for identifying drug binding sites.

References:

Chen, J., Gu, Z., Xu, Y., Deng, M., Lai, L. and Pei, J., 2023. QuoteTarget: A sequence‐based transformer protein language model to identify potentially druggable protein targets. Protein Science, 32(2), p.e4555.

Chen, L., Fan, Z., Chang, J., Yang, R., Hou, H., Guo, H., Zhang, Y., Yang, T., Zhou, C., Sui, Q. and Chen, Z., 2023. Sequence-based drug design as a concept in computational drug design. Nature Communications, 14(1), p.4217.

Chen, D., Liu, J. and Wei, G.W., 2024. Multiscale topology-enabled structure-to-sequence transformer for protein–ligand interaction predictions. Nature Machine Intelligence, 6(7), pp.799-810.

Jiang, J., Chen, L., Ke, L., Dou, B., Zhang, C., Feng, H., Zhu, Y., Qiu, H., Zhang, B. and Wei, G., 2024. A review of transformers in drug discovery and beyond. Journal of Pharmaceutical Analysis, p.101081.