login
Home / Papers / RTK_RAG: Leveraging Retrieval Augmented Generation with Multi-Window Convolutional Neural Networks...

RTK_RAG: Leveraging Retrieval Augmented Generation with Multi-Window Convolutional Neural Networks for Superior ATP Binding Site Prediction in Receptor Tyrosine Kinases

88 Citations•2025•
Sin-Siang Wei, Wei-En Jhang, Yu-Chen Liu
Journal of Chemical Information and Modeling

RTK_RAG, a framework that integrates retrieval-augmented generation (RAG) and utilizes protein language models (PLMs) with a multiwindow convolutional neural network (MCNN) architecture to improve ATP binding site prediction for RTKs, demonstrates the potential of RAG-based frameworks for enhancing functional predictions in specialized protein families.

Abstract

Receptor tyrosine kinases (RTKs) are key regulators of cellular signaling and are frequently involved in cancer development. As their activation depends on ATP binding to the kinase domain, precisely identifying ATP binding sites is critical for mechanistic studies and targeted therapy development. However, general ATP binding site prediction methods often fall short for RTKs due to their diverse structural features across different protein families. To address this challenge, we introduce RTK_RAG, a framework that integrates retrieval-augmented generation (RAG) and utilizes protein language models (PLMs) with a multiwindow convolutional neural network (MCNN) architecture to improve ATP binding site prediction for RTKs. When tested on an independent RTK data set, RTK_RAG outperforms general ATP binding site predictors on multiple evaluation metrics. By accounting for RTK-specific structural differences, our study provides a reliable tool for researching RTK function and facilitating the development of novel kinase inhibitors. Moreover, this approach demonstrates the potential of RAG-based frameworks for enhancing functional predictions in specialized protein families, offering a generalizable strategy for improving binding site identification in specific protein families.