DEEP LEARNING SCORING FUNCTION FOR PROTEIN-LIGAND MOLECULAR DOCKING

Published in 21/11/2024 - ISBN: 978-65-272-0843-3

Paper Title
DEEP LEARNING SCORING FUNCTION FOR PROTEIN-LIGAND MOLECULAR DOCKING
Authors
  • Fabiano Mendonça
  • Yago Mendes Paes
  • Andrej Aderhold
  • Adriano Werhli
  • Karina Machado
Modality
Poster
Subject area
Proteins and Proteomics
Publishing Date
21/11/2024
Country of Publishing
Brazil | Brasil
Language of Publishing
Inglês
Paper Page
https://www.even3.com.br/anais/xmeeting-2024/837169-deep-learning-scoring-function-for-protein-ligand-molecular-docking
ISBN
978-65-272-0843-3
Keywords
Protein-ligand molecular docking, Virtual screening, Convolutional neural network, Scoring function
Summary
Protein-Ligand Molecular Docking is a method that employs computational techniques to predict the best orientation and conformation of a ligand in the binding site of a target receptor. This can be applied for virtual screening to identify which ligands have the potential to bind to the target molecule. Being an in-silico approach, molecular docking reduces the cost of projects searching for drug candidates and accelerates the process of identifying a required ligand to achieve the desired effect. Usually, this process occurs in two stages: initially, the search for the best position/conformation of a ligand into the receptor's binding site, followed by the calculation of estimated binding affinity using a scoring function (SF). Developing an accurate score function is essential for the success of molecular docking studies, as it directly influences the reliability of the predictions made by the docking software. Therefore, score functions are continuously refined and validated using experimental data and benchmark datasets. The SFs used in molecular docking can be classified into different categories: physic-based, knowledge-based, empirical and machine learning. Between the types of SFs, ML-based SFs have been demonstrating robustness and can be adapted well to specific simulation conditions. These SFs are trained considering different strategys to choose the features and as target attribute experimental values of binding affinities, obtained from databases as PDBBind and DUD-E. Different ML algorithms can be used for training SFs, for example Random Forest (Delta-Vina), Convolutional Neural Networks (DeepDock, GNINA 1.0), among others. There are many studies aiming to improve the accuracy of SFs through the use of deep learning The application of neural networks has shown high potential, significantly reducing the time required for SF calculation, thus allowing testing of a larger number of ligands. In this work, we proposed the implementation of a scoring function using deep learning techniques, using TensorFlow 2 and Keras, with a convolutional neural network architecture. The same network architecture was used for training with two different datasets PDBbind versions v2016 and v2019. PDBbind is a database of protein-ligand complexes that contains various complexes organized by similarity, with experimental binding affinities. Additionally, PDBbind has a benchmark called CASF (Comparative Assessment of Scoring Functions) to evaluate scoring functions. To represent the molecules for training the network, four-dimensional vectors were used. The first three dimensions represent the physical positions of the atoms in the molecules, while the fourth dimension represents different features of the atoms. These features include molecular descriptors such as type of atoms, the number of bonds, and so on. Features are calculated using Openbabel and the charges have been added using UCSF Chimera. During training, each protein-ligand complex is rotated 24 times to make the network more robust, reducing sensitivity to the initial position of the protein-ligand complex. The network evaluation under the scoring power metric of CASF-2016, when comparing the proposed SF with others SFs, all proposed models presented very satisfactory results, being inferior only to DeltaVinaRF20.
Title of the Event
20º Congresso Brasileiro de Bioinformática: X-Meeting 2024
City of the Event
Salvador
Title of the Proceedings of the event
X-Meeting presentations
Name of the Publisher
Even3
Means of Dissemination
Meio Digital

How to cite

MENDONÇA, Fabiano et al.. DEEP LEARNING SCORING FUNCTION FOR PROTEIN-LIGAND MOLECULAR DOCKING.. In: X-Meeting presentations. Anais...Salvador(BA) Hotel Deville Prime, 2024. Available in: https//www.even3.com.br/anais/xmeeting-2024/837169-DEEP-LEARNING-SCORING-FUNCTION-FOR-PROTEIN-LIGAND-MOLECULAR-DOCKING. Access in: 15/12/2025

Paper

Even3 Publicacoes