The drug development pipeline is a costly and lengthy process. Identifying high-quality “hit” compounds—those with high potency, selectivity, and favorable metabolic properties—at the earliest stages is important for reducing cost and accelerating the path to clinical trials. For the last decade, scientists have looked to machine learning to make this initial screening process more efficient.
Computer-aided drug design is used to computationally screen for compounds that interact with a target protein. However, the ability to accurately and rapidly estimate the strength of these interactions remains a challenge.
“Machine learning promised to bridge the gap between the accuracy of gold-standard, physics-based computational methods and the speed of simpler empirical scoring functions,” said Dr. Benjamin P. Brown, an assistant professor of pharmacology at the Vanderbilt University School of Medicine Basic Sciences.
“Unfortunately, its potential has so far been unrealized because current ML methods can unpredictably fail when they encounter chemical structures that they were not exposed to during their training, which limits their usefulness for real-world drug discovery.”
Brown is the single author on a Proceedings of the National Academy of Sciences paper titled “A generalizable deep learning framework for structure-based protein-ligand affinity ranking” that addresses this “generalizability gap.”
In the paper, he proposes a targeted approach: instead of learning from the entire 3D structure of a protein and a drug molecule, Brown proposes a task-specific model architecture that is intentionally restricted to learn only from a representation of their interaction space, which captures the distance-dependent physicochemical interactions between atom pairs.
“By constraining the model to this view, it is forced to learn the transferable principles of molecular binding rather than structural shortcuts present in the training data that fail to generalize to new molecules,” Brown said.
A key aspect of Brown’s work was the rigorous evaluation protocol he developed. “We set up our training and testing runs to simulate a real-world scenario: If a novel protein family were discovered tomorrow, would our model be able to make effective predictions for it?” he said.
To do this, he left out entire protein superfamilies and all their associated chemical data from the training set, creating a challenging and realistic test of the model’s ability to generalize.
Brown’s work provides several key insights for the field:
- Task-specific specialized architectures provide a clear avenue for building generalizable models using today’s publicly available datasets. By designing a model with a specific “inductive bias” that forces it to learn from a representation of molecular interactions rather than from raw chemical structures, it generalizes more effectively.
- Rigorous, realistic benchmarks are critical. The paper’s validation protocol revealed that contemporary ML models performing well on standard benchmarks can show a significant drop in performance when faced with novel protein families. This highlights the need for more stringent evaluation practices in the field to accurately gauge real-world utility.
- Current performance gains over conventional scoring functions are modest, but the work establishes a clear, reliable baseline for a modeling strategy that doesn’t fail unpredictably, which is a critical step toward building trustworthy AI for drug discovery.
Brown, a core faculty member of the Center for AI in Protein Dynamics, knows that there is more work to be done. His current project focused exclusively on scoring—ranking compounds based on the strength of their interaction with the target protein—which is only part of the structure-based drug discovery equation.
“My lab is fundamentally interested in modeling challenges related to scalability and generalizability in molecular simulation and computer-aided drug design. Hopefully, soon we can share some additional work that aims to advance these principles,” Brown said.
For now, significant challenges remain, but Brown’s work on building a more dependable approach for machine learning in structure-based computer-aided drug design has clarified the path forward.
More information: Benjamin P. Brown, A generalizable deep learning framework for structure-based protein–ligand affinity ranking, Proceedings of the National Academy of Sciences (2025). doi.org/10.1073/pnas.2508998122
Journal information: Proceedings of the National Academy of Sciences
Provided by Vanderbilt University
News
Novel Investment Paradigms for Regenerative Healthcare Ecosystems
Introduction The transition toward regenerative healthcare ecosystems—anchored in wellness optimization, disease prevention, eradication strategies, and healthy longevity—necessitates a structural reconfiguration of capital architectures, governance models, and incentive design. Regenerative healthcare, by definition, transcends episodic [...]
What If Consciousness Exists Beyond Your Brain
Scientists still don’t know how consciousness emerges from the brain. New ideas suggest it may not emerge at all, but instead be a basic feature of reality. Is consciousness produced by the brain, or [...]
Scientists Discover Way To Treat Lung Cancer and Its Deadly Side Effect Together
A new approach using lipid nanoparticles to deliver genetic material is showing promise in tackling two major challenges in lung cancer at once.Researchers at Oregon State University have designed a new way to tackle two of [...]
Saunas Activate Your Immune System
A brief sauna session may quietly mobilize the immune system. A sauna session may do more than raise your heart rate and body temperature. A new study from Finland found that it also briefly [...]
Why music from your youth still has such an intense effect years later: A psychological perspective
You're driving, and suddenly a familiar song fills the air. Before you even know it, a wave of emotions comes over you – not just memories, but a deep, almost physical feeling. This powerful [...]
AI to antibody in days: breaking the wet lab bottleneck via high-throughput integration
The role of artificial intelligence (AI) in drug design has fundamentally shifted from a speculative tool to a central pillar of pharmaceutical research and development (R&D). Sino Biological plays a critical role in this [...]
Regenerative Healthcare by Design: Engineering Health-Centric Buildings and Urban Ecosystems
Introduction The next evolution of healthcare will not be confined to hospitals, clinics, or episodic interventions—it will be embedded into the infrastructure of everyday life. Regenerative health ecosystems require a systemic re-architecture of how [...]
Scientists Warn: Humanity Has Pushed the Planet Past Its Limits
Human population and consumption have surpassed Earth’s limits, increasing risks to climate and global stability. The Earth is already operating beyond its capacity to sustainably support the global population, according to new research highlighting [...]
Breakthrough Study Reveals Why Damaged Nerves Struggle To Heal
A newly identified molecular mechanism reveals how neurons weigh survival against repair after injury. Scientists at the Icahn School of Medicine at Mount Sinai have identified a molecular switch in neurons that limits the regrowth of [...]
Popular Vitamin B3 Supplements May Help Cancer Cells Survive, Scientists Warn
A new study raises important questions about widely used NAD+ supplements, suggesting that compounds often taken to boost energy and support healthy aging may have unintended consequences in cancer treatment. Millions of Americans take [...]
Scientists Discover Cancer Tumors Are “Addicted” to This Common Antioxidant
Cancer cells may be exploiting a common antioxidant as fuel, revealing a potential weakness that future therapies could target. Cancer cells may be tapping into an unexpected energy source: an antioxidant long associated with [...]
Nanotube injector transfers cytoplasmic contents and organelles between living cells safely
Cells are not isolated units; they continuously exchange proteins, genetic material, and even entire organelles with their neighbors. Intercellular transfer influences how tissues develop, respond to stress, and repair damage. In certain cancers, for [...]
CEO of America’s largest public hospital system is ready to replace radiologists with AI
The chief executive of America’s largest public hospital system says he is prepared to start replacing radiologists with artificial intelligence in some circumstances, once the regulatory landscape catches up. Mitchell H. Katz, MD, president [...]
Our books now available worldwide!
Online Sellers other than Amazon, Routledge, and IOPP Indigo Global Health Care Equivalency in the Age of Nanotechnology, Nanomedicine and Artifcial Intelligence Global Health Care Equivalency In The Age Of Nanotechnology, Nanomedicine And Artificial [...]
Study finds higher heart disease risk in long COVID patients
People with long COVID are at increased risk of developing cardiovascular disease, according to a new study from Karolinska Institutet published in eClinicalMedicine. The results show that the risk of conditions such as cardiac arrhythmias [...]
The Corona variant Cicada is here – we know that
Online and on social media, reports are piling up about a new Sars-Cov-2 variant that is currently on the rise: BA.3.2, also known as Cicada. That's what it's all about: The Omicron variant BA.3.2, [...]














