The drug development pipeline is a costly and lengthy process. Identifying high-quality “hit” compounds—those with high potency, selectivity, and favorable metabolic properties—at the earliest stages is important for reducing cost and accelerating the path to clinical trials. For the last decade, scientists have looked to machine learning to make this initial screening process more efficient.
Computer-aided drug design is used to computationally screen for compounds that interact with a target protein. However, the ability to accurately and rapidly estimate the strength of these interactions remains a challenge.
“Machine learning promised to bridge the gap between the accuracy of gold-standard, physics-based computational methods and the speed of simpler empirical scoring functions,” said Dr. Benjamin P. Brown, an assistant professor of pharmacology at the Vanderbilt University School of Medicine Basic Sciences.
“Unfortunately, its potential has so far been unrealized because current ML methods can unpredictably fail when they encounter chemical structures that they were not exposed to during their training, which limits their usefulness for real-world drug discovery.”
Brown is the single author on a Proceedings of the National Academy of Sciences paper titled “A generalizable deep learning framework for structure-based protein-ligand affinity ranking” that addresses this “generalizability gap.”
In the paper, he proposes a targeted approach: instead of learning from the entire 3D structure of a protein and a drug molecule, Brown proposes a task-specific model architecture that is intentionally restricted to learn only from a representation of their interaction space, which captures the distance-dependent physicochemical interactions between atom pairs.
“By constraining the model to this view, it is forced to learn the transferable principles of molecular binding rather than structural shortcuts present in the training data that fail to generalize to new molecules,” Brown said.
A key aspect of Brown’s work was the rigorous evaluation protocol he developed. “We set up our training and testing runs to simulate a real-world scenario: If a novel protein family were discovered tomorrow, would our model be able to make effective predictions for it?” he said.
To do this, he left out entire protein superfamilies and all their associated chemical data from the training set, creating a challenging and realistic test of the model’s ability to generalize.
Brown’s work provides several key insights for the field:
- Task-specific specialized architectures provide a clear avenue for building generalizable models using today’s publicly available datasets. By designing a model with a specific “inductive bias” that forces it to learn from a representation of molecular interactions rather than from raw chemical structures, it generalizes more effectively.
- Rigorous, realistic benchmarks are critical. The paper’s validation protocol revealed that contemporary ML models performing well on standard benchmarks can show a significant drop in performance when faced with novel protein families. This highlights the need for more stringent evaluation practices in the field to accurately gauge real-world utility.
- Current performance gains over conventional scoring functions are modest, but the work establishes a clear, reliable baseline for a modeling strategy that doesn’t fail unpredictably, which is a critical step toward building trustworthy AI for drug discovery.
Brown, a core faculty member of the Center for AI in Protein Dynamics, knows that there is more work to be done. His current project focused exclusively on scoring—ranking compounds based on the strength of their interaction with the target protein—which is only part of the structure-based drug discovery equation.
“My lab is fundamentally interested in modeling challenges related to scalability and generalizability in molecular simulation and computer-aided drug design. Hopefully, soon we can share some additional work that aims to advance these principles,” Brown said.
For now, significant challenges remain, but Brown’s work on building a more dependable approach for machine learning in structure-based computer-aided drug design has clarified the path forward.
More information: Benjamin P. Brown, A generalizable deep learning framework for structure-based protein–ligand affinity ranking, Proceedings of the National Academy of Sciences (2025). doi.org/10.1073/pnas.2508998122
Journal information: Proceedings of the National Academy of Sciences
Provided by Vanderbilt University

News
Scientist tackles key roadblock for AI in drug discovery
The drug development pipeline is a costly and lengthy process. Identifying high-quality "hit" compounds—those with high potency, selectivity, and favorable metabolic properties—at the earliest stages is important for reducing cost and accelerating the path [...]
Nanoplastics with environmental coatings can sneak past the skin’s defenses
Plastic is ubiquitous in the modern world, and it's notorious for taking a long time to completely break down in the environment - if it ever does. But even without breaking down completely, plastic [...]
Chernobyl scientists discover black fungus feeding on deadly radiation
It looks pretty sinister, but it might actually be incredibly helpful When reactor number four in Chernobyl exploded, it triggered the worst nuclear disaster in history, one which the surrounding area still has not [...]
Long COVID Is Taking A Silent Toll On Mental Health, Here’s What Experts Say
Months after recovering from COVID-19, many people continue to feel unwell. They speak of exhaustion that doesn’t fade, difficulty breathing, or an unsettling mental haze. What’s becoming increasingly clear is that recovery from the [...]
Study Delivers Cancer Drugs Directly to the Tumor Nucleus
A new peptide-based nanotube treatment sneaks chemo into drug-resistant cancer cells, providing a unique workaround to one of oncology’s toughest hurdles. CiQUS researchers have developed a novel molecular strategy that allows a chemotherapy drug to [...]
Scientists Begin $14.2 Million Project To Decode the Body’s “Hidden Sixth Sense”
An NIH-supported initiative seeks to unravel how the nervous system tracks and regulates the body’s internal organs. How does your brain recognize when it’s time to take a breath, when your blood pressure has [...]
Scientists Discover a New Form of Ice That Shouldn’t Exist
Researchers at the European XFEL and DESY are investigating unusual forms of ice that can exist at room temperature when subjected to extreme pressure. Ice comes in many forms, even when made of nothing but water [...]
Nobel-winning, tiny ‘sponge crystals’ with an astonishing amount of inner space
The 2025 Nobel Prize in chemistry was awarded to Richard Robson, Susumu Kitagawa and Omar Yaghi on Oct. 8, 2025, for the development of metal-organic frameworks, or MOFs, which are tunable crystal structures with extremely [...]
Harnessing Green-Synthesized Nanoparticles for Water Purification
A new review reveals how plant- and microbe-derived nanoparticles can power next-gen water disinfection, delivering cleaner, safer water without the environmental cost of traditional treatments. A recent review published in Nanomaterials highlights the potential of green-synthesized nanomaterials (GSNMs) in [...]
Brainstem damage found to be behind long-lasting effects of severe Covid-19
Damage to the brainstem - the brain's 'control center' - is behind long-lasting physical and psychiatric effects of severe Covid-19 infection, a study suggests. Using ultra-high-resolution scanners that can see the living brain in [...]
CT scan changes over one year predict outcomes in fibrotic lung disease
Researchers at National Jewish Health have shown that subtle increases in lung scarring, detected by an artificial intelligence-based tool on CT scans taken one year apart, are associated with disease progression and survival in [...]
AI Spots Hidden Signs of Disease Before Symptoms Appear
Researchers suggest that examining the inner workings of cells more closely could help physicians detect diseases earlier and more accurately match patients with effective therapies. Researchers at McGill University have created an artificial intelligence tool capable of uncovering [...]
Breakthrough Blood Test Detects Head and Neck Cancer up to 10 Years Before Symptoms
Mass General Brigham’s HPV-DeepSeek test enables much earlier cancer detection through a blood sample, creating a new opportunity for screening HPV-related head and neck cancers. Human papillomavirus (HPV) is responsible for about 70% of [...]
Study of 86 chikungunya outbreaks reveals unpredictability in size and severity
The symptoms come on quickly—acute fever, followed by debilitating joint pain that can last for months. Though rarely fatal, the chikungunya virus, a mosquito-borne illness, can be particularly severe for high-risk individuals, including newborns and older [...]
Tiny Fat Messengers May Link Obesity to Alzheimer’s Plaque Buildup
Summary: A groundbreaking study reveals how obesity may drive Alzheimer’s disease through tiny messengers called extracellular vesicles released from fat tissue. These vesicles carry lipids that alter how quickly amyloid-β plaques form, a hallmark of [...]
Ozone exposure weakens lung function and reshapes the oral microbiome
Scientists reveal that short-term ozone inhalation doesn’t just harm the lungs; it reshapes the microbes in your mouth, with men facing the greatest risks. Ozone is a toxic environmental pollutant with wide-ranging effects on [...]