The drug development pipeline is a costly and lengthy process. Identifying high-quality “hit” compounds—those with high potency, selectivity, and favorable metabolic properties—at the earliest stages is important for reducing cost and accelerating the path to clinical trials. For the last decade, scientists have looked to machine learning to make this initial screening process more efficient.
Computer-aided drug design screens compounds computationally to find those that interact with a target protein. However, estimating the strength of these interactions both accurately and rapidly remains a challenge.
“Machine learning promised to bridge the gap between the accuracy of gold-standard, physics-based computational methods and the speed of simpler empirical scoring functions,” said Dr. Benjamin P. Brown, an assistant professor of pharmacology at the Vanderbilt University School of Medicine Basic Sciences.
“Unfortunately, its potential has so far been unrealized because current ML methods can unpredictably fail when they encounter chemical structures that they were not exposed to during their training, which limits their usefulness for real-world drug discovery.”
Brown is the sole author of a Proceedings of the National Academy of Sciences paper, “A generalizable deep learning framework for structure-based protein-ligand affinity ranking,” that addresses this “generalizability gap.”
In the paper, he proposes a targeted approach: rather than learning from the entire 3D structure of a protein and a drug molecule, his task-specific model architecture is intentionally restricted to learn only from a representation of their interaction space, which captures the distance-dependent physicochemical interactions between atom pairs.
“By constraining the model to this view, it is forced to learn the transferable principles of molecular binding rather than structural shortcuts present in the training data that fail to generalize to new molecules,” Brown said.
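To make the idea of an interaction-space representation concrete, here is a minimal Python sketch. It is an illustration only, not the architecture from the paper: the three atom classes, the distance bins, and the function name `interaction_histogram` are all assumptions chosen for brevity. The sketch counts protein-ligand atom pairs by class pair and distance bin, so the resulting features depend on how the two molecules sit relative to each other rather than on either molecule's full structure.

```python
import numpy as np

# Hypothetical coarse atom classes (an assumption for this sketch,
# not the typing scheme used in the paper).
ATOM_CLASSES = ("hydrophobic", "donor", "acceptor")


def interaction_histogram(protein_xyz, protein_cls, ligand_xyz, ligand_cls,
                          bin_edges=(0.0, 2.0, 4.0, 6.0)):
    """Count protein-ligand atom pairs per (protein class, ligand class, distance bin)."""
    protein_xyz = np.asarray(protein_xyz, dtype=float)
    ligand_xyz = np.asarray(ligand_xyz, dtype=float)
    edges = np.asarray(bin_edges, dtype=float)
    n_cls, n_bins = len(ATOM_CLASSES), len(edges) - 1

    # Pairwise distances, shape (n_protein_atoms, n_ligand_atoms).
    dists = np.linalg.norm(protein_xyz[:, None, :] - ligand_xyz[None, :, :], axis=-1)
    # Distance-bin index per pair; pairs beyond the last edge fall out of range.
    bins = np.digitize(dists, edges) - 1

    features = np.zeros((n_cls, n_cls, n_bins), dtype=int)
    for i, p_cls in enumerate(protein_cls):
        for j, l_cls in enumerate(ligand_cls):
            b = bins[i, j]
            if 0 <= b < n_bins:
                p_idx = ATOM_CLASSES.index(p_cls)
                l_idx = ATOM_CLASSES.index(l_cls)
                features[p_idx, l_idx, b] += 1
    return features


# Toy coordinates in angstroms (placeholders, not real structural data).
protein_xyz = [(0.0, 0.0, 0.0), (8.0, 0.0, 0.0)]
protein_cls = ["donor", "hydrophobic"]
ligand_xyz = [(3.0, 0.0, 0.0)]
ligand_cls = ["acceptor"]

features = interaction_histogram(protein_xyz, protein_cls, ligand_xyz, ligand_cls)
```

Because the features are built only from pairwise distances, moving the whole complex in space leaves them unchanged. A model fed this kind of input sees the contacts between protein and ligand, not the identity of either molecule, which is the kind of restriction the quote describes.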
A key aspect of Brown’s work was the rigorous evaluation protocol he developed. “We set up our training and testing runs to simulate a real-world scenario: If a novel protein family were discovered tomorrow, would our model be able to make effective predictions for it?” he said.
To do this, he left out entire protein superfamilies and all their associated chemical data from the training set, creating a challenging and realistic test of the model’s ability to generalize.
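A leave-superfamily-out split of this kind can be sketched in a few lines of Python. This is a hedged illustration under simple assumptions: each record is a dictionary with a `superfamily` label, and the records, labels, and affinity values below are made-up placeholders rather than data from the paper.

```python
def leave_superfamily_out(records, held_out):
    """Split records so the held-out superfamily never appears in training."""
    train = [r for r in records if r["superfamily"] != held_out]
    test = [r for r in records if r["superfamily"] == held_out]
    return train, test


def superfamily_folds(records):
    """Yield (held_out, train, test) once for every superfamily in the data."""
    for name in sorted({r["superfamily"] for r in records}):
        train, test = leave_superfamily_out(records, name)
        yield name, train, test


# Placeholder records; names and affinity values are illustrative only.
records = [
    {"complex_id": "c1", "superfamily": "family_A", "affinity": 6.1},
    {"complex_id": "c2", "superfamily": "family_A", "affinity": 7.4},
    {"complex_id": "c3", "superfamily": "family_B", "affinity": 5.2},
    {"complex_id": "c4", "superfamily": "family_C", "affinity": 8.0},
]

folds = list(superfamily_folds(records))
```

The important property is that the split is made by group rather than by individual complex. A random split would typically leave close relatives of each test protein in the training set, which is how a model can look strong on a standard benchmark and still fail on a genuinely new protein family.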
Brown’s work provides several key insights for the field:
- Task-specific, specialized architectures provide a clear avenue for building generalizable models using today’s publicly available datasets. A model designed with a specific “inductive bias” that forces it to learn from a representation of molecular interactions, rather than from raw chemical structures, generalizes more effectively.
- Rigorous, realistic benchmarks are critical. The paper’s validation protocol revealed that contemporary ML models performing well on standard benchmarks can show a significant drop in performance when faced with novel protein families. This highlights the need for more stringent evaluation practices in the field to accurately gauge real-world utility.
- Current performance gains over conventional scoring functions are modest, but the work establishes a clear, reliable baseline for a modeling strategy that doesn’t fail unpredictably, which is a critical step toward building trustworthy AI for drug discovery.
Brown, a core faculty member of the Center for AI in Protein Dynamics, knows that there is more work to be done. His current project focused exclusively on scoring—ranking compounds based on the strength of their interaction with the target protein—which is only part of the structure-based drug discovery equation.
“My lab is fundamentally interested in modeling challenges related to scalability and generalizability in molecular simulation and computer-aided drug design. Hopefully, soon we can share some additional work that aims to advance these principles,” Brown said.
Significant challenges remain, but Brown’s work toward a more dependable approach to machine learning in structure-based computer-aided drug design has clarified the path forward.
More information: Benjamin P. Brown, A generalizable deep learning framework for structure-based protein–ligand affinity ranking, Proceedings of the National Academy of Sciences (2025). doi.org/10.1073/pnas.2508998122
Journal information: Proceedings of the National Academy of Sciences
Provided by Vanderbilt University















