An international team of scientists, including from the University of Cambridge, have launched a new research collaboration that will leverage the same technology behind ChatGPT to build an AI-powered tool for scientific discovery.
The team launched the initiative, called Polymathic AI earlier this week, alongside the publication of a series of related papers on the arXiv open access repository.
“This will completely change how people use AI and machine learning in science,” said Polymathic AI principal investigator Shirley Ho, a group leader at the Flatiron Institute’s Center for Computational Astrophysics in New York City.
The idea behind Polymathic AI “is similar to how it’s easier to learn a new language when you already know five languages,” said Ho.
Starting with a large, pre-trained model, known as a foundation model, can be both faster and more accurate than building a scientific model from scratch. That can be true even if the training data isn’t obviously relevant to the problem at hand.
“It’s been difficult to carry out academic research on full-scale foundation models due to the scale of computing power required,” said co-investigator Miles Cranmer, from Cambridge’s Department of Applied Mathematics and Theoretical Physics and Institute of Astronomy. “Our collaboration with Simons Foundation has provided us with unique resources to start prototyping these models for use in basic science, which researchers around the world will be able to build from—it’s exciting.”
“Polymathic AI can show us commonalities and connections between different fields that might have been missed,” said co-investigator Siavash Golkar, a guest researcher at the Flatiron Institute’s Center for Computational Astrophysics.
“In previous centuries, some of the most influential scientists were polymaths with a wide-ranging grasp of different fields. This allowed them to see connections that helped them get inspiration for their work. With each scientific domain becoming more and more specialized, it is increasingly challenging to stay at the forefront of multiple fields. I think this is a place where AI can help us by aggregating information from many disciplines.”
“Despite rapid progress of machine learning in recent years in various scientific fields, in almost all cases, machine learning solutions are developed for specific use cases and trained on some very specific data,” said co-investigator Francois Lanusse, a cosmologist at the Center national de la recherche scientifique (CNRS) in France.
“This creates boundaries both within and between disciplines, meaning that scientists using AI for their research do not benefit from information that may exist, but in a different format, or in a different field entirely.”
Polymathic AI’s project will learn using data from diverse sources across physics and astrophysics (and eventually fields such as chemistry and genomics, its creators say) and apply that multidisciplinary savvy to a wide range of scientific problems. The project will “connect many seemingly disparate subfields into something greater than the sum of their parts,” said project member Mariel Pettee, a postdoctoral researcher at Lawrence Berkeley National Laboratory.
“How far we can make these jumps between disciplines is unclear,” said Ho. “That’s what we want to do—to try and make it happen.”
ChatGPT has well-known limitations when it comes to accuracy (for instance, the chatbot says 2,023 times 1,234 is 2,497,582 rather than the correct answer of 2,496,382). Polymathic AI’s project will avoid many of those pitfalls, Ho said, by treating numbers as actual numbers, not just characters on the same level as letters and punctuation. The training data will also use real scientific datasets that capture the physics underlying the cosmos.
Transparency and openness are a big part of the project, Ho said. “We want to make everything public. We want to democratize AI for science in such a way that, in a few years, we’ll be able to serve a pre-trained model to the community that can help improve scientific analyses across a wide variety of problems and domains.”
More information: Michael McCabe et al, Multiple Physics Pretraining for Physical Surrogate Models, arXiv (2023). DOI: 10.48550/arxiv.2310.02994
Siavash Golkar et al, xVal: A Continuous Number Encoding for Large Language Models, arXiv (2023). DOI: 10.48550/arxiv.2310.02989
Francois Lanusse et al, AstroCLIP: Cross-Modal Pre-Training for Astronomical Foundation Models, arXiv (2023). DOI: 10.48550/arxiv.2310.03024
News
Scientists Just Discovered a Cellular Survival System That Was Never Supposed To Exist
A surprising backup pathway allows cells to make a crucial amino acid when their primary machinery fails. For decades, biologists believed cells had only one way to access a molecule they cannot live without. New [...]
Artificial cells gain porous membranes, enabling lab reactions and drug release
Artificial cells created in the laboratory offer a wide range of potential applications. Until now, however, their membranes—unlike those of real cells—have been virtually impermeable. Researchers at the Max Planck Institute for Polymer Research, [...]
Popular Weight-Loss Drugs Like Ozempic Linked to Lower Breast Cancer Risk
Ozempic and similar weight-loss drugs were linked to a striking 30% reduction in breast cancer risk in a study of more than 110,000 women. Popular weight-loss and diabetes medications such as Ozempic, Wegovy, Mounjaro, [...]
Stanford Scientists Discover Explosive New Type of Immune Cell
Scientists studying the remarkable regenerative abilities of planarian flatworms have uncovered a previously unknown type of immune cell with an unusually destructive defense strategy. What if an immune cell could wipe out nearby threats [...]
Big Pharma-backed SonoThera sounds off with $125M series B for bubble-based genetic delivery
Bay Area biotech SonoThera is bubbling to a clinical boil after raising a $125 million series B with the backing of some of the biggest names in pharma. Vida Ventures led the raise, with the venture [...]
Joint initiative of 5 EU countries calls for ‘unified approach’ to pharma framework amid US drug pricing pressure
With drug pricing pressure building from the U.S., a healthcare-focused consortium of five European countries is calling for a “unified approach” to strengthen Europe’s pharmaceutical framework and access to innovative medicines. Belgium, the Netherlands, [...]
Our books now available worldwide!
Online Sellers other than Amazon, Routledge, and IOPP Indigo Global Health Care Equivalency in the Age of Nanotechnology, Nanomedicine and Artifcial Intelligence Global Health Care Equivalency In The Age Of Nanotechnology, Nanomedicine And Artificial [...]
Molecular Manufacturing: The Future of Nanomedicine – New book from NanoappsMedical Inc.
This book explores the revolutionary potential of atomically precise manufacturing technologies to transform global healthcare, as well as practically every other sector across society. This forward-thinking volume examines how envisaged Factory@Home systems might enable the cost-effective [...]
NanoMedical Brain/Cloud Interface – Explorations and Implications. A new book from Frank Boehm
New book from Frank Boehm, NanoappsMedical Inc Founder: This book explores the future hypothetical possibility that the cerebral cortex of the human brain might be seamlessly, safely, and securely connected with the Cloud via [...]
New book from Nanoappsmedical Inc. – Global Health Care Equivalency
A new book by Frank Boehm, NanoappsMedical Inc. Founder. This groundbreaking volume explores the vision of a Global Health Care Equivalency (GHCE) system powered by artificial intelligence and quantum computing technologies, operating on secure [...]
UCLA Scientists Uncover a “Hidden Weakness” in Some of the World’s Deadliest Cancers
A new study has uncovered an unexpected vulnerability in some of the deadliest cancers. Researchers at UCLA have identified a previously hidden weakness in some of the most aggressive cancers, pointing to a possible new way [...]
AI-designed universal coronavirus vaccine clears first human trial
Key Takeaways Super-Antigen Technology: Uses AI and machine learning to analyze viral genomes, creating a single vaccine that targets essential features across entire virus families, including coronaviruses and Ebola. Human Trials & Safety: Phase [...]
Researchers Discover a Hidden Vitamin D Problem That Persists Year-Round
A new study suggests that some groups may not experience the expected seasonal boost in vitamin D levels, even during the sunniest months of the year. Many people assume that spending more time outdoors [...]
Researchers Solve the Mystery Behind a Billion-Dollar Dental Implant Disease
Researchers have uncovered why a common and costly dental implant infection often resists antibiotics. Dental implants have helped tens of millions of people regain a full set of stable, functional teeth, something traditional dentures [...]
Nanoparticles inspired by lung fluid improve therapies targeting respiratory system
The CIC biomaGUNE Center for Cooperative Research in Biomaterials has developed pulmonary surfactant nanoparticles (the blend of lipids and proteins that line the alveoli and enables breathing), which are encapsulated [...]
Scientists Finally Uncover How a “Forever Chemical” Causes Birth Defects
PFDA, a PFAS “forever chemical,” can cause craniofacial birth defects by disrupting retinoic acid regulation during fetal development, revealing the first clear molecular mechanism behind the link. Researchers have long linked perfluoroalkyl and polyfluoroalkyl substances (PFAS), [...]















