The study, conducted at the virtual urgent care clinic Cedars-Sinai Connect in LA, compared recommendations given in about 500 visits of adult patients with relatively common symptoms – respiratory, urinary, eye, vaginal and dental.
A new study led by Prof. Dan Zeltzer, a digital health expert from the Berglas School of Economics at Tel Aviv University, compared the quality of diagnostic and treatment recommendations made by artificial intelligence (AI) and physicians at Cedars-Sinai Connect, a virtual urgent care clinic in Los Angeles, operated in collaboration with Israeli startup K Health. The paper was published in Annals of Internal Medicine and presented at the annual conference of the American College of Physicians (ACP). This work was supported with funding by K Health.
Prof. Zeltzer explains: “Cedars-Sinai operates a virtual urgent care clinic offering telemedical consultations with physicians specializing in family and emergency care. Recently, an AI system was integrated into the clinic: an algorithm based on machine learning that conducts initial intake through a dedicated chat, incorporates data from the patient’s medical record, and provides the attending physician with detailed diagnostic and treatment suggestions at the start of the visit, including prescriptions, tests, and referrals. After interacting with the algorithm, patients proceed to a video visit with a physician who ultimately determines the diagnosis and treatment. To ensure reliable AI recommendations, the algorithm, trained on medical records from millions of cases, only offers suggestions when its confidence level is high, giving no recommendation in about one out of five cases. In this study, we compared the quality of the AI system’s recommendations with the physicians’ actual decisions in the clinic.”
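The idea of offering a suggestion only when confidence is high can be illustrated with a minimal sketch. It is not the clinic's actual algorithm: the `Suggestion` type, the `recommend` function, and the 0.8 cutoff are all hypothetical, since the article does not say how confidence is computed or where the threshold sits.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical cutoff; the article does not state the real threshold.
CONFIDENCE_THRESHOLD = 0.8


@dataclass
class Suggestion:
    diagnosis: str
    confidence: float  # assumed to be a score between 0.0 and 1.0


def recommend(candidates: list[Suggestion],
              threshold: float = CONFIDENCE_THRESHOLD) -> Optional[Suggestion]:
    """Return the most confident suggestion, or None to abstain."""
    if not candidates:
        return None
    best = max(candidates, key=lambda s: s.confidence)
    # Abstain rather than show a low-confidence suggestion to the physician.
    return best if best.confidence >= threshold else None
```

In a design like this, raising the threshold trades coverage for reliability: the system stays silent in more visits but is less likely to surface a weak suggestion.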
The researchers examined a sample of 461 online clinic visits over one month during the summer of 2024. The study focused on adult patients with relatively common symptoms: respiratory, urinary, eye, vaginal and dental. In all visits reviewed, patients were first assessed by the algorithm, which provided recommendations, and were then treated by a physician in a video consultation. Afterwards, all recommendations from both the algorithm and the physicians were evaluated by a panel of four doctors with at least ten years of clinical experience, who rated each recommendation on a four-point scale: optimal, reasonable, inadequate, or potentially harmful. The evaluators assessed the recommendations based on the patients’ medical histories, the information collected during the visit, and transcripts of the video consultations.
The compiled ratings led to interesting conclusions: AI recommendations were rated as optimal in 77% of cases, compared to only 67% of the physicians’ decisions; at the other end of the scale, AI recommendations were rated as potentially harmful in a smaller portion of cases than physicians’ decisions (2.8% of AI recommendations versus 4.6% of physicians’ decisions). In 68% of the cases, the AI and the physician received the same score; in 21% of cases, the algorithm scored higher than the physician; and in 11% of cases, the physician’s decision was considered better.
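A head-to-head comparison of this kind can be tallied as in the sketch below. It assumes each visit yields one rating for the AI and one for the physician on the four-point scale. The function name and the sample ratings are made up for illustration and are not the study's data or code.

```python
from collections import Counter

# Four-point scale from the article, ordered from worst to best.
SCALE = ["potentially harmful", "inadequate", "reasonable", "optimal"]
RANK = {label: i for i, label in enumerate(SCALE)}


def compare_pairs(pairs):
    """Given (ai_rating, physician_rating) pairs, return the share of each outcome."""
    counts = Counter()
    for ai, physician in pairs:
        if RANK[ai] > RANK[physician]:
            counts["ai_higher"] += 1
        elif RANK[ai] < RANK[physician]:
            counts["physician_higher"] += 1
        else:
            counts["same"] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no rating pairs supplied")
    return {k: counts[k] / total for k in ("same", "ai_higher", "physician_higher")}


# Made-up ratings for illustration only, not study data.
example = [
    ("optimal", "optimal"),
    ("optimal", "reasonable"),
    ("reasonable", "optimal"),
    ("optimal", "inadequate"),
]
shares = compare_pairs(example)
```

The three shares always sum to one, which matches the reported breakdown of 68%, 21% and 11%.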
The explanations provided by the evaluators for the differences in ratings highlight several advantages of the AI system over human physicians. First, the AI adheres more strictly to medical association guidelines, for example by not prescribing antibiotics for a viral infection. Second, the AI more comprehensively identifies relevant information in the medical record, such as recurrent cases of a similar infection that may influence the appropriate course of treatment. Third, the AI more precisely identifies symptoms that could indicate a more serious condition, such as eye pain reported by a contact lens wearer, which could signal an infection. On the other hand, physicians are more flexible than the algorithm and have an advantage in assessing the patient’s real condition. For example, if a COVID-19 patient reports shortness of breath, a doctor may recognize it as relatively mild respiratory congestion, whereas the AI, relying solely on the patient’s answers, might unnecessarily refer the patient to the emergency room.
Prof. Zeltzer concludes: “In this study, we found that AI, based on a targeted intake process, can provide diagnostic and treatment recommendations that are, in many cases, more accurate than those made by physicians. One limitation of the study is that we do not know which physicians reviewed the AI’s recommendations in the available chart, or to what extent they relied on these recommendations. Thus, the study only measured the accuracy of the algorithm’s recommendations and not their impact on the physicians. The study’s uniqueness lies in the fact that it tested the algorithm in a real-world setting with actual cases, while most studies focus on examples from certification exams or textbooks. The relatively common conditions included in our study represent about two-thirds of the clinic’s case volume. Thus, the findings can be meaningful for assessing AI’s readiness to serve as a decision-support tool in medical practice. We can envision a near future in which algorithms assist in an increasing portion of medical decisions, bringing certain data to the doctor’s attention, and facilitating faster decisions with fewer human errors. Of course, many questions still remain about the best way to implement AI in the diagnostic and treatment process, as well as the optimal integration between human expertise and artificial intelligence in medicine.”
Other authors involved in the study include Zehavi Kugler, MD; Lior Hayat, MD; Tamar Brufman, MD; Ran Ilan Ber, PhD; Keren Leibovich, PhD; Tom Beer, MSc; Ilan Frank, MSc; Caroline Goldzweig, MD, MSHS; and Joshua Pevnick, MD, MSHS.
- Dan Zeltzer, Zehavi Kugler, Lior Hayat, et al. Comparison of Initial Artificial Intelligence (AI) and Final Physician Recommendations in AI-Assisted Virtual Urgent Care Visits. Ann Intern Med. [Epub 4 April 2025]. doi:10.7326/ANNALS-24-03283, https://www.acpjournals.org/doi/10.7326/ANNALS-24-03283
