Academic research - Relatix Health

April 8, 2026

Mustafa Suleyman: AI development won’t hit a wall anytime soon—here’s why

We evolved for a linear world. If you walk for an hour, you cover a certain distance. Walk for two hours and you cover double that distance. This intuition served us well on the savannah. But it catastrophically fails when confronting AI and the core exponential trends at its heart.

From the time I began work on AI in 2010 to now, the amount of training data that goes into frontier AI models has grown by a staggering 1 trillion times—from roughly 10¹⁴ flops (floating-point operations‚ the core unit of computation) for early systems to over 10²⁶ flops for today’s largest models. This is an explosion. Everything else in AI follows from this fact.

The skeptics keep predicting walls. And they keep being wrong in the face of this epic generational compute ramp. Often, they point out that Moore’s Law is slowing. They also mention a lack of data, or they cite limitations on energy.

But when you look at the combined forces driving this revolution, the exponential trend seems quite predictable. To understand why, it’s worth looking at the complex and fast-moving reality beneath the headlines.

Think of AI training as a room full of people working calculators. For years, adding computational power meant adding more people with calculators to that room. Much of the time those workers sat idle, drumming their fingers on desks, waiting for the numbers to come through for their next calculation. Every pause was wasted potential. Today’s revolution goes beyond more and better calculators (although it delivers those); it is actually about ensuring that all those calculators never stop, and that they work together as one.

Three advances are now converging to enable this. First, the basic calculators got faster. Nvidia’s chips have delivered an over sevenfold increase in raw performance in just six years, from 312 teraflops in 2020 to 2,250 teraflops today. Our own Maia 200 chip, launched this January, delivers 30% better performance per dollar than any other hardware in our fleet. Second, the numbers arrive faster thanks to a technology called HBM, or high bandwidth memory, which stacks chips vertically like tiny skyscrapers; the latest generation, HBM3, triples the bandwidth of its predecessor, feeding data to processors fast enough to keep them busy all the time. Third, the room of people with calculators became an office and then a whole campus or city. Technologies like NVLink and InfiniBand connect hundreds of thousands of GPUs into warehouse-size supercomputers that function as single cognitive entities. A few years ago this was impossible.

These gains all come together to deliver dramatically more compute. Where training a language model took 167 minutes on eight GPUs in 2020, it now takes under four minutes on equivalent modern hardware. To put this in perspective: Moore’s Law would predict only about a 5x improvement over this period. We saw 50x. We’ve gone from two GPUs training AlexNet, the image recognition model that kicked off the modern boom in deep learning in 2012, to over 100,000 GPUs in today’s largest clusters, each one individually far more powerful than its predecessors.

Then there’s the revolution in software. Research from Epoch AI suggests that the compute required to reach a fixed performance level halves approximately every eight months, much faster than the traditional 18-to-24-month doubling of Moore’s Law. The costs of serving some recent models have collapsed by a factor of up to 900 on an annualized basis. AI is becoming radically cheaper to deploy.

The numbers for the near future are just as staggering. Consider that leading labs are growing capacity at nearly 4x annually. Since 2020, the compute used to train frontier models has grown 5x every year. Global AI-relevant compute is forecast to hit 100 million H100-equivalents by 2027, a tenfold increase in three years. Put all this together and we’re looking at something like another 1,000x in effective compute by the end of 2028. It’s plausible that by 2030 we’ll bring an additional 200 gigawatts of compute online every year—akin to the peak energy use of the UK, France, Germany, and Italy put together.

What does all this get us? I believe it will drive the transition from chatbots to nearly human-level agents—semiautonomous systems capable of writing code for days, carrying out weeks- and months-long projects, making calls, negotiating contracts, managing logistics. Forget basic assistants that answer questions. Think teams of AI workers that deliberate, collaborate, and execute. Right now we’re only in the foothills of this transition, and the implications stretch far beyond tech. Every industry built on cognitive work will be transformed.

The obvious constraint here is energy. A single refrigerator-size AI rack consumes 120 kilowatts, equivalent to 100 homes. But this hunger collides with another exponential: Solar costs have fallen by a factor of nearly 100 over 50 years; battery prices have dropped 97% over three decades. There is a pathway to clean scaling coming into view.

The capital is deployed. The engineering is delivering. The $100 billion clusters, the 10-gigawatt power draws, the warehouse-scale supercomputers … these are no longer science fiction. Ground is being broken for these projects now across the US and the world. As a result, we are heading toward true cognitive abundance. At Microsoft AI, this is the world our superintelligence lab is planning for and building.

Skeptics accustomed to a linear world will continue predicting diminishing returns. They will continue being surprised. The compute explosion is the technological story of our time, full stop. And it is still only just beginning.

Mustafa Suleyman is CEO of Microsoft AI.

April 8, 2026

The Download: water threats in Iran and AI’s impact on what entrepreneurs make

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology.

Desalination plants in the Middle East are increasingly vulnerable

As the conflict in Iran has escalated, a crucial resource is under fire: the desalinization technology that supplies water in the region.

President Donald Trump has threatened to destroy “possibly all desalinization plants” in Iran if the Strait of Hormuz is not reopened. The impact on farming, industry, and—crucially—drinking in the Middle East could be severe. Find out why.

—Casey Crownhart

This story is part of MIT Technology Review Explains, our series untangling the complex, messy world of technology to help you understand what’s coming next. You can read more from the series here.

AI is changing how small online sellers decide what to make

For small entrepreneurs, deciding what to sell and where to make it has traditionally been a slow, labor-intensive process. Now that work is increasingly being done by AI.

Tools like Alibaba’s Accio compress weeks of product research and supplier hunting into a single chat. Business owners and e-commerce experts say they’re making sourcing more accessible—and slashing the time from product idea to launch.

Read the full story on how AI is leveling the path to global manufacturing.

—Caiwei Chen

The gig workers who are training humanoid robots at home

When Zeus, a medical student in Nigeria, returns to his apartment from a long day at the hospital, he straps his iPhone to his forehead and records himself doing chores.

Zeus is a data recorder for Micro1, which sells the data he collects to robotics firms. As these companies race to build humanoids, videos from workers like Zeus have become the hottest new way to train them.

Micro1 has hired thousands of them in more than 50 countries, including India, Nigeria, and Argentina. The jobs pay well locally, but raise thorny questions around privacy and informed consent. The work can be challenging—and weird. Read the full story.

—Michelle Kim

This is our latest story to be turned into an MIT Technology Review Narrated podcast, which we’re publishing each week on Spotify and Apple Podcasts. Just navigate to MIT Technology Review Narrated on either platform, and follow us to get all our new content as it’s released.

The must-reads

I’ve combed the internet to find you today’s most fun/important/scary/fascinating stories about technology.

1 Anthropic’s new model found security problems in every OS and browser
Claude Mythos has been heralded as a cybersecurity “reckoning.” (The Verge)
+ Anthrophic is limiting the rollout over hacking fears. (CNBC)
+ It’s also launching a project that lets Mythos flag vulnerabilities. (Gizmodo)
+ Apple, Google, and Microsoft have joined the initiative. (ZDNET)

2 Iranian hackers are targeting American critical infrastructure
Their focus is on energy and water infrastructure. (Wired)
+ They’re targeting industrial control devices. (TechCrunch)

3 Google’s AI Overviews deliver millions of incorrect answers per hour
Despite a 90% accuracy rate. (NYT $)
+ AI means the end of internet search as we’ve known it. (MIT Technology Review)

4 Elon Musk is trying to oust OpenAI CEO Sam Altman in a lawsuit
As remedies for Altman allegedly defrauding him. (CNBC)
+ Musk wants any damages given to OpenAI’s nonprofit arm. (WSJ $)

5 ICE has admitted it’s using powerful spyware
The tools that can intercept encrypted messages. (NPR)
+ Immigration agencies are also weaponizing AI videos. (MIT Technology Review)

6 Greece has joined the countries banning kids from social media
Under-15s will be blocked from 2027. (Reuters)
+ Australia introduced the world’s first social media ban for children. (Guardian)
+ Indonesia recently rolled out the first one in Southeast Asia. (DW)
+ Experts say they’re a lazy fix. (CNBC)

7 Intel will help Elon Musk build his Terafab in Texas
They aim to manufacture chips for AI projects. (Engadget)
+ Musk says it will be the largest-ever semiconductor factory. (Engadget)
+ Future AI chips could be built on glass. (MIT Technology Review)

8 TikTok is building a second billion-euro data center in Finland
It’s moving data storage for European users. (Reuters)
+ Finland has become a magnet for data centers. (Bloomberg $)
+ But nobody wants one in their backyard. (MIT Technology Review)

9 Plans for Canada’s first “virtual gated community” have sparked a row
The AI-powered surveillance system has divided neighbors. (Guardian)
+ Is the Pentagon allowed to surveil Americans with AI? (MIT Technology Review)

10 The high-tech engineering of the “space toilet” has been revealed
Artemis II is the first mission to carry one around the world. (Vox)

Quote of the day

“This case has always been about Elon generating more power and more money for what he wants. His lawsuit remains nothing more than a harassment campaign that’s driven by ego, jealousy and a desire to slow down a competitor.”

—OpenAI criticizes Musk’s legal action in an X post.

One More Thing

Inside the US government’s brilliantly boring websites

You may not notice it, but your experience on every US government website is carefully crafted.

Each site aligns an official web design and a custom typeface. They aim to make government websites not only good-looking but accessible and functional for all.

MIT Technology Review dug into the system’s history and features. Find out what we discovered.

—Jon Keegan

We can still have nice things

A place for comfort, fun and distraction to brighten up your day. (Got any ideas? Drop me a line.)

+ Rejoice in the splendor of the “Earthset” image captured by Artemis II.
+ Meet the fearless cat chasing off bears.
+ This document vividly explains what makes the octopus so unique.
+ Revealed: the rhythmic secret that makes emo music so angsty.

April 8, 2026

Personalized audiovisual gamma stimulation enhances neural connectivity and entrainment beyond fixed 40 Hz protocols

BackgroundConventional 40 Hz gamma stimulation is applied across individuals, potentially overlooking inter-individual neural variability.ObjectiveThis study evaluated conversation gamma frequency (CGF)–a personalized gamma frequency derived from task engagement–against the fixed 40 Hz and individual gamma frequency (IGF) derived from auditory responses.MethodsIn Experiment 1, gamma center frequencies were measured under resting, reading, and conversation conditions. In Experiment 2, EEG was used to compare neural entrainment effects across CGF, 40 Hz, and IGF conditions.ResultsConversation gamma frequency stimulation induced stronger neural activation and functional connectivity in the frontal, temporal, and parietal cortices compared to 40 Hz or IGF. Theta-gamma coupling analysis revealed significantly increased phase synchronization under CGF compared to 40 Hz with enhanced connectivity. However, entrainment declined as the frequency difference between CGF, and 40 Hz increased, emphasizing the limitation of fixed-frequency stimulation.ConclusionThese findings provide EEG-based mechanistic evidence that individualized gamma stimulation may represent a hypothesis-generating strategy for future neurorehabilitation research in aging and neurodegenerative conditions.

April 8, 2026

Correction: Prevalence of body-focused repetitive behaviors among the general population of Saudi Arabia: a cross-sectional study

April 8, 2026

STAT+: A decade ago, these drugs tore apart the FDA. Today, they might be some patients’ best hope

A year after the worst day of her life, Debra Miller received a voicemail she couldn’t quite make out. In a thick accent, a man said something about research and left a phone number. She called but couldn’t get through. “I didn’t know what country code to put in,” she said.

Debra moved on, but the voice kept tumbling through her brain. She was desperate. Her first child, Hawken, had been diagnosed 13 months before with Duchenne muscular dystrophy. In blunt tones she would never forget, a doctor had told her that her 5-year-old boy would slowly lose the ability to walk and die by 18.

When she finally figured out the digits, a Dutch scientist explained he was launching a startup around one of the most counterintuitive ideas in modern genetics: that sometimes you can fix a broken gene by breaking it just a little bit more.

That strategy, known as exon skipping, would taunt Debra for two decades, always promising a therapy just out of reach. It prompted her to raise $1.3 million for the Dutch scientist and helped turn her fledgling advocacy group, CureDuchenne, into a powerhouse. Eventually, the idea spread far beyond the Netherlands and Debra’s home in Newport Beach, Calif., stirring tenuous hope for a life-altering treatment.

Exon-skipping drugs sparked a civil war within the Food and Drug Administration. Under pressure from advocates and companies, a top official overrode reviewers to approve the first of several candidates. One company, Sarepta Therapeutics, has since earned over $5.5 billion from from drugs that may or may not provide much benefit.

Throughout, by the fickle winds of scientific misfortune, mother and son remained waiting — until about two and a half years ago. That’s when Hawken enrolled in a clinical trial for a new exon-skipping drug Debra helped support. The results from him and 38 other patients have since stunned some of the field’s top experts.

Continue to STAT+ to read the full story…

April 8, 2026

Study of Collection and Analysis of Clinical, Anamnestic, Functional, Biological Data Through the Support of Artificial Intelligence to Evaluate the Possibility of Defining a Digital Twin of the Lung (LUCE)

Conditions: Thoracic Neoplasm; Pulmonary Nodules; Non-oncological Pulmonary Pathologies

Sponsors: Casa di Cura Dott. Pederzoli

Recruiting

April 7, 2026

Telephone-Based Mental Health Promotion for Rural Women in Brazilian Agrarian Reform Communities: Pre-Post Pilot Study

Background: Women living in rural agrarian reform communities face intersecting challenges related to social, economic, racial, and gender vulnerabilities, which significantly increase their likelihood of developing physical and mental health problems. Despite the potential of telephone-based interventions to promote mental health, there is a lack of studies assessing their feasibility and effectiveness among underserved populations in Brazil. Objective: This study aimed to assess the feasibility and effectiveness of a telephone-based intervention on mental health outcomes among women living in a rural agrarian reform community in Brazil. Methods: We conducted a descriptive, prospective pilot study with a pretest and posttest design. Data were collected at 3 time points: baseline, 1 week, and 1 month after the intervention. The outcomes assessed included quality of life, social support, self-efficacy, and common mental disorder symptoms. Nonparametric tests were used to analyze the data. The intervention consisted of 3 phone calls supported by a workbook, with content based on cognitive behavioral and psychiatric nursing principles. Results: Of the 31 women enrolled, 23 (74.2%) completed all 3 phone-based sessions. There was a significant reduction in common mental disorder symptoms (Kendall =0.280; =.002), particularly in the somatic domain (=.02). Moreover, participants reported improved perceptions of the physical domain of quality of life (Kendall =0.131; =.049). All women rated the intervention positively, with more than half emphasizing its practical usefulness. Conclusions: The telephone-based intervention was feasible and showed promising results in improving mental health outcomes among women in a rural setting. These findings support integrating low-intensity, remote psychosocial strategies into primary health care, especially those led by nurses, to increase access to mental health promotion for vulnerable populations.
<img src="https://jmir-production.s3.us-east-2.amazonaws.com/thumbs/29b6717618113fd23ab574e12f131acb" />

April 7, 2026

Real-World Data From a Group Parent Management Training Program Enhanced Using Artificial Intelligence: Qualitative Study

Background: Parent management training (PMT) is an evidence-based intervention for children with disruptive behavior problems. However, access to care is often limited by cost, availability of clinicians, scheduling, and transportation barriers. Integrating artificial intelligence (AI) into group PMT may improve accessibility, personalization, and adherence, while preserving therapeutic quality. Objective: This study explored caregivers’ experiences with a hybrid PMT program that combined live therapist-led group sessions with asynchronous support from Pat, an AI conversational agent designed to augment the therapists’ support to caregivers. Methods: A total of 88 caregivers of children aged 3-14 years (mean 7.98, SD 2.45 years) from Argentina and Paraguay participated in eight weekly online group sessions led by human therapists and supplemented by Pat. Caregivers were asked to provide the net promoter score (NPS) and their perceived contribution of their experience in remote group sessions and with Pat. Caregiver perspectives were analyzed using thematic analysis by multiple coders with consensus and interrater reliability assessment. Results: The average NPS was 76.92, indicating excellent satisfaction. Regarding participants’ perceptions of the overall program, the most frequent theme was useful strategies (73/202, 36.1%), reflecting the value placed on clear, structured, and practical tools to address everyday parenting challenges. Most comments about Pat were positive (156/164, 95.1%), particularly highlighting its 24/7 accessibility and constant availability (69/164, 42.1%). Recommendations for improvement mainly focused on enhancing the user experience and incorporating additional functionalities. Regarding perceived contribution to progress, caregivers attributed, on average, 61% to Pat and 46% to the group sessions. Conclusions: Combining therapist-led PMT group sessions with AI support appears feasible, acceptable, and valued by caregivers and may expand reach without sacrificing quality. Overall, caregivers valued the useful strategies and the professional and peer support and reported positive changes. Regarding Pat, the most valued aspect is the constant and immediate support. This model integrates human expertise with the accessibility and continuity provided by AI, reducing barriers, such as time, cost, and clinician availability.

April 7, 2026

Willingness of Patients With Mental Disorders to Engage in Online Psychotherapy: Multicenter Cross-Sectional Survey

Background: China faces a high prevalence of mental disorders but low treatment uptake, a gap driven by limited awareness and unevenly distributed mental health resources. While online psychotherapy has the potential to expand access, patient willingness remains insufficiently explored. Objective: This study aimed to investigate the willingness of Chinese patients with mental disorders to engage in online psychotherapy and to identify associated factors. Methods: A multicenter, cross-sectional survey was conducted using a structured questionnaire to assess the attitudes and willingness of patients with mental disorders in China to engage in online psychotherapy. Willingness to engage in online psychotherapy was assessed using a 0 to 100 rating scale, with higher scores indicating greater willingness. Univariate analysis, correlation analysis, and multivariate linear regression analyses were used to identify factors influencing willingness. Results: Among 361 eligible participants, the mean willingness score for online psychotherapy was 70 (SD 28.56). In total, 86.4% (n=312) of participants preferred short-term therapy (1 to 10 sessions), while 92.5% (n=334) expected the cost per session to remain less than CNY ¥400 (US $55.50). Participants most preferred therapist-guided online individual therapy (n=142, 39.3%). Convenience (124/361, 34.3%) and perceived anonymity (“no one will know about the illness”; 119/361, 33.0%) were the 2 most commonly reported perceived benefits of online psychotherapy. The leading barrier was concerns about data security and privacy (108/303, 35.6%), followed by difficulty in establishing therapeutic rapport (60/303, 19.8%). The regression analysis revealed that age, self-stigma, satisfaction with current psychiatric medications, and satisfaction with previous online psychotherapy significantly influenced patients’ willingness to seek online psychotherapy. Conclusions: This multicenter study reveals a high level of willingness to engage in online psychotherapy among Chinese patients, with self-stigma as a key barrier. These findings support the development of tailored services, stigma reduction interventions, and infrastructure investment to enhance mental health care delivery.

April 7, 2026

The Digital Path to AI in Cancer Care

As cancer care becomes data-driven, artificial intelligence (AI) will play an increasingly central role across the treatment continuum, from biomarker identification and drug development to clinical trial recruitment and diagnostics. In this corner of healthcare, the ability of AI to interpret and annotate tumor sample slides that have been digitized is taking center stage. While the promise is great, and AI interpretation is already influencing some clinical care, it has not yet reached critical mass.

“There’s something like a billion slides created every year for diagnostic purposes, and today most of those, about 85%, are still read by a pathologist with a microscope on physical glass slides,” said David West, CEO and co-founder of digital pathology company Proscia. In practice, that means pathologists manually examine slides, identify cancer, grade tumors, and dictate reports in a traditional approach to diagnosing cancer that has seen little change in decades.

But that foundation is now shifting. Advances in slide scanning, cloud storage, and AI are turning digital pathology images into data that can be analyzed at scale. At Memorial Sloan Kettering Cancer Center, large archives of digitized slides helped launch Paige AI, one of the earliest companies to train deep learning systems on pathology images linked to clinical and genomic outcomes. This yielded the first U.S. Food and Drug Administration (FDA)-approved diagnostic using AI and digital pathology: Paige Prostate Detect. The company, which was acquired last year by AI-enabled precision medicine company Tempus, now combines Paige’s digital pathology-based AI with Tempus’s broad genomic sequencing data platform.

Researchers in the field say the implications of AI in digital pathology extend beyond image analysis. Mohamed Omar, MD, an associate professor of computational biology at Cedars-Sinai Medical Center, Los Angeles, noted that large language models can help clinicians navigate a research landscape that produces “hundreds of papers every single day” to inform ongoing cancer research. Multimodal AI tools promise to unlock even more insights from digital pathology data by combining it with genomic, radiomic, and clinical data to build powerful new models of both common and rare cancers for diagnosis, drug development, and clinical trial enrollment.

While adoption is in its early stages, the advent of faster and less expensive scanners is bringing digital pathology within reach of both regional and rural hospitals. Razik Yousfi, senior vice president and general manager of AI products at Tempus, and a co-founder of Paige, predicts that within the next 10 years, the majority of pathology workflows will be digital. The ultimate goal of the application of AI here is not to replace human pathologists, but to empower them with a capable assistant while spreading adoption beyond major medical centers.

Building the foundations

As the field of applying AI to digital pathology progresses, it needs to build the groundwork for a wider range of potential applications that could address rare cancers and other areas without an abundance of data. One such project is called Atlas, a collaboration between researchers in Korea, Germany, and the United States to build a foundation model trained using 1.2 million histopathology whole-slide images from 490,000 cases sourced from the Mayo Clinic and Charité – Universitätsmedizin Berlin.

Foundation models like Atlas allow large-scale pre-training of data to develop numerical representations called embeddings that capture both the structural and contextual features of slides in the dataset. Atlas incorporates a diversity of diseases, staining types, and scanners, and uses multiple image magnifications during training. This broad approach confers power and utility. It allows the digitized representations of the histology to be adapted, queried, or fine-tuned to very specific downstream tasks using much less data than would be needed to build a one-off model.

As such, a foundation model provides a reusable digitized computational backbone that can be tapped across a wide range of uses, like tumor classification, detection of morphologic structures, biomarker quantification, and outcome prediction. In short, foundational models make the process of querying digital pathology images more efficient compared with past approaches.

Andrew P. Norgan, MD, PhD
CMO, Mayo Clinic

“In the case of pathology, the successful AI models developed using ‘conventional’ neural network approaches before the advent of FMs (foundation models) typically required huge amounts of training data to achieve high performance and generalizability—the ability to work across datasets distinct from the training data,” said lead Atlas researcher Andrew P. Norgan, MD, PhD, CMO of Mayo Clinic Digital Pathology and assistant professor of laboratory medicine and pathology. “We think of FMs as [an] enabler that allows model development in pathology … to move from artisanal or craft processes to more scalable and reproducible processes that should allow for the rapid development of high-quality models to address problems in pathology.”

At Paige AI, the company’s early work resulted in the first FDA-approved AI diagnostic, Paige Prostate Detect. Its algorithm was built using a technique called multiple instance learning instead of traditional supervised neural network techniques that require detailed human annotation of slides, a time-consuming and expensive method that could expose the learning to human error. The difference between the two methods is that traditional neural networks expose AI to a slide with cancer and tell it that there is cancer present. In multiple instance learning, the model is shown unannotated slides and is tasked with finding the cancer.

Even this approach, however, required a very large dataset. It became apparent to company leaders that the heavy lifting required to get Paige Prostate Detect to work wasn’t scalable.

“We had kind of cracked this recipe,” said Yousfi. “We know how to use a lot of GPU (graphics processing unit) compute, and if we get a ton of data and a lot of compute, we can build anything. But GPU infrastructure is very expensive, and it takes a lot of time to train a very large system.”

Perhaps the most important factor moving Paige away from this model is that it will not work when there is only a small amount of data available. This blocks the ability to train AI to recognize rare cancers for which sample counts are low. The company needed a different approach.

“We had this idea [for] a new system that was basically trained on all of the images we had access to, independent of the organ and indication and tissue and task,” Yousfi said. “Back then, we didn’t know what that thing was called. But ultimately, that became what everyone is calling today a foundation model.”

Originally trained on 200,000 slides, Paige’s new model now includes 3.5 million images and roughly two billion parameters, making it the backbone for other downstream applications the company builds today. This ability to use foundation models as the AI and data encyclopedia for smaller applications will ultimately propel the field of digital pathology forward by widening the playing field.

Going multimodal

To address more complex predictive problems, additional data types can be integrated. Clinical, radiologic, or genomic data can be combined with morphologic embeddings or used during training to help the model learn which tissue features carry a signal of disease or identify a biomarker. These approaches aim to support precision oncology by making morphologic data computable and aligning slide-derived features with other cancer-focused datasets. “These approaches can surface subtle or ‘latent’ patterns in pathology slides and align them with other data sources,” Norgan said. Pathologist and oncology care teams can then evaluate and interpret the features identified by the models within the clinical and biological context.

“In this way, pathologists and oncology teams use these outputs as decision-support tools, while clinical judgment remains central to diagnostic interpretation and therapeutic decision making,” Norgan added.

Atlas has now been succeeded by Atlas2, which was trained on 5.5 million pathology images and is now a two billion-parameter model, making it one of the largest pathology foundation models to date. The team has explored distilling methods to create smaller, more efficient, and targeted versions of the model that retain performance, with an eye toward finding a balance between scale and deployability.

Proscia is embarking on a different multimodal approach that combines vision models with language models, with the intent of creating methods to query the morphology of digitized slides. Their efforts in vision-language models (VLMs) combine textual data with visual data and allow the model to describe the morphology of a slide, answer questions about what it contains, find images in a database based on a text query, and even follow multimodal instructions such as “circle the tumor area on this image.”

In short, a VLM can be engaged in the same way you can engage a human. “I could go ask a pathologist to point out all the areas of tumor-infiltrating lymphocytes,” West said. “Now, because language-vision models are encoding language and images in the same space, they can do that, too. You can ask the model to describe what is happening in an image, and it will tell you exactly what it sees.”

At Cedars-Sinai, Omar’s work with large language models takes a less direct route of leveraging queries to gather information from research studies or even images. “Basically, you could go to the tool, ask questions, and the tool will provide you with pieces of code,” he explained. “These pieces of code are what you use on the slide to get more information.”

Atlas provides a similar function at the Mayo Clinic, Norgan noted. Because the model-generated embeddings in the digitized slide also encode semantic information, the Atlas team is now building a slide search function, which would allow researchers or clinicians to identify and access slides, or regions of slides, with related features.

Democratizing care

Although it will take time to disseminate the tools needed for AI-enabled digitized models of cancer care to smaller health systems, the future is now at Moffitt Cancer Center, where the research hospital is engaged in a top-to-bottom digitization of its system.

According to Marilyn Bui, MD, PhD, senior member of the departments of pathology and machine learning, the comprehensive cancer center plans for full digital adoption across clinical and research labs by 2027. Last August, it entered a multi-year collaboration with integrated AI and digital pathology company PathAI to deploy its cloud-based digital pathology image management system for both research and clinical applications.

Within the pathology department, the transition will mean that all glass slides will be scanned and reviewed digitally, providing the basis for applying AI computational tools to assist pathologists. Bui said that the cancer center is accelerating its move toward clinical AI adoption: “Just today I received an email asking which AI algorithms we plan to incorporate for clinical utility—prostate cancer, breast cancer, general tumor detection,” she said. “For us, it’s no longer just research.”

Moffitt is taking a hybrid approach to algorithm development and deployment within the system. Some AI tools will come from commercial vendors and will be validated internally, while others will be developed by investigators through the center’s translational pathology work. Taking this approach will allow it to apply AI to both common cancers and the rare tumor types Moffitt frequently encounters.

While the digital initiative will be transformational, Bui emphasized that the goal is not to replace pathologists but to enhance their capabilities. She prefers to refer to AI as augmented intelligence to reflect this. “Artificial intelligence suggests a robot replacing us,” she said. “But what we mean is augmented intelligence—tools that assist and enhance our ability to make clinical decisions.”

Further, Moffitt intends to integrate digitized slide data with genomic, proteomic, and clinical outcome data to build a multimodal data environment that could advance precision oncology. “Digital pathology and AI will allow us to extract far more information from tissue samples,” Bui said, “making our diagnoses more actionable for the clinical team and ultimately improving patient care.”

The promise of AI in oncology isn’t just better algorithms, it’s broader access. The maturation of computational pathology and its dissemination from large cancer centers like Moffitt to regional and rural health systems has the potential to provide levels of care typically only available at large research hospitals in community settings as well.

“It’s about democratizing access to care,” said Omar. “For a person in Maine or Wisconsin or another place to have access to the same high-quality care that you would get from a larger academic medical center in LA or New York, slides have to be digitized.”

Over the next 10 years, there could be a compelling business case for hospitals to embrace digital pathology. As the cost of scanners comes down and a broad range of diagnostic tools becomes available, digitizing routine H&E slides could become common.

While genetic cancer testing can cost hundreds of dollars, Omar pointed out that pathology slides “cost $5 [and] they are available universally, in all patients with cancer.” As AI models increasingly identify genomic-level insights directly from those inexpensive images, it represents a “huge win for accessibility, making AI work for patients who cannot afford genetic tests,” Omar said. If there is broad adoption of digital pathology “it is very easy to roll out any kind of AI models and computational tools across the board, across situations and locations that don’t have access to care.”

“At the end of the day, all slides will be digitized,” he concluded. “It’s just a matter of time.”

Chris Anderson, a Maine native, has been a B2B editor for more than 25 years. He was the founding editor of Security Systems News and Drug Discovery News, and led the print launch and expanded coverage as editor in chief of Clinical OMICs, now named Inside Precision Medicine.

The post The Digital Path to AI in Cancer Care appeared first on Inside Precision Medicine.