Biohub Releases Protein Biology World Model to Address Disease

Biohub, the non-profit research organization co-founded by Priscilla Chan, MD, and Mark Zuckerberg, has now unveiled the latest update to the ESM protein language model family, with expanded capabilities in binder design and protein function mapping for therapeutic discovery. The release comes just seven months after Biohub recruited the team behind EvolutionaryScale. 

The system includes ESMC (Evolutionary Scale Modeling Cambrian), a language model trained on approximately 2.8 billion sequences drawn from a breadth of life, including organisms adapted to extreme environments, and more than 20,000 types of proteins found in the human body. Evolutionary information encoded in ESMC is translated into atomic-resolution protein structures and interactions using the design engine and prediction model, ESMFold2.  

Alex Rives, PhD, head of science at Biohub and former chief scientist at EvolutionaryScale, presented the work at this week’s “AI in Biology” symposium at Cold Spring Harbor Laboratory.  

These models aim to transform the earliest stages of drug discovery by making biology more programmable. While traditional discovery workflows rely on slow and resource intensive experimental screens to identify promising drug candidates, rational protein design guided by in silico predictions has the potential to dramatically accelerate development timelines. 

“We’re at an exciting point in protein biology where accurate digital representations allow asking experimental questions at a scale that wouldn’t be possible in the laboratory,” Rives told GEN Edge. 

ESMC provides a foundation for modeling the sequence, structure, and function of proteins. ESMFold2 predicts the structure of proteins and biomolecular complexes. Features derived from the representations of the model capture fundamental principles of structure and function that form a compositional grammar for protein biology. [Biohub]
ESMC provides a foundation for modeling the sequence, structure, and function of proteins. ESMFold2 predicts the structure of proteins and biomolecular complexes. Features derived from the representations of the model capture fundamental principles of structure and function that form a compositional grammar for protein biology. [Biohub]

ESMFold2 designed high-affinity protein binders against five disease targets in cancer and immunology: receptor tyrosine kinases implicated in tumor growth (EGFR and PDGFRβ), immune checkpoints exploited by cancer cells to evade immune surveillance (PD-L1 and CTLA-4), and a regulator of immune cell signaling (CD45).  

Lab-validated designs achieved hit rates ranging from 36–88% for compact mini-binders and 15–29% for antibody-derived formats, while also demonstrating nanomolar binding affinity, high specificity, and favorable stability profiles consistent with potential clinical utility. Notably, binders for PD-L1 showed therapeutic function and restored T-cell signaling in laboratory tests by blocking the same pathway as approved checkpoint therapies. 

Rather than requiring multiple sequence alignments (MSAs) to build representations, ESMFold2 captures evolutionary information encoded during pretraining. The model also uses a looped transformer architecture, which allows compute to scale at inference time and avoids overfitting that can arise when training is constrained by limited experimental protein structures. 

In benchmarking, ESMFold2 performed favorably when compared against Chai-1 from Chai Discovery, Boltz-1 from MIT (whose developers have since launched a public benefit corporation), and AlphaFold 3 from Google DeepMind. 

The models are accessible under the highly permissive Massachusetts Institute of Technology (MIT) license for both commercial and non-commercial use. The work is described as a preprint that has not yet been peer reviewed. 

“All in” on AI biology 

Last November, Chan and Zuckerberg made the pledge to go “all in on AI-powered biology,” announcing that the organization’s scientific teams will now unite under a single entity, known as Biohub, where the duo would place the majority of their philanthropic effort.  

Concurrently, Rives and EvolutionaryScale colleagues were recruited to tackle disease by decoding the “grammar” of amino acids through billions of years of evolution. That same mission had once secured a whopping $142 million seed round when the startup unveiled in 2024. The raise was led by Nat Friedman, Daniel Gross and Lux Capital, and included participation from Amazon Web Services (AWS) and NVentures, Nvidia’s corporate venture arm. 

Biohub continues to advance virtual biology by building digital representations of molecules, genomes, cells, and living systems. The new ESM release joins Biohub’s growing ecosystem of biology models, including TranscriptFormer, which was published in Science earlier this month.  

The organization recently invested $500 million in the Virtual Biology Initiative, a five-year campaign to accelerate the creation of technologies and multi-modal datasets to build predictive models of biology. The commitment comes a few months after the organization announced a collaboration with Arc Institute and Tahoe Therapeutics to build the largest single cell chemical perturbation dataset to power the virtual cell. 

Evolution is all you need 

Biohub has applied the new ESM models to generate the ESM Atlas, a mapping of 6.8 billion sequences and 1.1 billion predicted structures to protein function using ESMC’s representations. Generating this atlas would have taken “billions of years of experimental work,” but was condensed into a couple of weeks with computational inference. The ESM Atlas is released open-source. 

By probing ESMC’s representations using sparse autoencoders (SAEs), a technique for identifying interpretable structure in large language models, the authors found that the model independently learned hierarchical organization covering the basic chemistry of individual amino acids, local structural interactions, and functional concepts across unrelated proteins, despite being trained only on sequence data. 

Notably, ESM Atlas SAE feature clusters brought together RNA-guided DNA endonucleases, eukaryotic Fanzor proteins and their evolutionary ancestor, prokaryotic TnpB, despite their high evolutionary divergence and low sequence similarity. These insights could support the development of new gene-editing tools. 

While the preprint’s results are still a step away from clinical impact, Rives reiterates the power of open science in placing these tools in the hands of researchers working directly in translational research. 

Biohub is partnering with a number of platform partners, including AWS Bio Discovery, Benchling, Phylo, Tamarind Bio, Modal, Tool Universe, and SandboxAQ, to make the models widely available.

The post Biohub Releases Protein Biology World Model to Address Disease appeared first on GEN – Genetic Engineering and Biotechnology News.

Marmoset models for the study of autism spectrum disorders

Neurological and psychiatric disorders are a highly prevalent source of global disability. For the majority of these conditions, including autism spectrum disorder (ASD), disease-modifying treatments remain unavailable, and existing pharmacological interventions are largely palliative.

The Download: keeping up with AI, and the future of IVF

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology.

Stay on top of what’s going on in AI this summer

Here at MIT Technology Review, we understand exactly how relentless the pace of news from the world of artificial intelligence feels. New models and capabilities crop up as fast as we can cover them, and the ripple effects they send through tech and wider society are never far behind.

Our unique strength lies in cutting through the day-to-day noise to help you understand what’s really happening, and what lies around the corner.

That’s why we created our list of 10 Things That Matter in AI Right Now, unveiled at our flagship AI event EmTech AI a few weeks back (check the list out if you haven’t already!) And it’s why we publish so many stories dedicated to explaining how AI works, and what’s coming next. We also regularly run live subscriber-only Roundtables events—you can still catch up on last week’s session, where we explored how AI might enter the physical realm via world models.

Right now, there’s a 25% discount on subscriptions. Sign up now to deepen your understanding of AI this summer. You can also join the conversation by subscribing to The Algorithm, our free weekly newsletter all about the latest in AI.

MIT Technology Review Narrated: what’s next for IVF

IVF has brought millions of babies into the world over the last four decades. But the process can still be slow, painful, and expensive—and far from guaranteed to work. Now, a wave of new technologies aims to change that

Researchers are using AI to identify promising sperm and embryos, developing robotic systems that could automate parts of the IVF process, and even exploring controversial genetic editing techniques designed to prevent inherited disease.

The technologies could make IVF more effective and accessible. But they’re also raising difficult ethical questions about how far reproductive medicine should go.

—Jessica Hamzelou

This is our latest story to be turned into an MIT Technology Review Narrated podcast, which we publish each week on Spotify and Apple Podcasts. Just navigate to MIT Technology Review Narrated on either platform, and follow us to get all our new content as it’s released.

The must-reads

I’ve combed the internet to find you today’s most fun/important/scary/fascinating stories about technology.

1 NASA unveiled plans for three uncrewed missions to the Moon this year
They’re part of preparations for a crewed landing in 2028. (The Verge)
+ And steps to build the first lunar base at the Moon’s south pole. (NBC News)
+ Jeff Bezos’s Blue Origin will lead the first uncrewed mission. (WP $)
+ NASA is building the first nuclear reactor-powered spacecraft. (MIT Technology Review)

2 Samsung’s largest unions have approved a landmark bonus scheme
The deal averts a massive strike at the world’s largest memory-chip maker. (WSJ $)
+ Chip workers will get an average bonus of about $340,000. (Bloomberg $)
+ The dispute centered on who profits from the AI boom. (BI)
+ Resistance to AI is growing. (MIT Technology Review)

3 Elon Musk accused the Pentagon of misusing Starlink for drones
He says military use of the system violates SpaceX rules. (Ars Technica)
+ The DoD is disputing a Starlink price hike during the Iran war. (Reuters $)
+ Stratospheric internet could take off this year. (MIT Technology Review)

4 China has overhauled the world’s biggest surveillance network with AI
Beijing is pushing law enforcement towards predictive policing. (FT $)
+ Police use of smart glasses is also booming in China. (Gizmodo)
+ LLMs could supercharge mass surveillance. (MIT Technology Review)

5 Space Force is awarding SpaceX $2 billion for a military data network
It will connect military sensors and weapons platforms worldwide. (Reuters $)
+ The contract comes amid concerns about SpaceX’s AI business. (WSJ $)
+ Speculation is growing around a possible SpaceX-Tesla merger.  (CNBC)

6 Taiwan suspects Nvidia chips were smuggled to China via Japan
To circumvent US restrictions. (Bloomberg $)
+ Is China about to win the AI race? (MIT Technology Review)

7 Booming AI chip demand has created two new $1 trillion companies
South Korea’s SK Hynix and the US’ Micron have hit the landmark. (BBC)

8 AI has sparked a surge in demand for cybersecurity experts
Thanks to a glut of new code and alarm over powerful models. (NYT $)
+ AI is making online swindles easier. (MIT Technology Review)

9 Internet is coming back in Iran after a three-month blackout
Although it isn’t clear if the reconnection is permanent. (Wired $)

10 Physicists are rethinking the role of gravity in quantum mechanics
There’s a new theory for how our everyday world emerges. (New Scientist $)

Quote of the day

“AI and its capabilities represent something analogous to the Second Coming.” 

—Jeremy Nixon, the cofounder of AGI House and a former Google Brain researcher, tells the New York Times how Silicon Valley’s innovations could affect the pope.

One More Thing

animal crossing concepts

ANDREW MERRITT


Inside the experimental world of animal infrastructure

In the mid-2000s, toads were meeting a gruesome end near Ede, a leafy old town in the Netherlands. Residents responded by building wildlife tunnels beneath the road to help them reach their breeding ponds safely.

The crossings became popular. But a few years later, researchers found the local toad population had crashed from more than 10,000 to fewer than 1,000.

The case reflects a wider global push to build wildlife crossings and other forms of “animal infrastructure.” But do they actually help animal populations recover? Read the full story to find out.

—Matthew Ponsford

We can still have nice things

A place for comfort, fun, and distraction to brighten up your day. (Got any ideas? Drop me a line.)

+ The votes for “International Mollusc of the Year” are finally in.
+ Track aircraft in real time across a gorgeous 3D digital globe using live flight data.
+ NASA’s Psyche spacecraft has delivered breathtaking new close-up images of Mars.
+ This deep dive into instant coffee reveals the extraordinary engineering effort behind making it vaguely drinkable.