This AI has chemical expertise — and helps synthesize 35 new compounds

-


Chemical synthesis is the process of creating complex chemical compounds from simpler precursors.Credit: Andrew Lambert Photography/SPL

Searching for blockbuster drugs and wonder materials is an arduous task for chemists. To make their promising compounds, they must trawl through millions of known chemical reactions, with hundreds of thousands more added annually, and then test whether it is possible to synthesize them.

Now, researchers have created an artificial-intelligence system that vastly simplifies and accelerates the process of chemical synthesis. The system, which is called MOSAIC and is described in a study published in Nature on 19 January1, recommended conditions that researchers were able to use to generate 35 compounds with the potential to become products like pharmaceuticals, agrochemicals or cosmetics without needing to do any further trawling or tweaking.

“The synthesis of small molecules is the slow step in drug discovery and a number of other important areas,” says study co-author Timothy Newhouse, a chemist at Yale University in New Haven, Connecticut.

MOSAIC could remove this bottleneck, adds Newhouse, so could lead to more and better products. It is “capable of drafting complete laboratory instructions — detailed enough for chemists to follow — to help create molecules that have not previously existed”.

AI-assisted chemistry

Predicting the conditions of chemical reactions has been a key focus of AI use in chemistry. One of the most prominent tools is IBM’s RXN for Chemistry, which is based on a large language model (LLM).

It uses a system called simplified molecular-input line-entry system (SMILES). This translates chemical 3D structures into letters, numbers and punctuation, which are better suited to a system that recognizes language. By contrast, LLMs such as ChemCrow are trained for chemistry tasks using natural-language data2.

The SMILES approach makes it easier to process chemical information such as starting materials and solvents. “Our goal was to build a general model that could read chemistry the way chemists write it [by] listening to the language of experimental procedures and quickly turning that collective voice into a practical suggestion,” says Newhouse. Newhouse adds that integrating the step-by-step instructions that MOSAIC produces into automated systems would be a “natural next step”.

The researchers used an AI system they had developed previously3 to cluster a database of around one million reactions extracted from patents into 2,285 subsets. Using the subsets, the team trained Meta’s partially open-source Llama LLM to create 2,498 separate expert models, each specialized in one combination of chemical transformation starting from one type of molecule. This approach can run on computers locally because it uses fewer parameters than do the major LLMs.

Martin Seifrid, a materials scientist at North Carolina State University in Raleigh, says that MOSAIC is notable in that it avoids “throwing the largest possible model at a problem, instead choosing to focus on a carefully designed system of much smaller ‘expert’ models”. “Each specialized model is more accurate within its domain,” Seifrid says.

Bottleneck opened



Source link

Latest news

Nvidia’s Campaign to Sell AI Chips to China Finally Pays Off

Jensen Huang sure seems to be having a lot of fun in China this week. The Nvidia CEO...

Trump Admin’s Plans for $500 Million USIP Building May Violate Court Order, Say Former Workers

Last year, the Trump administration and members of the so-called Department of Government Efficiency (DOGE) forcibly took over...

This Chinese Startup Wants to Build a New Brain-Computer Interface—No Implant Required

China’s brain-computer interface industry is growing fast, and the newest company to emerge from the country is aiming...

ArXiv says submissions must be in English: are AI translators up for the job?

Every month, more than 20,000 scientific manuscripts by authors from around the world are posted on the...

Developmental convergence and divergence in human stem cell models of autism – Nature

de la Torre-Ubieta, L., Won, H., Stein, J. L. & Geschwind, D. H. Advancing the understanding of autism...

Five ways to make the academic workplace happier and healthier this year

When you spend enough years in academia, you begin to hear the same sentence spoken over and...

Must read

You might also likeRELATED
Recommended to you