Amazon Has New Frontier AI Models—and a Way for Customers to Build Their Own



Amazon has announced a new family of frontier artificial intelligence models—and a new way for customers to build frontier models of their own.

The ecommerce giant announced the second generation of its Nova AI models at re:Invent, a company conference held in Las Vegas. The models are nowhere near as popular as those offered by rivals like OpenAI and Google, but Amazon’s plan to make them highly customizable could see them gain traction with its cloud users.

Amazon detailed two improved large language models, Nova Lite and Nova Pro; a new real-time voice model called Nova Sonic; and a more experimental model called Nova Omni that performs a simulated kind of reasoning using images, audio, and video as well as text. The new models are being made available today to a limited number of customers.

More significantly, given the importance of its cloud business, Amazon is also releasing a tool called Nova Forge that will let customers create specialized frontier models by adding their own training data to unfinished versions of the Nova 2 Lite and Pro models.

It is already possible to fine-tune off-the-shelf AI models like Google’s Gemini and OpenAI’s GPT. But Amazon’s approach lets customers add data at various stages of model training, including the process of building the base model, a stage known as custom pre-training that is normally reserved for large AI labs.

“Everyone is looking for a frontier model that’s an expert in their domain,” Rohit Prasad, who leads Amazon’s AI efforts, told WIRED ahead of today’s announcements. Prasad says that Amazon developed the technologies behind Nova Forge to empower internal teams, including those developing Alexa and AI agents, to build custom models. “This is essentially a new open training paradigm,” he says.

One customer that has already tested the approach is Reddit, which used Nova Forge to create a custom model to identify content that breaks the platform’s rules.

Fine-tuning a conventional model would not work, says Reddit chief technology officer Chris Slowe, because most models are designed to avoid offensive or violent content entirely, meaning they would refuse to analyze some materials. Slowe says that custom pre-training, combined with conventional fine-tuning, produced a frontier model that is expert at understanding and using Reddit.

“Other LLMs understand Reddit as a concept, and how Reddit works, but they’re not down in the weeds,” Slowe says. “We really built a Reddit expert model.”




Ariel Shapiro