Amazon Has New Frontier AI Models—and a Way for Customers to Build Their Own



Amazon has announced a new family of frontier artificial intelligence models—and a new way for customers to build frontier models of their own.

The ecommerce giant announced the second generation of its Nova AI models at re:Invent, a company conference held in Las Vegas. The models are nowhere near as popular as those offered by rivals like OpenAI and Google, but Amazon’s plan to make them highly customizable could see them gain traction with its cloud users.

Amazon detailed two improved large language models, Nova Lite and Nova Pro; a new real-time voice model called Nova Sonic; and a more experimental model called Nova Omni that performs a simulated kind of reasoning using images, audio, and video as well as text. The new models are being made available today to a limited number of customers.

More significantly, given the importance of its cloud business, Amazon is also releasing a tool called Nova Forge that will let customers create specialized frontier models by adding their own training data to unfinished versions of the Nova 2 Lite and Pro models.

It is already possible to fine-tune off-the-shelf AI models like Google’s Gemini and OpenAI’s GPT. But Amazon’s approach lets customers add data at various stages of model training, including the process of building the base model, a stage known as custom pre-training that is normally reserved for large AI labs.

“Everyone is looking for a frontier model that’s an expert in their domain,” Rohit Prasad, who leads Amazon’s AI efforts, told WIRED ahead of today’s announcements. Prasad says that Amazon developed the technologies behind Nova Forge to empower internal teams, including those developing Alexa and AI agents, to build custom models. “This is essentially a new open training paradigm,” he says.

One customer that has already tested the approach is Reddit, which used Nova Forge to create a custom model to identify content that breaks the platform’s rules.

Fine-tuning a conventional model would not work, says Reddit chief technology officer Chris Slowe, because most models are designed to avoid offensive or violent content entirely, meaning they would refuse to analyze some materials. Slowe says that custom pre-training, combined with conventional fine-tuning, produced a frontier model that is expert at understanding and using Reddit.

“Other LLMs understand Reddit as a concept, and how Reddit works, but they’re not down in the weeds,” Slowe says. “We really built a Reddit expert model.”




Ariel Shapiro