Meta exec denies the company artificially boosted Llama 4’s benchmark scores

April 7, 2025

A Meta exec on Monday denied a rumor that the company trained its new AI models to present well on specific benchmarks while concealing the models’ weaknesses.

The executive, Ahmad Al-Dahle, VP of generative AI at Meta, said in a post on X that it’s “simply not true” that Meta trained its Llama 4 Maverick and Llama 4 Scout models on “test sets.” In AI benchmarks, test sets are collections of data used to evaluate the performance of a model after it’s been trained. Training on a test set could misleadingly inflate a model’s benchmark scores, making the model appear more capable than it actually is.

Over the weekend, an unsubstantiated rumor that Meta artificially boosted its new models’ benchmark results began circulating on X and Reddit. The rumor appears to have originated with a post on a Chinese social media site by a user claiming to have resigned from Meta in protest over the company’s benchmarking practices.

Reports that Maverick and Scout perform poorly on certain tasks fueled the rumor, as did Meta’s decision to use an experimental, unreleased version of Maverick to achieve better scores on the benchmark LM Arena. Researchers on X have observed stark differences in the behavior of the publicly downloadable Maverick compared with the model hosted on LM Arena.

Al-Dahle acknowledged that some users are seeing “mixed quality” from Maverick and Scout across the different cloud providers hosting the models.

“Since we dropped the models as soon as they were ready, we expect it’ll take several days for all the public implementations to get dialed in,” Al-Dahle said. “We’ll keep working through our bug fixes and onboarding partners.”
