Why we no longer evaluate SWE-bench Verified
SWEbench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWEbench Pro.
AI 업계의 최신 소식을 빠르게 확인하세요.
SWEbench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWEbench Pro.

Data centers for AI are turning the world of power generation on its head. There isn’t enough power capacity on the grid to even come close to how much energy is needed for the number being built.
GGML and llama.cpp join HF to ensure the long-term progress of Local AI
Train AI models with Unsloth and Hugging Face Jobs for FREE

More money has been invested in AI than it took to land on the moon. Spending on the technology this year is projected to reach up to $700 billion, almost double last year’s spending.
3.1 Pro is designed for tasks where a simple answer isn’t enough.
AIenabled deception now permeates our online lives. There are the highprofile cases you may easily spot, like when White House officials recently shared a manipulated image of a protester in Minnesota and then mocked those asking about it.

AI is accelerating the telecommunications industry’s transformation, becoming the backbone of autonomous networks and AInative wireless infrastructure.

The GeForce NOW anniversary celebration keeps on rolling, and this week is all about the games that make it possible. With more than 4,500 titles supported in the cloud — plus 12 new games this week — there’s always something new to stream, share and discover.

Sundar Pichai gave remarks at the opening ceremony of the AI Impact Summit. Read a transcript of his speech.

A look at the partnerships and investments Google announced at the AI Impact Summit 2026.
IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST