In a key benchmark, Claude Sonnet 4.5, a next-generation language model, runs autonomously for 30 hours on a single task.
If you have a PC enrolled in the Windows Insider Program’s Dev or Beta channels, you’ve got an update to install.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results