New secret math benchmark stumps AI models and PhDs alike
Epoch AI allowed Fields Medal winners Terence Tao and Timothy Gowers to review portions of the benchmark. “These are extremely challenging,” Tao said in feedback provided to Epoch. “I think that in the near term basically the only way to…
Ars Live: Our first encounter with manipulative AI
While Bing Chat’s unhinged nature was caused in part by how Microsoft defined the “personality” of Sydney in the system prompt (and unintended side effects of its architecture with regard to conversation length), Ars Technica’s saga with the chatbot began when…
Anthropic hires its first “AI welfare” researcher
The researchers propose that companies could adapt the “marker method” that some researchers use to assess consciousness in animals—looking for specific indicators that may correlate with consciousness, although these markers are still speculative. The authors emphasize that no single feature…
Claude AI to process secret government data through new Palantir deal
An ethical minefield: Since its founders started Anthropic in 2021, the company has marketed itself as one that takes an ethics- and safety-focused approach to AI development. The company differentiates itself from competitors like OpenAI by adopting what it calls…
ChatGPT has a new vanity domain name, and it may have cost $15 million
On Wednesday, OpenAI CEO Sam Altman merely tweeted “chat.com,” announcing that the company had acquired the short domain name, which now points to the company’s ChatGPT AI assistant when visited in a web browser. As of Thursday morning, “chatgpt.com” still…