Debates over AI benchmarks — and how they’re reported by AI labs — are spilling out into public view. This week, an OpenAI employee accused Elon Musk’s AI company, xAI, of publishing misleading ...
XAI Grok 4 Benchmarks are showing it is the leading model. Humanity Last Exam at 35 and 45 for reasoning is a big improvement from about 21 for other top models. If these leaked Grok 4 benchmarks are ...
In just two years, Elon Musk’s xAI has become one of a dozen or so labs capable of developing state-of-the-art AI models. Now xAI is out with its Grok 3 large language model, which beats ...
The artificial intelligence community is in the midst of a heated debate over xAI’s Grok 3 model. OpenAI’s Boris Power has accused xAI of manipulating benchmark evaluations to artificially enhance ...
Elon Musk's xAI has launched its new flagship AI model, Grok-4, which demonstrates leading performance in various academic, reasoning, and coding benchmarks. Elon Musk's xAI today announced Grok 4, ...
Yesterday, just as OpenAI celebrated its 10-year anniversary, the AI company launched GPT-5.2, its latest series of AI models to power ChatGPT. The latest release is allegedly in response to OpenAI’s ...
xAI, the artificial intelligence company founded by Elon Musk, has recently unveiled grok-code-fast-1, a groundbreaking agentic coding model designed to revolutionize how developers approach software ...
Elon Musk’s xAI Holdings Corp. has released grok-code-fast-1, a dedicated agentic coding artificial intelligence model that is extremely speedy and designed to strike a “compelling balance between ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results