Technology

Post articles or questions about technology

founded 2 years ago

New Open Source AI model beats DeepSeek's performance using just 14% of the data its Chinese competitor needed (decrypt.co)

submitted 1 month ago by [email protected] to c/[email protected]

cross-posted from: https://lemmy.sdf.org/post/29607342

Archived

Here is the data at Hugging Face.

A team of international researchers from leading academic institutions and tech companies upended the AI reasoning landscape on Wednesday with a new model that matched—and occasionally surpassed—one of China's most sophisticated AI systems: DeepSeek.

OpenThinker-32B, developed by the Open Thoughts consortium, achieved a 90.6% accuracy score on the MATH500 benchmark, edging past DeepSeek's 89.4%.

The model also outperformed DeepSeek on general problem-solving tasks, scoring 61.6 on the GPQA-Diamond benchmark compared to DeepSeek's 57.6. On the LCBv2 benchmark, it hit a solid 68.9, showing strong performance across diverse testing scenarios.

...

top 5 comments

[–] [email protected] 16 points 1 month ago

Sounds like a real Sputnik moment happening here, live before our eyes.

[–] [email protected] 15 points 1 month ago (1 children)

When you already have the answer, it's incredibly simple to reverse engineer the question

[–] [email protected] 14 points 1 month ago (1 children)

[–] [email protected] 10 points 1 month ago

In the 7-bit ASCII character set, ASCII code 42 is represented by the character *, which represents anything you want it to.

[–] [email protected] 2 points 1 month ago

Benchmarks are great if not self-masturbatory, but what about UX with them?