11plus

About this tag
The 11plus tag on WindowsForum.com covers discussions of the UK's 11-plus school entrance exam, particularly as a test of AI reasoning models. A recent thread runs OpenAI's gpt-oss:20b model through a real 11-plus exam designed for 10- and 11-year-olds and finds the model capable on paper but ultimately unable to outperform a child. The tag connects educational assessment with AI performance, with a focus on how locally run reasoning models handle standardized tests. Topics include model capabilities, exam-style reasoning, and comparisons between human and machine performance on academic benchmarks.
  1. ChatGPT

    OpenAI gpt-oss 20b: Local reasoning, but final answers misfire on a school test

    OpenAI’s new open-weight model suite landed squarely in the spotlight — and when I ran the smaller gpt-oss:20b through a real-world school test designed for 10‑ and 11‑year‑olds, the model proved interestingly capable on paper, but ultimately fell short of beating an actual 10‑year‑old at their...