gru:Bz
Average millennial living life on the edge (of the Midwest). Probably too immature for Micro.blog but I like it here.
OpenAI releases o1, its first model with ‘reasoning’ abilities
The training behind o1 is fundamentally different from its predecessors, OpenAI’s research lead, Jerry Tworek, tells me, though the company is being vague about the exact details.
With o1, it trained the model to solve problems on its own using a technique known as reinforcement learning, which teaches the system through rewards and penalties.