OpenAI: Learning to Reason with LLMs

openai.com

cross-posted to:
[email protected]
[email protected]

-1

OpenAI: Learning to Reason with LLMs

openai.com

howrar@lemmy.caM to Reinforcement Learning@lemmy.caEnglish · 2 months ago

cross-posted to:
[email protected]
[email protected]

OpenAI just put out a blog post about a new model trained via RL (I’m assuming this isn’t the usual RLHF) to perform chain of thought reasoning before giving the user its answer. As usual, there’s very little detail about how this is accomplished so it’s hard for me to get excited about it, but the rest of you might find this interesting.

You must log in or register to comment.

Chat

Warning: Some posts on this platform may contain adult material intended for mature audiences only. Viewer discretion is advised. By clicking ‘Continue’, you confirm that you are 18 years or older and consent to viewing explicit content.

OpenAI: Learning to Reason with LLMs

OpenAI: Learning to Reason with LLMs