These Clues Hint at the True Nature of OpenAI's Shadowy Q* Project

from WIRED 4 months ago

The name may be an allusion to Q-learning... Some have suggested that the name may also be related to the A* search algorithm... That appears to be a reference to the idea of training algorithms with so-called synthetic training data...
WIREDhttps://www.wired.com/story/fast-forward-clues-hint-openai-shadowy-q-project/

Subbarao Kambhampati... thinks that Q* may involve using huge amounts of synthetic data, combined with reinforcement learning, to train LLMs to specific tasks...
WIREDhttps://www.wired.com/story/fast-forward-clues-hint-openai-shadowy-q-project/

Q* could be an effort to use reinforcement learning and a few other techniques to improve a large language model's ability to solve tasks by reasoning through steps along the way... it's unclear whether it would automatically suggest AI systems could evade human control.
WIREDhttps://www.wired.com/story/fast-forward-clues-hint-openai-shadowy-q-project/

Read at WIRED

#Q* #reinforcement learning #synthetic data #large language models #AI control

[

]

[

...

]

These Clues Hint at the True Nature of OpenAI's Shadowy Q* ProjectThese Clues Hint at the True Nature of OpenAI's Shadowy Q* Project Briefly

These Clues Hint at the True Nature of OpenAI's Shadowy Q* Project
These Clues Hint at the True Nature of OpenAI's Shadowy Q* Project
Briefly