These Clues Hint at the True Nature of OpenAI's Shadowy Q* Project
Briefly

The name may be an allusion to Q-learning... Some have suggested that the name may also be related to the A* search algorithm... That appears to be a reference to the idea of training algorithms with so-called synthetic training data...
Subbarao Kambhampati... thinks that Q* may involve using huge amounts of synthetic data, combined with reinforcement learning, to train LLMs on specific tasks...
Q* could be an effort to use reinforcement learning and a few other techniques to improve a large language model's ability to solve tasks by reasoning through steps along the way... it's unclear whether it would automatically suggest AI systems could evade human control.
Read at WIRED