The researchers used data from the SAYCam database on an Australian baby known only as Sam, who is now 11 years old.
Trained on just 61 hours of footage of Sam, including 600,000 video frames paired with 37,500 transcribed words, the AI was able to match basic nouns to images on par with an AI trained on 400 million captioned images.