CEO of AI training startup says humans will still be involved in data creation for decades
Briefly

CEO of AI training startup says humans will still be involved in data creation for decades
""When I first started this job, the main push back I always got was that synthetic data will take over and you just will not need human feedback two to three years from now," said Fitzpatrick, who joined the startup last year. "From first principles, that actually doesn't make very much sense." Synthetic data refers to data that is artificially created."
"On the podcast, Fitzpatrick said that there are too many kinds of tasks for AI to accomplish in the world, and it would take a long time to do them accurately with language and cultural context in mind. For example, the legal industry contains vast amounts of nonpublic information. "On the GenAI side, you are going to need humans in the loop for decades to come," he said."
Human feedback is essential for AI training because synthetic data cannot replicate the variety and contextual nuance of many real-world tasks. Synthetic data is artificially created and is useful when real data is scarce or restricted by privacy. Numerous tasks require language and cultural context that are difficult to model with synthetic data alone, and industries such as legal work contain large amounts of nonpublic information. Humans will be needed in the loop for decades to ensure accuracy and contextual relevance. Data labeling startups continue to hire specialized workers to provide high-quality labeled data for tech companies. Invisible raised $100 million at a $2 billion valuation and competes with firms like Scale AI and Surge AI.
Read at Business Insider
Unable to calculate read time
[
|
]