"We don't add languages to the platform without communities," says EM Lewis-Jong, product director at Mozilla. "It sounds like a small thing, but I think in the current AI age, it actually is weirdly radical to be consent-centered."
In some cases, the dataset has been used by smaller projects focused on specific tasks, like delivering multilingual legal advice, providing information about governance, or building voice-powered chatbots with local agricultural information.
Lewis-Jong says it's been used by Big Tech companies, small independent operations, and plenty of projects in between. The dataset has been downloaded from Mozilla millions of times.
Common Voice continues to grow as new material gets recorded in existing languages and new volunteers approach Mozilla to localize the contribution for their own languages.