12/21/2023
Bill Gates was office bully

Preethi, who goes by a single name, as is common in the region, is among the 70 workers hired in Agara and neighboring villages by a startup called Karya to gather text, voice and image data in India’s vernacular languages. She is part of a vast, unseen global workforce, operating in countries like India, Kenya and the Philippines, who collect and label the data that AI chatbots and virtual assistants rely on to generate relevant responses.

Unlike many other data contractors, however, Preethi gets paid well for her efforts, at least by local standards. After three days of working with Karya, Preethi earned 4,500 rupees ($54), more than four times the amount the 22-year-old high school graduate usually makes as a tailor in an entire month. The money is enough, she said, to pay off that month’s installment on a loan taken to partly repair the crumbling mud walls of her home, which have been carefully patched up with colorful saris.

“All I need is a phone and the internet.”

Microsoft Corp. has used Karya to source local speech data for its AI products. The Bill & Melinda Gates Foundation is working with Karya to reduce gender biases in data that feeds into large language models, the technology underpinning AI chatbots. And Alphabet Inc.’s Google is leaning on Karya and other local partners to gather speech data in 85 Indian districts. Google plans to expand to every district to include the majority language or dialect spoken and build a generative AI model for 125 Indian languages.

Many AI services have been disproportionately developed with English-language internet data, such as articles, books and social media posts. As a result, these AI models poorly represent the diversity of languages for internet users in other countries who are accessing AI-powered smartphones and apps faster than they’re learning English. Nearly one billion such potential users live in India alone, as the government pushes for a rollout of AI tools in every sphere from healthcare to education to financial services.

“India is the first non-Western country we are doing this in, and we are testing Bard in nine Indian languages,” said Manish Gupta, head of Google Research in India, referring to the company’s AI chatbot. “Over 70 Indian languages spoken by over a million people each had zero digital corpus. The problem is so stark.”

Gupta ticked off a list of issues that AI firms need to address in order to serve India’s internet users: non-English datasets are of dismally low quality; hardly any conversational data exists in Hindi and other Indian languages; and digitized content from books and newspapers in Indian languages is very limited.