Karya is a data company that labels, annotates and categorizes data that is used to train AI, especially those meant for non-English speaking part of the Indian population. (REUTERS)AI 

10 Things to Know About Manu Chopra’s Karya and Its India-Focused AI Development

During an interview with UK PM Rishi Sunak, Elon Musk compared artificial intelligence to a magic genie and emphasized the importance of data as the magic lamp that houses the genie. Just as the genie cannot grant wishes without the lamp, AI models cannot function effectively without high-quality data. Manu Chopra, a 27-year-old Stanford alum, recognized the significance of data annotation and labeling long before the AI frenzy sparked by OpenAI’s ChatGPT. This led him to establish Karya, a startup specializing in accurate non-English data annotation. Karya’s services cater to companies developing AI models for individuals in India who do not comprehend English. Here are 10 key points about Manu and Karya.

10 key points from Karya

1. Manu Chopra is a 27-year-old computer engineer from Stanford University.

2. Chopra founded Karya in 2021. It is a data annotation company that acquires, labels and annotates data.

3. Karya differentiates itself from other data providers by offering its contractors – mostly women living in rural communities – up to 20 times the prevailing minimum wage.

4. First of all, of course, quality is required. Karya promises its customers high-quality Indian language data with high accuracy that allows AI models to learn without picking up bias, misinformation, or poor quality data.

5. “Every year, big tech companies spend billions of dollars collecting training data for their AI and machine learning models,” Manu Chopra told Bloomberg. And there the high costs are justified and easily absorbed.

6. Microsoft has used Karya to acquire local voice data for its AI products. The Bill & Melinda Gates Foundation is working with Karya to reduce gender bias in data fed into large language models, the technology that powers AI chatbots.

7. Google is also working with Karya and other local partners to collect voice data in 85 districts of India. Google plans to expand into each district by including the majority language or dialect, and build a generative AI model for 125 Indian languages.

8. According to Bloomberg, Karya employs 70 workers in Agara and neighboring villages to collect text, audio and image data in Indian vernacular languages.

9. Karya uses a user-friendly application and a work-from-anywhere model so that everyone with a smartphone can be a Karya employee! Karya connects digital workers with different data set requirements in four areas.

10. According to the startup, it collects customer requirements, breaks down the requirements into bite-sized digital tasks, identifies the best-suited employees, collects the data, validates the data, and synthesizes it into high-quality AI/ML. training datasets. It also claims to source the data ethically.

Related posts

Leave a Comment