

Technology
Google taps AI to grasp India’s language diversity – Crypto News
NEW DELHI : Google India has teamed up with the Indian Institute of Science (IISc) for the Project Vaani initiative that would gather speech data across India and use it to create an artificial intelligence (AI)-based language model that can understand diverse Indian languages and dialects .
The project is part of Bengaluru-based IISc and AI and Robotics Technology Park’s (Artpark) Bhasha AI project that includes SYSPIN (Synthesizing Speech in Indian languages) and RESPIN (Recognizing Speech in Indian languages).
“India’s spoken languages change every few kilometres…machines have no hope. So, research and innovation for inclusive language AI requires capturing this diversity in our datasets,” Prasanta Kumar Ghosh, a professor at IISc, who leads these initiatives, said, explaining the reasons for launching Project Vaani.
Google and IISc plan to collect speech samples from 773 districts. The initiative, currently focused in 80 districts across 10 states, is expected to expand to every district over the next couple of years and boost the size and diversity of India’s open-sourced language data, with over 150,000 hours of curated speech and 100 million sentences of text in Indian scripts. Artpark and IISc simultaneously plan to launch challenges for researchers and startups to build applications in areas such as health, agriculture, and financial inclusion using these datasets.
Manish Gupta, director of Google Research India, said Vaani would be trained on speech and text data from over 100 Indian languages. He said the new model is a leap over Multilingual Representations for Indian Languages (MuRIL), which was a text-only model. The new model supports both speech and text.
“We want to make sure that any language which is spoken by 100,000 people is covered,” he added. MuRIL is a Bert-based language model trained on 17 Indian languages. Bert or Bidirectional Encoder Representation from Transformers is a Google-developed machine language (ML)-based technique to learn contextual relations between words to generate a language model.
Gupta also announced another AI model that will use satellite imagery to offer agriculture-related insights to agritech startups and policymakers and an AI-based optical character recognition (OCR) tool that has been trained to read handwritten medical prescriptions.
Google Research India also announced a $1 million grant for IIT Madras to open a Center for Responsible AI in India. Another grant of a similar amount will be offered to Wadhwani Foundation to support the deployment of AI models.
The new language model is part of a wider Google initiative to build a model for 1,000 global languages, said Gupta. “We want to ensure that Indian languages are front and center in terms of representation in this model,” he said.
Gupta explained that many Indian languages have relatively lower resources. Models like Bert are built on available web resources, and since Indian languages tend to be less represented, often the capability of these models with Indian languages is not as good as expected by researchers. That said, Gupta also cautioned that language models must be handled carefully for the good of society.
He pointed out that models like Language Model for Dialogue Applications (LaMDA) and ChatGPT are prone to hallucination. “They may come up with an explanation that sounds convincing but is actually bogus. Working on the development of these AI models in a responsible manner becomes very important,” he added.
The $1 million grant to IIT Madras to establish a Center for Responsible AI in India is an attempt to bring together researchers from other institutes and other fields like social science and law. “A lot of research on responsible AI has been done in a western context. In India, there are additional dimensions of bias based on region and caste. It is important that we study all these biases in the Indian context and keep them in mind while developing these AI models,” he added.
Similarly, the AI model for agriculture is an attempt to solve many of the problems in the sector by applying AI models to satellite imagery. “Our work focuses on a combination of remote sensing and AI. We will apply the model to identify farm boundaries and landscape understanding. Then we can go deeper into what crop is being grown on each farm and what is the likely yield,” said Gupta.
Gupta said Google would work with partners in the ecosystem and make this data available to the government, policymakers, and startups that are building agri solutions and contribute to the agri stack that the Indian government is defining. Google has been working on a pilot for this with the Telangana government.
Catch all the Technology News and Updates on Live Mint. Download Mint News App to get Daily market update Live business news,
-
Technology1 week ago
Who is Daniel Gross? Tech veteran who joins Meta as Zuckerberg deepens AI talent hunt – Crypto News
-
Blockchain7 days ago
Bitcoin Consolidation Continues: These Are Two Key Support Levels To Watch – Crypto News
-
Blockchain1 week ago
Wall Street Moves on-Chain Amid Tokenization of US Stocks – Crypto News
-
Cryptocurrency1 week ago
Trent Share Price Crashes Over 9% After Weak Q1 Forecast, Nuvama Downgrade – Crypto News
-
Blockchain7 days ago
Bitcoin Consolidation Continues: These Are Two Key Support Levels To Watch – Crypto News
-
Blockchain7 days ago
Bitcoin Consolidation Continues: These Are Two Key Support Levels To Watch – Crypto News
-
Blockchain7 days ago
Bitcoin Consolidation Continues: These Are Two Key Support Levels To Watch – Crypto News
-
Blockchain7 days ago
Bitcoin Consolidation Continues: These Are Two Key Support Levels To Watch – Crypto News
-
Business7 days ago
Is Roger Ver the Satoshi Era Bitcoin Whale Behind $8 Billion BTC Transfer? – Crypto News
-
Business1 week ago
Why Is Crypto Market Up Today? – Crypto News
-
Cryptocurrency1 week ago
Nio Stock Price Forecast for 2025, 2027, and 2030: Buy the Dip? – Crypto News
-
Technology1 week ago
Turkey Bans Binance Chain DEX PancakeSwap Over Licensing Concerns – Crypto News
-
Technology1 week ago
‘Notice the difference’: Elon Musk claims major upgrade to Grok chatbot’s question-answering abilities – Crypto News
-
Cryptocurrency7 days ago
Binance stacks Ethereum at yearly high, U.S. funds buy more: So why isn’t ETH moving? – Crypto News
-
others4 days ago
Will Ethereum Price Rally to $3,200 as Wall Street Pivots from BTC to ETH – Crypto News
-
Metaverse1 week ago
ChatGPT, Claude and Gemini not helping? Here’s how to fix your prompts for better output – Crypto News
-
Technology1 week ago
No Waiting, No Hassles—CCE.Cash Delivers Instant Cross‑Chain Crypto Swaps – Crypto News
-
others1 week ago
Will Solana Price Rally or Crash in July? – Crypto News
-
Technology1 week ago
JA Mining Redefines Crypto Income with Accessible Crypto Mining Platform – Crypto News
-
Business1 week ago
XRP Price Jumps As Ripple Applies for Banking License, Is $3 Next? – Crypto News
-
others1 week ago
Judge Torres Has No More Role in XRP Vs SEC Lawsuit, Says Former SEC Lawyer – Crypto News
-
Blockchain1 week ago
Where Did Bitcoin’s Retail Go? Look Offchain – Crypto News
-
Cryptocurrency1 week ago
XRP: Mini Death Cross Surprise, Shiba Inu (SHIB): It’s Not Normal, Bitcoin (BTC): Fundamental Breakout Secured – Crypto News
-
Cryptocurrency1 week ago
Zelenskyy’s attire divides Polymarket with $79M at stake – Crypto News
-
Cryptocurrency7 days ago
Ripple CTO Reveals How Many Bitcoins He Has Mined – Crypto News
-
De-fi7 days ago
World Liberty Finance Opens Vote to List $WLFI Token – Crypto News
-
Cryptocurrency6 days ago
TON Foundation Confirms UAE Golden Visa Offer Is Not Official – Crypto News
-
others2 days ago
Skies are clearing for Delta as stock soars 13% on earnings beat – Crypto News
-
others2 days ago
Skies are clearing for Delta as stock soars 13% on earnings beat – Crypto News
-
others1 week ago
Gold price in Philippines: Rates on July 4 – Crypto News
-
Business1 week ago
XRP Price Nears 50% Surge as XXRP ETF Nears $160M Milestone – Crypto News
-
Cryptocurrency1 week ago
Bitcoin Price Drops After Rejection at $110K Amid Unusual On-Chain Activity – Crypto News
-
Cryptocurrency1 week ago
Wintermute secures Bitcoin-backed credit line from Cantor Fitzgerald – Crypto News
-
others1 week ago
XAG/USD advance stalls near $37.00 as holiday lull masks bullish setup – Crypto News
-
Cryptocurrency1 week ago
XRP price rises 15% to $2.24, but whale sell-off raises downside risk – Crypto News
-
Blockchain1 week ago
Bitcoin Suisse Exec Laments EU and Swiss Stablecoin Rules – Crypto News
-
others1 week ago
Will SUI Price Rally to $6 After Reclaiming $3? – Crypto News
-
others1 week ago
Bearish outlook remains in play near 1.3600 – Crypto News
-
Technology1 week ago
Google’s EU search results could soon feature competitors first to avoid DMA fines: Report – Crypto News
-
others1 week ago
US Dollar Primed To Weaken Further Amid the Worst First-Half-Year Performance Since 1973: S&P Global – Crypto News
-
De-fi1 week ago
Less Than 5% of Wallets Generate Most On-Chain Value: Report – Crypto News
-
Blockchain1 week ago
Bitcoin Price To See 52% Increase To $166,000, Analyst Reveals Tight Timeline – Crypto News
-
others1 week ago
GBP/USD moves little as traders remain cautious amid uncertainty – Crypto News
-
Technology1 week ago
Donald Trump Threatens Tariffs Of Up To 70% Ahead July 9 Deadline – Crypto News
-
others1 week ago
Singapore Retail Sales (MoM) climbed from previous 0.3% to 1% in May – Crypto News
-
others1 week ago
Singapore Retail Sales (MoM) climbed from previous 0.3% to 1% in May – Crypto News
-
Cryptocurrency1 week ago
BlockDAG Named Official Blockchain Partner of Seattle Seawolves—Details Inside – Crypto News
-
others1 week ago
XAG/USD advance stalls near $37.00 as holiday lull masks bullish setup – Crypto News
-
others1 week ago
XAG/USD advance stalls near $37.00 as holiday lull masks bullish setup – Crypto News
-
others1 week ago
Hong Kong To Launch Third Tokenized Bond with ETF Stamp Duty Relief – Crypto News