

Technology
‘Studying AI bias in the Indian societal context’ – Crypto News
Besides opening your PaLM API for developer access, would Google also be backing developer projects in India?
Today, there are so many startups and developers looking to build solutions that serve these customers. What we’re now enabling is for them to start using our APIs, to build these solutions. We also have various teams, including customer engineering units and at our Google Cloud division, who already have pre-existing relationships with many developers. Depending on that, these teams will provide further hand-holding and assistance in terms of making the most of our generative AI APIs.
Researchers at Indian institutes have struggled with availability of digitized datasets in local languages. Would Google’s dataset now be available to institutes?
We already do that — Project Vaani was done in collaboration with the Indian Institute of Science (IISc). Through this, we’re seeing the first-ever digital dataset for Indic languages, for AI researchers.
When we started working on establishing a single generative AI model for 125 Indian languages, all of these languages were what researchers call zero-corpus. It’s not that we had very little data—for many of them, we had absolutely no digitized data at all. For the first time, we’ve managed to move many Indian languages from zero-corpus to at least the low-resource level.
All of this data is now open-sourced, which means that it is now openly available to academic researchers, startups, and even large companies. This is just the first tranche — over the coming months and the next one year, we’ll keep making more Indian language data available to our database. This will continue to happen as we keep scaling our efforts to more districts across India, through which the dataset that we have will become more diverse.
You’ve also open-sourced a local language bias benchmark in India. Given that data on Indian languages is still so scarce, is it possible to address AI bias at this stage?
The first and foremost thing that we did in bias was to start understanding the issue in a non-Western context. If you look at most AI literature on bias, up until two years ago, all of it—including understanding race and gender-based biases—were in the Western context. Hence, what we recognized is that there is a major societal context here — in India, for instance, there are multiple additional axes of bias that are based on caste, religion and others. We wanted to understand these. There is a technological gap in this regard, because the capability of language models were poorer in Indian languages than in more mature languages such as English. It is well-known that LLMs can hallucinate, which leads to misinformation in the output results. Hence, the problems (such as those of bias) often become worse in lower resource languages.
Then, there is also a pillar of aligning values. For instance, while confronting an elderly user’s queries in stoic phrases is acceptable in a Western cultural context, the same within India would not necessarily be so.
We wanted to understand these issues in the Indian cultural context—the technological gap of data is just one aspect that was missing in terms of understanding bias in an Indianized context. This would therefore apply even to English within the Indian context.
How good is the benchmark in addressing these biases?
It’s a start. We’ve already used our LLMs to automatically create certain phrases and sentence completions, through which we were able to get a comprehensive set of stereotypes that we uncovered in the local context.
In addition to this, we’re also engaging with the research community, and using our interactions to uncover additional sources of bias. These have led to multiple interesting ideas around intersectional issues of bias — for instance, in the case of a Dalit woman, a combination of gender and caste-based biases may come together within the model, which is what we’re working to identify and develop now.
How is the data on Indian languages collected by Google?
The entire effort is driven by IISc, and we’ve collaborated with them to share best practices on what we need the dataset to be like, in order for it to be used well by AI researchers. The IISc, in turn, has partners that operationalize their data collection efforts by having people reach various districts.
There, these partners then show a set of images to local residents, and record their local dialect answers.
Lack of compute is another major challenge, alongside data. Would Google also answer this for those who work on generative AI projects?
Yes. In many cases, we’ve been offering researchers access to free Google Cloud credits. This allows them to run their own AI models on our cloud infrastructure.
Compute is a significant enabler for building AI models, and is often hard to access for many developers and researchers. We recognize that, and we’ve been accordingly providing compute capabilities wherever feasible.
What contribution does Google Research India make in the development of PaLM, or even Bard?
We have significant engineering and research teams in India. In particular, our research lab has been making critical contributions to extending multilingual capabilities of LLMs within Google. We’ve of course started with Indian languages, but a lot of our work has been done in a manner that the same principles can be applied more broadly across other under-resourced languages around the world. This can help other languages also understand aspects around bias and misinformation.
Is it possible for versions of generative AI models to work on-device?
Our PaLM API runs on the cloud. But, there are certain generative AI capabilities that are becoming available on-device. They would be offline, and would be highly reduced models that are distilled for local functioning. They wouldn’t be as powerful as the ones that run on the cloud, but there are such models that exist today.
For instance, there are some versions of the PaLM API that are internally available, and work on-device.
Catch all the technology news and Updates on Live Mint. Download Mint News App to get Daily market update Live business news,
Updated: 28 Jun 2023, 10:00 PM IST
-
De-fi1 week ago
Wells Fargo Lifts Bitcoin ETF Holdings to $160 Million – Crypto News
-
Technology1 week ago
99% Approval Odds? How Close Are We to Spot Solana ETF Launch in US? – Crypto News
-
others6 days ago
Tom Lee’s BitMine Ethereum Treasury Tops $6.6B, Overtakes MARA in Crypto Holdings – Crypto News
-
Cryptocurrency1 week ago
Solana’s $200 Comeback Is No Mere ‘Speculative Pop’ – Crypto News
-
Cryptocurrency1 week ago
How to Invest in Penny Stocks: Strategies That Actually Work – Crypto News
-
De-fi1 week ago
SharpLink Raises $400 Million as Ethereum Treasury Swells to $3.3 Billion – Crypto News
-
De-fi1 week ago
SharpLink Raises $400 Million as Ethereum Treasury Swells to $3.3 Billion – Crypto News
-
Business1 week ago
Coinbase Completes $2.9B Deal To Acquire Deribit Amid ‘Everything Exchange’ Push – Crypto News
-
Metaverse1 week ago
‘Should I open the door in…’: Meta’s flirty AI chatbot invites 76-year-old to ‘her apartment’ – What happens next? – Crypto News
-
Business1 week ago
Gemini Details IPO Plans Amid Increasing Losses and Ripple Loan Agreement – Crypto News
-
Business1 week ago
Breaking: U.S. Bitcoin Reserves Worth Up To $20 Billion, Scott Bessent Confirms – Crypto News
-
Business1 week ago
Pi Network Set for RWA Tokenization as Stellar Partners with ERC-3643 Association – Crypto News
-
Technology1 week ago
Aravind Srinivas-led Perplexity’s $34.5bn Chrome bid: Are browsers the next AI battleground? – Crypto News
-
Blockchain1 week ago
Novogratz Worries About Economy If Bitcoin Reaches $1M In 2026 – Crypto News
-
others1 week ago
Binance Coin Price Eyes $1K on BNB Treasury Boom, ETF Approval Hopes – Crypto News
-
others1 week ago
How Bitcoin Made Satoshi Nakamoto Richer Than Bill Gates: Net Worth Revealed – Crypto News
-
Cryptocurrency1 week ago
Altcoins soar, Bitcoin stalls as Fed rate cut speculation hits fever pitch – Crypto News
-
Blockchain1 week ago
How to Pay for Flights with Crypto in the UAE: A Complete Step-by-Step Guide – Crypto News
-
Blockchain1 week ago
Ronin Network is Coming Back Home to Ethereum in 2026 – Crypto News
-
Metaverse1 week ago
‘A 25-year-old in Mumbai…’: ChatGPT mastermind Sam Altman bets big on India, poised to be OpenAI’s top market – Crypto News
-
Technology1 week ago
iPhone 17 Air launching next month: Price, display, processor, battery and everything expected – Crypto News
-
Cryptocurrency1 week ago
How to Avoid Capital Gains Tax on Cryptocurrency – Crypto News
-
Cryptocurrency6 days ago
DAT-a crunch: Momentum builds around ETH treasury companies – Crypto News
-
Technology1 week ago
AI just changed how we travel! This new Google tool could save you thousands on flights: Here’s how – Crypto News
-
others1 week ago
$2.5T Citigroup Considers Custody Services for Crypto ETFs and Stablecoins – Crypto News
-
Business1 week ago
Crypto Liquidations Close to $1B as Scott Bessent Revises US Treasury Bitcoin Stance – Crypto News
-
Cryptocurrency1 week ago
CYBER price explodes 80% to YTD high above $4.5: here’s why – Crypto News
-
others1 week ago
Breaking: Federal Reserve to End Program That Targeted Crypto Banking – Crypto News
-
others1 week ago
Dow Jones falls from record highs after consumer sentiment declines – Crypto News
-
Technology1 week ago
iOS 26 brings a new AI-powered feature to extend your iPhone’s battery life: Here’s how it works – Crypto News
-
Cryptocurrency1 week ago
Norway’s $1.6 trillion wealth fund boosts indirect Bitcoin exposure by 192% in Q2 2025 – Crypto News
-
others1 week ago
Trump and Putin joint press conference ends with no deal – Crypto News
-
Cryptocurrency7 days ago
Ethereum vs. Bitcoin: Here’s why ETH can be a better 2025 risk-on pick – Crypto News
-
Technology7 days ago
Google’s Gemini AI is training on your personal conversations by default. Here’s how you can turn if off – Crypto News
-
Business4 days ago
Ripple’s RLUSD Gains Spotlight as OCC Permits Bank–Stablecoin Partnerships – Crypto News
-
others4 days ago
MSTR Stock Crashes As Michael Saylor Takes U-turn on mNAV Policy – Crypto News
-
Technology1 week ago
AI threatens entry-level lobs? Tech openings for new grads have already been halved, says report – Crypto News
-
Blockchain1 week ago
Blockchain Security Must Localize To Stop Asia’s Crypto Crime Wave – Crypto News
-
others7 days ago
Japan CFTC JPY NC Net Positions dipped from previous ¥82K to ¥74.2K – Crypto News
-
Blockchain6 days ago
Circle’s Arc to Launch with Fireblocks Integration as Stablecoin Race Intensifies – Crypto News
-
Cryptocurrency6 days ago
U.S. Treasury Seeks Public Input on GENIUS Act Stablecoin Rules – Crypto News
-
Cryptocurrency1 week ago
Why XRP price has failed to breakout despite SEC settlement – Crypto News
-
De-fi7 days ago
SEC Postpones Decision on Bitwise and 21Shares Solana ETFs to October – Crypto News
-
Technology6 days ago
From ‘Step Mom’ to ‘Russian Girl’: Meta’s sexualised AI chatbots flood Instagram, Facebook; netizens call it ‘dystopian’ – Crypto News
-
De-fi6 days ago
Citigroup Weighs Stablecoin and Crypto ETF Custody Services – Crypto News
-
Technology6 days ago
Setback for Apple? More executive exits are coming for tech giant, says report – Crypto News
-
De-fi6 days ago
Crypto Sponsorships in Football Hits $565 Million as Clubs Embrace Digital Finance – Crypto News
-
De-fi5 days ago
1inch Unveils Solana Integration for Cross-Chain Swaps – Crypto News
-
Business3 days ago
Coinbase To List Trump’s World Liberty Financial USD1 Stablecoin – Crypto News
-
Business2 days ago
DeFi Scores Major Win: DOJ Softens Stance on Money Transmitting Charges – Crypto News