

‘Studying AI bias in the Indian societal context’
Besides opening your PaLM API for developer access, would Google also be backing developer projects in India?
Today, there are so many startups and developers looking to build solutions that serve these customers. What we’re now enabling is for them to start using our APIs to build these solutions. We also have various teams, including customer engineering units and our Google Cloud division, that already have relationships with many developers. Building on those relationships, these teams will provide further hand-holding and assistance in making the most of our generative AI APIs.
Researchers at Indian institutes have struggled with the availability of digitized datasets in local languages. Would Google’s dataset now be available to institutes?
We already do that — Project Vaani was done in collaboration with the Indian Institute of Science (IISc). Through this, we’re seeing the first-ever digital dataset for Indic languages, for AI researchers.
When we started working on establishing a single generative AI model for 125 Indian languages, all of these languages were what researchers call zero-corpus. It’s not that we had very little data—for many of them, we had absolutely no digitized data at all. For the first time, we’ve managed to move many Indian languages from zero-corpus to at least the low-resource level.
All of this data is now open-sourced, which means that it is openly available to academic researchers, startups, and even large companies. This is just the first tranche; over the coming months and the next year, we’ll keep adding more Indian language data to our database. This will continue as we scale our efforts to more districts across India, which will make the dataset more diverse.
You’ve also open-sourced a local language bias benchmark in India. Given that data on Indian languages is still so scarce, is it possible to address AI bias at this stage?
The first thing we did on bias was to start understanding the issue in a non-Western context. If you look at most AI literature on bias up until two years ago, all of it, including work on understanding race and gender-based biases, was in the Western context. What we recognized is that there is a major societal context here: in India, for instance, there are multiple additional axes of bias that are based on caste, religion and others. We wanted to understand these. There is also a technological gap in this regard, because the capability of language models was poorer in Indian languages than in more mature languages such as English. It is well-known that LLMs can hallucinate, which leads to misinformation in the output results. Hence, the problems (such as those of bias) often become worse in lower-resource languages.
Then, there is also a pillar of aligning values. For instance, while confronting an elderly user’s queries in stoic phrases is acceptable in a Western cultural context, the same within India would not necessarily be so.
We wanted to understand these issues in the Indian cultural context—the technological gap of data is just one aspect that was missing in terms of understanding bias in an Indianized context. This would therefore apply even to English within the Indian context.
How good is the benchmark in addressing these biases?
It’s a start. We’ve already used our LLMs to automatically create certain phrases and sentence completions, through which we were able to get a comprehensive set of stereotypes that we uncovered in the local context.
In addition to this, we’re also engaging with the research community, and using our interactions to uncover additional sources of bias. These have led to multiple interesting ideas around intersectional issues of bias — for instance, in the case of a Dalit woman, a combination of gender and caste-based biases may come together within the model, which is what we’re working to identify and develop now.
How is the data on Indian languages collected by Google?
The entire effort is driven by IISc, and we’ve collaborated with them to share best practices on what the dataset needs to be like in order for it to be used well by AI researchers. The IISc, in turn, has partners that operationalize its data collection efforts by sending people to various districts.
There, these partners show a set of images to local residents and record their answers in the local dialect.
Lack of compute is another major challenge, alongside data. Would Google also address this for those who work on generative AI projects?
Yes. In many cases, we’ve been offering researchers access to free Google Cloud credits. This allows them to run their own AI models on our cloud infrastructure.
Compute is a significant enabler for building AI models, and is often hard to access for many developers and researchers. We recognize that, and we’ve been accordingly providing compute capabilities wherever feasible.
What contribution does Google Research India make in the development of PaLM, or even Bard?
We have significant engineering and research teams in India. In particular, our research lab has been making critical contributions to extending the multilingual capabilities of LLMs within Google. We’ve of course started with Indian languages, but a lot of our work has been done in such a way that the same principles can be applied more broadly across other under-resourced languages around the world. This can also help in understanding aspects of bias and misinformation in those languages.
Is it possible for versions of generative AI models to work on-device?
Our PaLM API runs on the cloud. But, there are certain generative AI capabilities that are becoming available on-device. They would be offline, and would be highly reduced models that are distilled for local functioning. They wouldn’t be as powerful as the ones that run on the cloud, but there are such models that exist today.
For instance, there are some versions of the PaLM API that are internally available, and work on-device.
Updated: 28 Jun 2023, 10:00 PM IST