

Why voice is emerging as India’s next frontier for AI interaction – Crypto News
Unlike text, which is relatively uniform, spoken language is richly layered, with cultural nuances, colloquialisms and emotion. Startups building voice-first AI models are now doubling down on one thing above all else: the depth and diversity of datasets.
Why voice is emerging as the frontline interface
In India, where oral tradition plays a pivotal role in communication, voice isn’t just a convenience—it’s a necessity. “We’re not an English-first or even a text-first country. Even when we type in Hindi, we often use the English script instead of Devanagari. That’s exactly why we need to build voice-first models—because oral tradition plays such a vital role in our culture,” said Abhishek Upperwal, chief executive officer (CEO) of Soket AI Labs.
Voice is also proving critical for customer service and accessibility. “Voice plays a crucial role in bridging accessibility gaps, particularly for users with disabilities,” said Mahesh Makhija, leader, technology consulting, at EY.
“Many customers even prefer voicing complaints over typing, simply because talking feels more direct and human. Moreover, voice is far more frictionless than navigating mobile apps or interfaces—especially for users who are digitally-illiterate, older, or not fluent in English,” said Makhija, adding that “communicating in vernacular languages opens access to the next half a billion consumers, which is a major focus for enterprises.”
Startups like Gnani.ai are already deploying voice systems across banking and financial services to streamline customer support, assist with loan applications, and eliminate virtual queues. “The best way to reach people—regardless of literacy levels or demographics—is through voice in the local language, so it’s very important to capture the tonality of the conversations,” said Ganesh Gopalan, CEO of Gnani.ai.
The hunt for rich, real-world data
As of mid-2025, India’s AI landscape shows a clear tilt toward text-based AI, with over 90 Indian companies active in the space, compared to 57 in voice-based AI. Text-based platforms tend to focus on document processing, chat interfaces, and analytics. In contrast, voice-based companies are more concentrated in customer service, telephony, and regional language access, according to data from Tracxn.
Voice-first AI startups have attracted larger funding rounds at later stages, while funding for text AI startups is more broadly distributed, especially at earlier stages.
For example, Skit.ai, a voice-first AI firm, raised a total of $47.6 million across five funding rounds. Similarly, Yellow.ai has cumulatively secured around $102 million, including a major $78.15 million Series C round in 2021, making it one of the top-funded startups in voice AI, data from Tracxn shows.
However, data remains the foundational challenge for voice models. Voice AI systems need massive, diverse datasets that cover not only different languages, but also regional accents, slang and emotional tonality.
Chaitanya C., co-founder and chief technology officer of Ozonetel Communications, put it simply: “The datasets matter the most. Speaking as an AI engineer, I can say it’s not about anything else; it’s all about the data.”
The IndiaAI Mission has allocated ₹199.55 crore for datasets, just about 2% of the mission’s total ₹10,300 crore budget, while 44% has gone to compute. “Investments solely in compute are inherently transient: their value fades once consumed. On the other hand, investments in datasets build durable, reusable assets that continue to deliver value over time,” said Chaitanya.
He also emphasized the scarcity of rich, culturally relevant data in regional languages like Telugu and Kannada. “The amount of data easily available in English, when compared with Telugu and Kannada or Hindi, it’s not even comparable,” he said. “Somewhere it’s just not perfect, it wouldn’t be as good as an English story, which is why I wouldn’t want it to tell a Telugu story for my kid.”
“Some movie comes out, nobody’s going to write it in government documents, but people are going to talk about it, and that is lost,” he added, pointing out that government datasets often lack cultural nuance and everyday language.
Gopalan of Gnani.ai agreed. “The colloquial language is often very different from the written form. Language experts have a great career path ahead of them because they not only understand the language technically, but also know how to converse naturally and grasp colloquial nuances.”
Startups are now employing creative methods to fill these gaps. “First, we collect data directly from the field using multiple methods—and we’re careful with how we handle that data. Second, we use synthetic data in some cases. Third, we augment that synthetic data further. In addition, we also leverage a substantial amount of open-source data available from universities and other sources,” Gopalan said.
Synthetic data is artificially generated data that mimics real-world data for use in training, testing, or validating models.
Upperwal added that Soket AI uses a similar approach: “We start by training smaller AI models with the limited real voice data we have. Once these smaller models are reasonably accurate, we use them to generate synthetic voice data—essentially creating new, artificial examples of speech.”
However, some companies intend to stay away from synthetic data.
Ankush Sabarwal, CEO and founder of CoRover AI, said the company relies exclusively on real data and deliberately avoids synthetic data. “If I am a consumer and I am interacting with an AI bot, the AI bot will become intelligent by the virtue of it interacting with a human like me,” he said.
The ethical labyrinth of voice AI
As companies begin to scale their data pipelines, the new Digital Personal Data Protection (DPDP) Act will shape how they collect and use voice data.
“The DPDP law emphasizes three key areas: it mandates clear, specific, and informed consent before collecting data. Second, it enforces purpose limitation—data can only be used for legitimate, stated purposes like KYC or employment, not unrelated model training. Third, it requires data localization, meaning critical personal data must reside on servers in India,” said Makhija.
He added, “Companies have begun including consent notices at the start of customer calls, often mentioning AI training. However, the exact process of how this data flows into model training pipelines is still evolving and will become clearer as DPDP rules are fully implemented.”
Outsourcing voice data collection raises red flags, too. “For a deep-tech company like ours, voice data is one of the most powerful forms of IP (intellectual property) we have, and outsourcing it could compromise its integrity and ownership. What if someone is using copyrighted material?” said Gopalan.