Metaverse
Why voice is emerging as India’s next frontier for AI interaction – Crypto News
Unlike text, which is relatively uniform, spoken language is richly-layered—with cultural nuances, colloquialisms and emotion. Startups building voice-first AI models are now doubling down on one thing above all else: the depth and diversity of datasets.
Why voice is emerging as the frontline interface
In India, where oral tradition plays a pivotal role in communication, voice isn’t just a convenience—it’s a necessity. “We’re not an English-first or even a text-first country. Even when we type in Hindi, we often use the English script instead of Devanagari. That’s exactly why we need to build voice-first models—because oral tradition plays such a vital role in our culture,” said Abhishek Upperwal, chief executive officer (CEO) of Soket AI Labs.
Voice is also proving critical for customer service and accessibility. “Voice plays a crucial role in bridging accessibility gaps, particularly for users with disabilities,” said Mahesh Makhija, leader, technology consulting, at EY.
“Many customers even prefer voicing complaints over typing, simply because talking feels more direct and human. Moreover, voice is far more frictionless than navigating mobile apps or interfaces—especially for users who are digitally-illiterate, older, or not fluent in English,” said Makhija, adding that “communicating in vernacular languages opens access to the next half a billion consumers, which is a major focus for enterprises.”
Startups like Gnani.ai are already deploying voice systems across banking and financial services to streamline customer support, assist with loan applications, and eliminate virtual queues. “The best way to reach people—regardless of literacy levels or demographics—is through voice in the local language, so it’s very important to capture the tonality of the conversations,” said Ganesh Gopalan, CEO of Gnani.ai.
The hunt for rich, real-world data
As of mid-2025, India’s AI landscape shows a clear tilt toward text-based AI, with over 90 Indian companies active in the space, compared to 57 in voice-based AI. Text-based platforms tend to focus on document processing, chat interfaces, and analytics. In contrast, voice-based companies are more concentrated in customer service, telephony, and regional language access, according to data from Tracxn.
In terms of funding, voice-first AI startups have attracted larger funding rounds at later stages, while text AI startups show broader distribution, especially at earlier stages.
For example, Skit.ai, a voice-first AI firm, raised a total of $47.6 million across five funding rounds. Similarly, Yellow.ai has cumulatively secured around $102 million, including a major $78.15M Series C round in 2021, making it one of the top-funded startups in voice AI, data from Tracxn shows.
However, data remains the foundational challenge for voice models. Voice AI systems need massive, diverse datasets that not only cover different languages, but also regional accents, slangs and emotional tonality.
Chaitanya C., co-founder and chief technological officer of Ozonetel Communications, put it simply: “The datasets matter the most—speaking as an AI engineer, I can say it’s not about anything else; it’s all about the data.”
IndiaAI Mission has allocated ₹199.55 crore for datasets—just about 2% of the mission’s total ₹10,300 crore budget —while 44% has gone to compute. “Investments solely in compute are inherently transient—their value fades once consumed. On the other hand, investments in datasets build durable, reusable assets that continue to deliver value over time,” said Chaitanya.
He also emphasized the scarcity of rich, culturally-relevant data in regional languages like Telugu and Kannada. “The amount of data easily available in English, when compared with Telugu and Kannada or Hindi, it’s not even comparable,” he said. “Somewhere it’s just not perfect, it wouldn’t be as good as an English story, which is why I wouldn’t want it to tell a Telugu story for my kid.”
“Some movie comes out, nobody’s going to write it in government documents, but people are going to talk about it, and that is lost,” he added, pointing out that government datasets often lack cultural nuance and everyday language.
Gopalan of Gnani.ai agreed. “The colloquial language is often very different from the written form. Language experts have a great career path ahead of them because they not only understand the language technically, but also know how to converse naturally and grasp colloquial nuances.”
Startups are now employing creative methods to fill these gaps. “First, we collect data directly from the field using multiple methods—and we’re careful with how we handle that data. Second, we use synthetic data in some cases. Third, we augment that synthetic data further. In addition, we also leverage a substantial amount of open-source data available from universities and other sources,” Gopalan said.
Synthetic data is artificially-generated data that mimics real-world data for use in training, testing, or validating models.
Upperwal added that Soket AI uses a similar approach: “We start by training smaller AI models with the limited real voice data we have. Once these smaller models are reasonably accurate, we use them to generate synthetic voice data—essentially creating new, artificial examples of speech.”
However, some intend to consciously stay away from synthetic data.
Ankush Sabarwal, CEO and founder of CoRover AI, said the company relies exclusively on real data, deliberately avoiding synthetic data, “If I am a consumer and I am interacting with an AI bot, the AI bot will become intelligent by the virtue of it interacting with a human like me.”
The ethical labyrinth of voice AI
As companies begin to scale their data pipelines, the new Digital Personal Data Protection (DPDP) Act will shape how they collect and use voice data.
“The DPDP law emphasizes three key areas: it mandates clear, specific, and informed consent before collecting data. Second, it enforces purpose limitation—data can only be used for legitimate, stated purposes like KYC or employment, not unrelated model training. Third, it requires data localization, meaning critical personal data must reside on servers in India,” said Makhija.
He added, “Companies have begun including consent notices at the start of customer calls, often mentioning AI training. However, the exact process of how this data flows into model training pipelines is still evolving and will become clearer as DPDP rules are fully implemented.”
Outsourcing voice data collection raises red flags, too. “For a deep-tech company like ours, voice data is one of the most powerful forms of IP (intellectual property) we have, and outsourcing it could compromise its integrity and ownership. What if someone is using copyrighted material?” said Gopalan.
-
Technology1 week agoMulticloud Agility Comes to Financial Services – Crypto News
-
Technology1 week agoVivo X300, X300 Pro launched in India, price starts at ₹75,999: Display, camera details and all you need to know – Crypto News
-
Technology1 week ago
Jerome Powell Speech Today: What To Expect as Fed Ends QT – Crypto News
-
Cryptocurrency4 days agoIlluminating progress: Is a $140K income ‘poor’? – Crypto News
-
Cryptocurrency1 week agoBitcoin, Ethereum, and XRP Crash Triggering $637M in Liquidations – Crypto News
-
Cryptocurrency1 week ago‘ZEC Is 20x Lower Than XRP’: Solana Builder Breaks Silence After Zcash’s 50% Crash – Crypto News
-
others1 week ago
Ethereum Price Crashes Below $3,000 as $500M Longs Liquidated: What’s Next? – Crypto News
-
Technology1 week ago
Breaking: First U.S. Chainlink ETF Goes Live as Grayscale Launches ‘GLNK’ – Crypto News
-
others1 week ago
Grayscale Cleared to Launch First Spot Chainlink ETF This Week Amid Rising Demand – Crypto News
-
Cryptocurrency1 week agoAstrology for traders – Blockworks – Crypto News
-
Technology1 week agoIs your iPhone obsolete? Apple adds 5 new devices to the ‘no repair’ list – Crypto News
-
Cryptocurrency1 week ago
Operation Choke Point: House Republicans Spotlight Biden Administration’s ‘Attack on Crypto’ – Crypto News
-
others1 week agoWTI declines below $59.50 as bearish outlook prevails – Crypto News
-
others1 week ago
Trump-Backed Alt5 Sigma Under Fire for Possible SEC Rule Violations, New Report Reveals – Crypto News
-
others1 week ago
Why Is Crypto Market Recovering? – Crypto News
-
Cryptocurrency1 week ago‘Get it done on time’ – Lawmakers push regulators on GENIUS Act rollout – Crypto News
-
Cryptocurrency1 week ago
Crypto Platform Polymarket Relaunches in U.S. Following CFTC Approval – Crypto News
-
Cryptocurrency7 days agoUK recognises crypto as property in major digital asset shift – Crypto News
-
Cryptocurrency4 days agoFlorida Appeals Court Revives $80M Bitcoin Theft – Crypto News
-
Technology3 days ago
Crypto Lawyer Bill Morgan Praises Ripple’s Multi-Chain Strategy as RLUSD Hits $1.1B – Crypto News
-
others1 week ago
Crypto Market Crash Erases Fed Rate Cut-Driven Bitcoin, ETH, XRP, SOL, ZEC Gains – Crypto News
-
Technology1 week ago
Sony Bank Joins Ripple, Circle to Launch USD-Pegged Stablecoin in the U.S. by 2026 – Crypto News
-
others1 week agoUSD/CNH hits lowest since October 2024 – BBH – Crypto News
-
Business1 week ago
Schiff Predicts ‘Beginning of the End’ for MSTR as Strategy Eases Bitcoin Sell-Off Fears With $1.44B Reserve – Crypto News
-
Business1 week ago
8 Best Crypto Exchanges That Accept PayPal Deposits and Withdrawals – Crypto News
-
others1 week ago
Strategy CEO Says Bitcoin Sales Unlikely Before 2029 After Creating $1.44B Dividend Reserves – Crypto News
-
Business1 week ago
Sui Price Surges 10% As Vanguard Group Adds SUI to Bitwise 10 Crypto Index – Crypto News
-
Cryptocurrency1 week agoRipple CTO Shares Hilarious Email from Jed McCaleb Impersonator – Crypto News
-
others1 week ago
$12T Charles Schwab to Launch Bitcoin and Ethereum Trading in Early 2026, CEO Confirms – Crypto News
-
Business1 week ago
Senator Tim Scott Floats December 17 and 18 For Crypto Market Bill Markup – Crypto News
-
Business1 week ago
Crypto Platform Polymarket Relaunches in U.S. Following CFTC Approval – Crypto News
-
Cryptocurrency7 days agoBTC staking platform Babylon teams up with Aave for Bitcoin-backed DeFi insurance – Crypto News
-
others4 days agoGold holds strong at $4,200 as Fed-cut anticipation builds – Crypto News
-
Cryptocurrency4 days agoCrypto Holiday Gift Guide 2025 – Crypto News
-
Blockchain2 days agoAnalyst Reveals What You Should Look Out For – Crypto News
-
others1 week agoAustralian Dollar loses momentum below 0.6550 on disappointing Chinese PMI – Crypto News
-
others1 week ago
Schiff Predicts ‘Beginning of the End’ for MSTR as Strategy Eases Bitcoin Sell-Off Fears With $1.44B Reserve – Crypto News
-
Blockchain1 week agoRipple Gets OK to Expand Payments Business in Singapore – Crypto News
-
Business1 week ago
XRP Price Prediction as Ripple Gets MAS Licence in Singapore – Crypto News
-
Technology1 week ago
Polymarket Rival Kalshi Moves On-Chain With Launch of Tokenized Prediction Markets on Solana – Crypto News
-
Technology1 week agoPrice drop on headphones and speakers in Amazon Mega Electronic Days sale: Top 10 deals with up to 75% off on top brands – Crypto News
-
Business1 week ago
Tom Lee Says Bitcoin Could Hit New ATH In January As Hassett Becomes Favorite For Fed Chair – Crypto News
-
others1 week agoEUR/GBP edges higher as Eurozone inflation supports Euro, BoE weighs – Crypto News
-
Cryptocurrency1 week agoSHIB Exec Calls FBI, RCMP and Interpol to Action on Surging Cyber Attacks – Crypto News
-
Business1 week ago
Trump Sets Early 2026 Timeline for New Fed Chair Pick – Crypto News
-
Cryptocurrency1 week agoVanguard reverses course, opens door to Bitcoin, Ethereum, XRP, and Solana ETFs – Crypto News
-
others1 week ago
XRP News: Ripple Expands Payments Service With RedotPay Integration – Crypto News
-
Blockchain7 days agoLedger Finds Chip Flaw Allowing Complete Phone Takeover – Crypto News
-
Business7 days ago
Kalshi, Robinhood and Crypto com Face Cease & Desist Order in Connecticut – Crypto News
-
Blockchain7 days agoSolana (SOL) Cools Off After Rally While Market Eyes a Resistance Break – Crypto News
