
‘We want to fix the language gap in AI language models’, says Two Platforms’ Pranav Mistry – Crypto News
Backed by billionaire Mukesh Ambani’s Jio Platforms and South Korea’s Naver Corp, the artificial reality startup also plans to soon release an artificial intelligence (AI)-powered messaging and social app called Zappy in India, according to Two Platforms’ founder and CEO, Pranav Mistry.
“Sutra is our mission to fix the language gap in AI language models. We are committed to pioneering AI solutions for non-English markets. We believe our Sutra models will unlock AI growth opportunities in large economies such as India, Korea, Japan, and the MEA (Middle East and Africa) region,” Mistry said in an interview with Mint.
But there are some basic differences “in our approach to building these models”, he insisted. First, unlike most other startups and companies that are building ‘local’ or ‘Indic’ LLMs for India by fine-tuning global LLMs, “we have built a foundational, and not a fine-tuned model,” he said.
General-purpose foundational models such as Google’s BERT and Gemini, OpenAI’s generative pre-trained transformer (GPT) variants, and Meta’s Llama series have been pre-trained on humungous amounts of data from the internet, books, media articles, and other sources. But most of this training data is in English.
A Transformational Approach
Most companies in India are building their Indic LLMs atop these foundational models (hence they’re called ‘wrappers’) by fine-tuning these general-purpose LLMs on a smaller, task-specific dataset (such as text in regional languages like Hindi, Marathi, Gujarati, Tamil, Telugu, and Malayalam, and their dialects), which allows the models to learn the nuances of the language and improves their performance.
Sutra, instead, uses two different transformer architectures. Developed by Google, transformers predict the next word in a sequence of text based on large, complex data sets. Since they process words in a single sequence while understanding their relationships with each other, transformers are very effective for tasks like translating languages.
The multilingual LLM Sutra, according to Mistry, combines an LLM architecture with a Neural Machine Translation (NMT) one. The reason: LLMs may struggle to translate specific language pairs for lack of specialized training data, whereas NMT systems are typically better equipped to translate idiomatic expressions and colloquial language.
Second, while “GPT-4 is great in Korean or Hindi, too, its size and cost make it more expensive for a country like India”, argued Mistry. The Sutra architecture “decouples concept learning (we learn concepts by associating new information with existing knowledge, such as learning that both apples and oranges are fruits) from language learning. So, when you use Sutra, the number of the tokens used are similar to using English tokens. This saves almost five to eight times in costs,” he explained.
Third, “our specialized NMT models are significantly smaller in parameter size, requiring much less data for training”, Mistry said. When you add more data in, say, Korean or an Indian language, you also increase the number of tokens (loosely, the words and sub-word pieces that an LLM works with; for example, banana is a single word, while homework can be split into two pieces, home and work). This makes the model bigger but also slows it down. It increases costs, too, since information that takes a given number of tokens in English would need three to four times as many when expressed in a language such as Hindi.
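The token arithmetic above can be made concrete with a small sketch. It is a toy illustration, not Sutra’s tokenizer or any production one: the vocabulary, the greedy longest-match rule, and the function names are assumptions made for this example, and the only figure taken from the article is the three-to-four-times multiplier for Hindi.

```python
# Toy illustration only: real LLM tokenizers learn their vocabulary from data.
TOY_VOCAB = {"banana", "home", "work"}  # hypothetical vocabulary


def tokenize(word: str, vocab: set[str]) -> list[str]:
    """Greedy longest-match split of a word into known pieces.

    Characters not covered by any vocabulary entry become
    single-character tokens.
    """
    tokens = []
    start = 0
    while start < len(word):
        # Try the longest candidate first, shrinking until a vocabulary hit.
        for end in range(len(word), start, -1):
            if word[start:end] in vocab:
                tokens.append(word[start:end])
                start = end
                break
        else:
            # No vocabulary entry matches here: fall back to one character.
            tokens.append(word[start])
            start += 1
    return tokens


def scaled_token_count(english_tokens: int, token_multiplier: float) -> float:
    """Tokens needed for the same content when a language needs
    `token_multiplier` times as many tokens as English."""
    return english_tokens * token_multiplier


print(tokenize("banana", TOY_VOCAB))    # ['banana']
print(tokenize("homework", TOY_VOCAB))  # ['home', 'work']

# Using the article's range of three to four times for Hindi:
print(scaled_token_count(1000, 3.0), scaled_token_count(1000, 4.0))  # 3000.0 4000.0
```

Because usage is typically billed and processed per token, a passage that costs 1,000 tokens in English would, under that multiplier, cost roughly 3,000 to 4,000 tokens in Hindi before any architectural savings.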
“Besides, in this approach, the quality of, say Hindi, can never surpass that of English in the original,” Mistry added. For instance, about 80% of a general-purpose foundational model’s pre-training data would typically come from sources such as the internet, books, and media articles, which are mostly in English.
Innovation, Not Fine-tuning
However, if you’re fine-tuning this model with data in Hindi from India, for instance, “most of the data would be about cricket, data found on Twitter, or from people discussing news articles, etc., in Hindi. Hence, a Hindi language model built atop a foundational model that has pre-trained mostly on English will not be able to do full justice to the output in Hindi”.
“As an example, if you want to translate Gujarati to Tamil, most models first translate from Gujarati to English and then from English to Tamil, because that’s the data they have trained on. Our model does not do that, so we also require fewer tokens, which also lowers the cost of running the model,” he explained. Mistry adds that Two Platforms’ model is also aligned to human values, a process technically known as ‘AI alignment’.
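The Gujarati-to-Tamil example can be sketched as a comparison between a pivot route through English and a direct route. This is only a bookkeeping illustration of why an extra hop means more tokens handled; it does not reflect how Sutra works internally, and the function names and per-language token counts are hypothetical placeholders, not measurements.

```python
from typing import Optional


def translation_hops(
    source: str, target: str, pivot: Optional[str] = None
) -> list[tuple[str, str]]:
    """Return the (from, to) steps needed to translate source into target."""
    if pivot is None or pivot in (source, target):
        return [(source, target)]
    return [(source, pivot), (pivot, target)]


def tokens_processed(
    hops: list[tuple[str, str]], tokens_per_language: dict[str, int]
) -> int:
    """Sum the tokens read and written across all hops."""
    return sum(
        tokens_per_language[src] + tokens_per_language[dst] for src, dst in hops
    )


# Hypothetical token counts for the same passage in each language.
counts = {"Gujarati": 350, "English": 100, "Tamil": 350}

pivot_route = translation_hops("Gujarati", "Tamil", pivot="English")
direct_route = translation_hops("Gujarati", "Tamil")

print(pivot_route)   # [('Gujarati', 'English'), ('English', 'Tamil')]
print(direct_route)  # [('Gujarati', 'Tamil')]
print(tokens_processed(pivot_route, counts))   # 900
print(tokens_processed(direct_route, counts))  # 700
```

With these placeholder numbers the pivot route handles more tokens simply because the English intermediate has to be written once and read once more.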
Sutra is currently available in three versions: Light (56 billion parameters), Online (an internet-connected multilingual model with 56 billion parameters), and Pro (150 billion parameters). It supports more than 50 languages, “of which 31 are fully tested”, according to Mistry. He emphasized that Sutra’s architecture and use of “synthetically translated data” not only lower the computing costs of running these models, but also make the model more efficient.
“Sutra maintains an impressive performance in English of 77% on the MMLU (massive multitask language understanding) benchmark. It also demonstrates superior and consistent performance in the range of 65-75% across languages. In contrast, many leading language models score closer to 25% on non-English MMLU tasks,” Mistry said.
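For context on these percentages: MMLU questions are multiple choice with four options each, so a model that guesses blindly is expected to score about 25%. The sketch below shows how such an accuracy figure is computed; the function names and the four-question answer key are invented for illustration and are not taken from the benchmark or from Two Platforms.

```python
def accuracy(predictions: list[str], answers: list[str]) -> float:
    """Fraction of questions where the predicted option matches the key."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("at least one question is required")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


def chance_accuracy(num_choices: int = 4) -> float:
    """Expected accuracy of uniform random guessing."""
    return 1 / num_choices


# Hypothetical answer key with each option correct exactly once.
answer_key = ["A", "C", "B", "D"]
always_a = ["A"] * len(answer_key)  # a "model" that always answers A

print(accuracy(always_a, answer_key))  # 0.25
print(chance_accuracy())               # 0.25
```

Read this way, a score near 25% on non-English MMLU tasks is close to what guessing would produce, while scores of 65-75% are well above that baseline.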
Two Platforms uses its “in-house GPU (graphics processing unit) cluster and rents top-tier cloud GPUs when needed”. “As we expand, the rising costs of training will require us to create specialized models for different areas like images and video,” Mistry added. His company is also in the process of raising a Series A round “to accelerate the development of Sutra into a model-as-a-service (MaaS)” platform. In February 2022, Jio Platforms had invested $15 million in Two Platforms for a 25% equity stake, while a Naver Corp unit, Snow Corp., had invested $5 million.
Other than Sutra, India has Sarvam AI, a generative AI (GenAI) startup that has launched the OpenHathi series; Tech Mahindra’s Indus Project; the ‘Hanooman’ model that was jointly released this month by SML India and 3AI Holding, an Abu Dhabi-based investment firm; CoRover’s BharatGPT LLM-based chatbot; and Ola Cabs and Ola Electric co-founder Bhavish Aggarwal’s Krutrim AI. Meanwhile, the ‘Nilekani Center at AI4Bharat’ at IIT Madras, too, released ‘Airavata’, an open-source LLM for Indian languages.
In a wider context, the LLM market is projected to grow from $6.4 billion in 2024 to $36.1 billion by 2030, according to a research report released by MarketsandMarkets in March. Moreover, India-specific LLMs are certainly the need of the hour but “we need faster, more affordable, multilingual, and energy-efficient LLMs that can bridge the existing market gaps”, concluded Mistry, who hopes Sutra will be one of those models that “fills this gap”.