Sign in to view Divay’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Divay’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Meesho
National Institute of Technology Silchar
Bengaluru, Karnataka, India
Sign in to view Divay’s full profile
Divay can introduce you to 10+ people at Meesho
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
2K followers
500+ connections
View mutual connections with Divay
Divay can introduce you to 10+ people at Meesho
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Divay’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Divay
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Divay’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
About
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
Activity
2K followers
-
Divay Jindal posted thisExcited to share that our paper “TrendPulse: A Simple yet Efficient Framework for Capturing Viral E-Commerce Spikes via LLM-Driven Contextualization” has been accepted at ACL 2026 (Industry Track) TrendPulse addresses the challenge of capturing short-lived demand spikes by detecting regional search momentum, transforming it into semantic trends using LLMs, and powering personalized recommendations via attention framework. This would not have been possible without an incredible team. Thanks to Devashish Gupta Arin Jain Bhavuk Singhal Also thankful for the support and guidance from Vinit Rongata Ravindra Yadav Debdoot Mukherjee Looking forward to presenting at ACL! #ACL2026 #MachineLearning #NLP #RecommenderSystems #LLM
-
Divay Jindal reposted thisDivay Jindal reposted thisHow do you personalize for users you know nothing about? At Meesho, we tackled the cold start problem head-on: blending demographics and early-session signals to deliver relevant recommendations from the very first scroll. Curious how we approached the cold start problem? 👉 https://lnkd.in/gfmg3b_b #Personalization #Ecommerce #MeeshoTech #MLForBharat Vinit Rongata Debdoot Mukherjee Ravindra Yadav Divay Jindal Devashish GuptaPersonalization from Day One: Solving the Cold Start Problem at MeeshoPersonalization from Day One: Solving the Cold Start Problem at Meesho
-
Divay Jindal shared thisGreat answerWhat skills do I need to be a data scientist at Google or Facebook?What skills do I need to be a data scientist at Google or Facebook?
-
Divay Jindal shared thisAwesome glossary. Have a look
-
Divay Jindal liked thisDivay Jindal liked thisAt the start of this year, I gave myself a challenge: write more regularly. Since I work at a payments company, I figured this was the most honest thing I could write about. Something I actually know. Or thought I knew. Thirteen weeks later, I realize how much I didn't. The comments, the DMs, the messages from people who work in banking, policy, and financial inclusion — they've taught me more than the research did. Someone pointed out last week that Ramesh, the migrant worker I wrote about in the KYC piece, could just open a Jan Dhan account in his home village. They were right. And that conversation pushed me to think harder about what "access" actually means when you're working six days a week in a different state. Two things hit harder than I expected when I actually sat down to write them out. The economics. I knew UPI was free. I hadn't thought long enough about what that actually means. The government subsidizes some of it. Banks absorb the rest. Nobody has made a permanent commitment. The most widely used payment system in the world's most populous country runs on a funding arrangement that gets renegotiated periodically. And KYC. I'd always filed it under compliance. Thirteen weeks of building toward it made me realize it's probably the single most consequential design decision in the entire stack. Everything else assumes the user already made it through the gate. For hundreds of millions of people, the gate is where the journey ends. The bigger realization though: none of these are isolated problems. Each layer of the system exists because the layer below it created a new problem. Money needs clearing. Clearing creates a single point of failure. Fixing that costs money. Free UPI creates fragile economics. And so on. Arc 1 is done. Thirteen weeks, one thread. Arc 2 starts next Tuesday. We go back to the early 2000s — before UPI, before smartphones, before any of this existed — and ask how it all got built. #HowMoneyMoves #Resolution
-
Divay Jindal liked thisDivay Jindal liked thisMost people use AI like a tool. Smart builders turn it into a system. That’s where 𝐒𝐊𝐈𝐋𝐋.𝐦𝐝 changes the game. It’s not just prompting. It’s about creating reusable AI capabilities 👇 𝐖𝐡𝐚𝐭 𝐢𝐬 𝐒𝐊𝐈𝐋𝐋.𝐦𝐝? → A modular way to package AI tasks → Trigger-based execution → Works across tools like Cursor, Claude, Copilot 𝐖𝐡𝐲 𝐢𝐭 𝐦𝐚𝐭𝐭𝐞𝐫𝐬: • Stop rewriting prompts again and again • Build reusable workflows • Standardize AI usage across teams • Scale AI like software 𝐖𝐡𝐚𝐭 𝐠𝐨𝐞𝐬 𝐢𝐧𝐬𝐢𝐝𝐞 𝐚 𝐒𝐊𝐈𝐋𝐋.𝐦𝐝? • Clear description (trigger logic) • Structured instructions • Constraints & rules • Tool integrations • Output format 𝐇𝐨𝐰 𝐢𝐭 𝐰𝐨𝐫𝐤𝐬: Input → Trigger match → Load SKILL.md → Execute instructions → Generate output 𝐁𝐞𝐬𝐭 𝐩𝐫𝐚𝐜𝐭𝐢𝐜𝐞𝐬: • Write as triggers, not prompts • Keep it modular • Add constraints for reliability • Avoid unnecessary complexity 𝐑𝐞𝐚𝐥 𝐢𝐦𝐩𝐚𝐜𝐭: Faster execution Consistent outputs Scalable AI workflows Most people are still prompting manually. The shift is happening towards 👉 𝐩𝐫𝐨𝐠𝐫𝐚𝐦𝐦𝐚𝐛𝐥𝐞 𝐀𝐈 𝐬𝐲𝐬𝐭𝐞𝐦𝐬 P.S. Are you still writing prompts… or building reusable AI skills? Repost if this helped ♻️ —----- Looking for Gen AI Training or Automating your business processes using Gen AI? DM or comment "GenAI" to connect for 𝐅𝐑𝐄𝐄 Consultation Follow Manas Dasgupta for insights on Gen AI Automation
-
Divay Jindal liked thisDivay Jindal liked thisHaving worked with multimodal models for several years, one things I've faced far too often is that most multimodal models quietly throw away the signal they later struggle to recover. The issue is structural. Vision encoders are trained to produce compact features that align with text, not to preserve full scene information. That works for retrieval and captioning. It breaks for dense reasoning and generation. Once spatial detail is compressed, the rest of the system is reconstructing from a lossy prior. This new paper from MetaAI, Tuna-2, removes that failure point. It treats images as sequences of pixel patches and feeds them directly into a single transformer alongside text tokens. There is no CLIP style encoder, no VAE, no latent bottleneck. The same model handles perception and generation, operating on a shared token space. The implementation is heavier than it sounds. Pixel patches are embedded and interleaved with language tokens in long sequences. Training mixes objectives like next token prediction for text and pixel reconstruction or prediction for images. There is no stage where the model switches representations. Everything is optimized jointly. That forces the model to maintain fine grained visual structure while learning language alignment, instead of trading one off for the other. Whats impressive is that this holds up at scale. Encoder based systems usually win on efficiency and stability. Here, despite slower convergence and higher compute, the model surpasses them on fine grained perception and produces more consistent generations. That suggests the information loss from encoders is a real ceiling, not just an implementation detail. Compared to standard stacks, this is a different tradeoff. Less modular, harder to train, but cleaner in terms of information flow. You are not stitching together separately optimized components. My read is this meaningfully improves tasks that depend on exact grounding, where current models fail in subtle but important ways. If this direction continues, multimodal architectures simplify. The open question shifts from how to align modalities to how to efficiently train large models directly on raw sensory data. Always a pleasure to read papers from Lukes group!
-
Divay Jindal liked thisDivay Jindal liked thisRevolutionizing E-commerce Search: When LLMs Meet Hierarchical Taxonomies Researchers at eBay have developed a fascinating Chain-of-Thought approach that's transforming how e-commerce platforms categorize user queries. Instead of relying solely on traditional behavioral signals like click-through rates, this method leverages the semantic understanding of Large Language Models. How it works under the hood: The system treats query categorization as a tree traversal problem. Starting from the root of a product taxonomy, it uses an LLM to score the semantic relevance of each category branch on a 1-10 scale. The algorithm then applies dynamic thresholding- mapping scores to a standard normal distribution and pruning categories that fall below configurable selection and minimum thresholds. What makes this approach particularly clever is its breadth-first search strategy. At each taxonomy level, it only explores the most semantically promising branches, visiting just 1.7% to 24.8% of total category nodes while maintaining high accuracy. Key technical innovations: - Context-aware scoring that incorporates user intent (buying vs. browsing vs. seeking accessories) - Relative thresholding that adapts to score distributions at each node - Scalable variants using k-NN embedding filters for high-volume deployment - Built-in taxonomy diagnostics that identify structural gaps The results are impressive: Testing with models like Mixtral-8x7B showed 89.8% improvement in F1 scores over embedding-based baselines, with significant gains in both precision and recall across human judgment datasets. Beyond performance gains, this approach offers valuable insights for taxonomy optimization- automatically flagging categories like "Designer Sunglasses" that may be buried too deep in existing hierarchies. This research demonstrates how modern NLP can enhance traditional e-commerce systems while providing interpretable, actionable insights for platform optimization.
-
Divay Jindal liked thisDivay Jindal liked thisOne journey ends. Another begins. Mixed feelings, as always. As umami (formerly whatsyum) shuts operations, I'm excited to join Sahi as Head of AI. In just a couple of weeks, I already feel at home. The energy is real. The love for product and customer obsession shows up in every conversation. The pace of execution is surreal. The bar for talent is high. AI is a way of life. Grateful to Dale Vaz and Manish Jain for trusting me with what is perhaps one of the most transformational roles at Sahi. Lets build something great 🚀
-
Divay Jindal liked thisDivay Jindal liked thisIf Claude feels wrong. You're probably the problem. I've been using Claude Code daily for months now. One thing became obvious fast: When Claude gives you inconsistent results it's not the model, it's mostly the setup. Most people install it, type a prompt, and get frustrated. They never touch the folder that controls everything. The .claude/ directory is where Claude learns how to behave in your project: memory, automation, safety, and delegation. Here's what I configured that changed everything: 𝗖𝗟𝗔𝗨𝗗𝗘. 𝗺𝗱 Stop leaving it empty. Define your stack, architecture, and conventions. Claude reads it first every session. Treat it like onboarding docs for a new engineer. 𝗖𝗟𝗔𝗨𝗗𝗘.𝗹𝗼𝗰𝗮𝗹. 𝗺𝗱 Keep your personal preferences out of the shared project file. Your workflow shouldn't break your teammates'. 𝗺𝗰𝗽.𝗷𝘀𝗼𝗻 Configure it once. GitHub, JIRA, Slack, and databases. All connected, version-controlled. No more re-explaining your tooling every session. .𝗰𝗹𝗮𝘂𝗱𝗲/𝘀𝗲𝘁𝘁𝗶𝗻𝗴𝘀.𝗷𝘀𝗼𝗻 PreToolUse and PostToolUse guardrails that run automatically. Validation, linting, blocking unsafe operations — caught before they ship. .𝗰𝗹𝗮𝘂𝗱𝗲/𝗰𝗼𝗺𝗺𝗮𝗻𝗱𝘀/ Slash commands for recurring workflows. One keystroke runs your entire review or deploy process. No more copy-pasting the same prompt. .𝗰𝗹𝗮𝘂𝗱𝗲/𝘀𝗸𝗶𝗹𝗹𝘀/ Reusable workflows that load only when needed. Testing patterns, deploy checklists, API conventions. Auto-triggered by context or invoked manually. .𝗰𝗹𝗮𝘂𝗱𝗲/𝗮𝗴𝗲𝗻𝘁𝘀/ Specialized sub-agents with isolated context. Code review, security, and docs. Each has its own scope, instead of a single overloaded conversation. The simple test: If you keep repeating yourself to Claude, that instruction belongs in your setup. What's in your .claude/ folder right now? We could help you fix that! Check this: www.gentes.ai --- Kudos to Brij kishore Pandey for the amazing infographic.
-
Divay Jindal liked thisDivay Jindal liked thisModel size was never the problem. Memory was. And someone from China 🇨🇳 just shrunk it by 93%. 🎊 Anyone who's tried to actually use a long context window in production knows the real bottleneck. It's not the model. It's the KV cache. Every token your model processes stores key-value pairs in memory. As context length grows, that memory footprint grows with it. At a million tokens, the KV cache alone can consume more GPU memory than the model weights. That's why "supports 1M context" has been a spec sheet number, not a production reality. 🔧 DeepSeek V4 just made it real. Their hybrid attention architecture — Compressed Sparse Attention interleaved with Heavily Compressed Attention — fundamentally changes how tokens track long-range dependencies. 📊 The numbers: 10% of the KV cache footprint. 27% of the inference FLOPs. Same million-token context window. Flash variant: just 7% of the original cache size. 💰 The pricing reflects the efficiency: $0.14 per million input tokens for Flash. That's roughly 100x cheaper than comparable frontier APIs. 📌 The signal is clear. The next wave of inference cost reduction isn't coming from better models. It's coming from better memory architectures.
Experience & Education
-
Meesho
********* **** *********
-
**********
**** ********* **
-
*** ****** ******
******* ******** **********
-
******** ********* ** ********** *******
********** ****** ******** ******* undefined
-
-
*** ****** ******
****** ********* ********* undefined *****
-
View Divay’s full experience
See their title, tenure and more.
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
or
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Publications
-
Basic Computer Skills
-
It is a 250 pages book covering all the basics skills needed to be taught to people such as Microsoft Office , basic computer architecture , introduction data entry tools and many more useful tools aiming at the skill development of the people in need so that could generate employment for themselves . This is approved under "Kaushal Vikas Yojna".
Other authors -
Courses
-
Convolutional Neural Networks for Visual Recognition
CS231n
-
Linear Algebra
18.06 Gilbert Strang
-
Machine Learning
Abu Mostafa
-
Machine Learning
Andrew Ng
-
Probability And Statistics
STAT 110
-
RL Course by David Silver
Reinforcement learning
-
Statistical Rethinking
Bayesian Machine Learning
Organizations
-
GYANSAGAR
Co-Founder
- PresentGyansagar is a Non Profit Organisation which aims at the skill development of the needy people. We teach English,Mathematics and Science to about 200 students living in the villages near to the campus of NIT Silchar. We conduct computer classes aiming at developing basic computer skill amongst the students .
Websites
- Portfolio
-
https://divayjindal95.github.io
Other similar profiles
Explore more posts
-
Parag Patil
Helpshift • 5K followers
Sarvam AI is outperforming large global LLMs on India-specific use cases. Across real-world India-focused tasks, Sarvam AI (fine-tuned for India) is delivering stronger performance than large general-purpose models — proving that targeted fine-tuning beats sheer parameter size. 🏆 Where Sarvam AI Outperformed: ✅ Vernacular Intent Detection (Hindi + regional mix) More accurate understanding of Hinglish, code-mixed queries, and regional phrasing. ✅ Regulatory & Compliance Q&A (India domain) Higher precision in interpreting India-specific policies and terminology. ✅ Localized Customer Support Responses More culturally aligned tone with fewer generic outputs. ✅ Cultural Context & Idiomatic Understanding Better handling of local expressions and socio-cultural nuances. Why This Is Happening Most global LLMs like those from OpenAI and Google (Gemini) are trained predominantly on large-scale English and Western-centric datasets. India’s linguistic landscape is fundamentally different: • 22+ official languages • Heavy code-mixing (Hinglish, Tanglish, etc.) • Deep cultural and regulatory context • Region-specific communication styles This gap is one reason why adoption of generic global LLMs in India can be slower for core business workflows. India needs models built for India’s data, languages, and users. Task-specific fine-tuning on curated India-centric datasets enables: • Higher accuracy • Lower latency • Better cost efficiency • Stronger real-world alignment 🧠 Key Insight: For country-specific and domain-heavy applications, alignment > size. Kudos to Sarvam for leading India’s LLM movement in the right direction. #AI #IndiaAI #LLM #MachineLearning #FineTuning #GenAI #DigitalIndia
20
-
Jalaj Garg
HyperVerge • 9K followers
🚀 India just quietly triggered one of the biggest tailwinds for AI Product Careers - and many PMs might not have noticed yet. In the last two weeks, three announcements landed that will directly accelerate AI product building in India: 1) ₹1 Lakh Crore DeepTech RDI Fund (Govt. of India) A national ₹1,00,000 crore innovation fund will support AI, robotics, quantum & advanced tech - including risk capital at early stages. This means a new wave of AI-first products will get funded, and they’ll need PMs who can convert technology into usable customer value. 2) Karnataka Startup Policy 2025–2030 (₹518 Cr for 25,000 Startups) The policy aims to support 25,000 tech startups (10,000 outside Bengaluru) with priority on AI, blockchain, space tech, and quantum. This signals more 0→1 product roles across India -not just in Bangalore. 3) Nvidia joins India Deep Tech Alliance Backers now include Nvidia + venture partners to catalyse Indian deep-tech ecosystems with not just capital but also computational and GTM support. 💡 What this means: India is about to over-fund AI ideas and under-supply PMs who can turn those ideas into real products. If you’re early in your career, this is the moment to: pick a problem space you care about understand where AI creates 10× value over status quo build & ship velocity instead of perfection The next generation of strong PMs won’t be defined by: “how well they execute Jira” but “how well they convert tech → customer value → revenue” That window just opened. #AIProductManagement #Productmanagementcareers #AI #DeepTech
42
3 Comments -
Rohit Anand
Bloomreach • 6K followers
📌 Why Transformers Need Positional Embeddings — And Why Sine & Cosine? Transformers revolutionized NLP by removing recurrence and convolutions — but in doing so, they also lost any natural sense of word order. So, how does a Transformer know that “The dog chased the cat” is different from “The cat chased the dog”? 👉 Positional Embeddings. But here’s the interesting part: 🌀 The original Transformer model uses sine and cosine functions (not learned vectors) to encode token positions. Why? 🔍 Because sine and cosine: 1. Are smooth and differentiable (great for optimization), 2. Work across multiple frequencies to capture both local and global position, 3. Naturally allow the model to compute relative positions using trigonometric identities, 4. Are fixed (no training needed) and generalize well to longer sequences, 5. Stay bounded (values between -1 and 1), which stabilizes learning. ✅ Most importantly: They give the model a way to “feel” the distance between words, not just where they are. It’s one of those elegant engineering decisions — simple math, massive impact. 💡 Curious? Try plotting the sinusoidal embeddings across dimensions Image source: https://lnkd.in/ggZJgKMt #MachineLearning #DeepLearning #Transformers #NLP #PositionalEmbeddings #AI #MLCommunity #GenAI
36
7 Comments -
Mohammed Arsalan
IBM • 23K followers
Sarvam-M: A 24B Parameter Multilingual AI Model is finetuned Mistral-Small with Indic language desi nuances covering 11 major languages from india 🇮🇳 Key Features: • Dual-mode interface: Quick "non-think" responses + detailed "think" mode for complex reasoning 🧠 • Strong performance on math (GSM-8K) and coding (SWE-Bench) benchmarks 📊 • Supports both native Indic scripts and romanized text ✍️ • Built for real-world multilingual conversational agents 💬 I think it's a good start , instead of training model from scratch Sarvam focused on building indic dataset and model like in video shows good translingual conversational capability . Try it out via the Colab link - https://lnkd.in/gzcW4hru Playground - https://lnkd.in/gXijZrkY
27
1 Comment -
Ali Faraahani
Tabdeal | صرافی ارز دیجیتال… • 4K followers
Perceived Performance is more powerful than Real Performance. We usually obsess over milliseconds, backend benchmarks, load timings, database latency… But users don’t measure speed, they feel it. That’s why skeletons outperform spinners, optimistic UI feels instant, and motion creates the illusion of progress. Real speed wins engineers. Perceived speed wins users. In crypto & AI products especially, uncertainty kills trust faster than latency ever will. Design for perception, not just for numbers. Here is how: 1️⃣ Skeletons reduce uncertainty 2️⃣ Optimistic UI removes visual waiting 3️⃣ First-paint prioritizes what matters 4️⃣ Motion makes time feel shorter And yes each technique has exceptions. Good UX isn’t following patterns blindly. It’s choosing the right pattern for the right moment. Curious to hear how you design for perception. 👇 #perceivedperformance #uxdesign #productdesign #userexperience #designleadership #uidesign #productmanagement #cryptoapps #aiux #designthinking #uxstrategy #productstrategy #humanpsychology #usability
10
-
Bon Osonwanne
Osonwanne Group • 1K followers
Your 10x Employee Is Here — And We Run It For You. Some operations run on thin margins. Others run on zero-error tolerance. Both are getting crushed by the same problem: repetitive work that doesn't scale. The pattern is everywhere: → Teams manually screening hundreds of candidates. Weeks of delay. Humans drowning in admin they hate. → Critical processes that can't go down. One hiccup and the consequences cascade. → Back-office tasks multiplying faster than headcount ever could. The solution isn't more hiring. It's autonomous AI agents that actually do work — not chatbots that talk, but agents that call, screen, validate, and escalate. What we deliver: — Pre-screening agents that qualify candidates before your team ever touches a resume. Cuts weeks to days. — 24/7 outreach automation that reaches thousands, not hundreds. Customized responses at scale. — Infrastructure monitoring with instant escalation. No more "we'll check on Monday." — Managed service model — we host, maintain, and optimize on your existing infrastructure. You focus on your business, not the tech. — Human-in-the-loop for the exceptions that matter. AI handles volume. Humans handle judgment. The model: Implementation fee to get started. Monthly service to keep running. One enhanced employee delivering 10x the output — supervised by your team, powered by our agents. We handle the tech. You focus on your mission. We both win. If you're drowning in back-office tasks, compliance overhead, or scaling challenges — and you want to explore what autonomous AI could actually do for your margins — Let's talk. No pitch decks. Just a conversation about pain points and fit. Link in comments for discovery call. #AIAgents #SMB #Recruiting #LegalTech #BusinessAutomation #HRTech #ManagedServices #ScaleYourBusiness #PropertyManagement #Compliance
3
2 Comments -
VIJAY KUMAR ACHARY G
Cognizant • 169 followers
🚀 RAG Model -Built a Q&A Bot on IPL 2025 Schedule using LangChain, Chroma & Groq 🏏 Recently, I explored how to combine LangChain + Chroma + Groq LLMs to create a simple Question Answering system from PDFs. I used the official IPL 2025 schedule PDF 📄 and made the model answer queries like venue details, match dates, etc. 🔧 Tech Stack & Flow 1️⃣ Mounted Google Drive in Colab & installed required libraries (LangChain, Chroma, HuggingFace, Groq, Unstructured). 2️⃣ Downloaded the IPL schedule PDF and extracted raw text using UnstructuredFileIOLoader. 3️⃣ Split the text into chunks with CharacterTextSplitter. 4️⃣ Created embeddings with HuggingFace & stored them in ChromaDB for retrieval. 5️⃣ Connected Groq LLM (LLaMA-3.3 70B) to process queries via RetrievalQA. 6️⃣ Asked the bot: "Tell me all venue details" → Got structured answers directly from the PDF. 🎯 📌 Key Takeaway: This workflow shows how unstructured documents (like PDFs) → embeddings → vector DB → LLM retrieval can be turned into powerful search & QnA systems. 💡 Next Steps: I’m planning to extend this for multi-PDF RAG pipelines & build a small chatbot UI on top of it. 👉 If you’re working on similar GenAI + RAG projects, would love to connect & exchange ideas! #LangChain #Groq #Chroma #LLM #VectorDB #RetrievalQA #GenAI #IPL2025
16
-
Siddarth Jain
DrDroid • 22K followers
Supabase is blocked across multiple Indian ISPs. Here's what actually happened and why it matters: There is speculation[1] that Govt of India might have used Section 69A of the IT Act — the same legal mechanism used to block TikTok, PUBG, and WeChat — to issue a blocking order. Atleast 3 ISPs across India have cut access to Supabase domains.[2] The reason hasn't been made public yet. But here's what's likely: Section 69A blocks target specific domains/apps for sovereignty concerns, security reasons, or obscene content. My read is that certain apps or websites built on Supabase infrastructure may have been flagged — and the block hit the platform itself. This is unprecedented though. Previous blocks targeted consumer apps: → TikTok, WeChat, PUBG — Chinese-linked apps blocked for security reasons → MoodXVIP, Ullu, ALTT — blocked for obscene content Supabase is neither. It's backend infrastructure. Thousands of Indian startups and developers use it as their database and auth layer. If your production stack runs on Supabase and you serve Indian users — you're affected right now and your engineers have probably already implemented one of the workarounds recommended by Supabase. The bigger question for engineers: how quickly will you be able to bring things back up if a core infrastructure dependency gets blocked overnight? This is a reminder call for folks on infra dependency risk.! Edit: [1] [2] The speculation mainly comes from a tweet (which is now deleted) from official supabase handle on X -- https://lnkd.in/gdP-VbmQ
316
42 Comments -
Amit Pandey
Augmen.io • 4K followers
This story is less about a 'lie' and more about a crucial distinction in AI capabilities. The initial claim that GPT-5 had solved unsolved math problems was a 'dramatic misrepresentation.' The reality? The model proved to be an incredibly powerful tool for navigating and connecting vast amounts of scientific literature—a significant feat in itself. As mathematician Terence Tao noted, this highlights AI's immense utility as a research assistant, even if independent discovery remains on the horizon. This 'assistant' role might be the most transformative application of AI in the short term. https://lnkd.in/dndidnMY
5
1 Comment -
Shrenik Shah
Protean eGov Technologies • 12K followers
100K active users in just 7 days + 12K paid conversions in week 1. That's NOT normal for a fresh AI platform — especially one building sovereign tech in India. Atomesus AI launched Jan 12, 2026 and the numbers already scream product-market fit. Here's what stands out for builders & founders 👇 🔥 Unlimited free tier (chats + image gen) hooks mass adoption fast 🔥 ~12% paid conversion right out of the gate — rare in AI land (most wrappers stay under 5% even months in) 🔥 Android live now, iOS coming soon, daily pushes from an ISRO-rooted team 🔥 Sovereign roadmap: hybrid today, full proprietary model ahead → real data-in-India + better Indic language handling This proves a few things we obsess over in 2026: → Generous free access + natural upsell beats forced subscriptions → Local talent + sovereignty focus builds actual moats in emerging markets → Iteration velocity > starting with the biggest model As someone tracking India's deep-tech moves, this milestone hits different because the monetization speed is what VCs chase in AI-native bets right now. Have you tried the app yet? → www.atomesus.com or search "Atomesus" on Play Store What's the real unlock for Indian AI teams? -Beating globals on cost + speed? -Sovereign data edges in a world of export controls? -Or hitting 10%+ paid conversion this early? Curious what devs, founders, and product folks think — drop your honest take below. Tag someone grinding on Indic AI or sovereign infra. Let's talk execution realities. 🚀 #IndiaAI #SovereignAI #DeepTech #AIStartups #ProductLedGrowth #MakeInIndia #AtomesusAI
10
-
Zubin Kutar
AI Fluency Labs • 32K followers
Twin brothers from India built an AI platform that achieved $15M in ARR in just 90 days. Is this the Shopify-for-apps moment? Here’s is the full story Emergent, founded by twin brothers Mukund (ex-Dunzo CTO) and Madhav Jha (ex-Dropbox/Amazon infra), just raised $23M Series A led by Lightspeed (total $30M to date). Their pitch? A multi-agent ‘vibe coding’ platform that turns plain English prompts into full-stack apps: frontend, backend, auth, payments, deployment, all automated. And the traction is wild: 👉 YC copy in mid-2024: 700k users, $10M ARR in 2 months. 👉 By Sept 2025: over 1M people building 2M apps and $15M ARR in 90 days. Why does it matter? Because Emergent isn’t just “no-code.” It aims to be the Shopify for apps, where anyone can generate production-ready tools, deploy them instantly, and eventually monetise them. Think jewellery chains spinning up AI-pricing apps, SMBs ditching spreadsheets for onboarding tools, even healthcare patients building pain-management solutions. But here’s the catch 👇 Retention > creation. Generating millions of apps is flashy. Keeping them live, secure, and monetised is harder. Discovery and billing flows are still in their early stages. Quality & compliance. Building real apps means adhering to PCI compliance, implementing secure authentication, and maintaining safe data practices. Emergent claims in-house testing/security agents, but independent audits will matter. Competition is fierce. Replit, Builder(dot)ai, Rocket(dot)New, Cursor: Everyone is chasing “apps from text.” Emergent’s moat must be infrastructure, reliability, and a marketplace. Unit economics. Hosting and runtime LLM costs scale with the number of active users. Using cheaper models and caching helps, but the test is focused on gross margin at scale. The opportunity: The low-code/AI app market is forecast to hit hundreds of billions by 2030+. If Emergent nails reliability, discovery, and monetisation, it could be the default platform SMBs use to generate and sell apps in days, not months. Question for you: If you could build a full-stack app from a single prompt, what would you build first? ➕ Save this post and follow Zubin Kutar ⚡ to Learn & Grow 10x with AI
189
22 Comments -
Vinit K. Singhh
Troopod • 4K followers
Why Most Indian D2C Experiments Lie Text Thread 🫠 Indian D2C teams run a lot of experiments. Most of them don’t teach anything. Not because experimentation is wrong. But because the system is noisy. A few quiet truths: Traffic is inconsistent. Devices are slow. Networks fluctuate. User intent shifts minute to minute. So when conversion moves, teams celebrate or panic. Even when nothing meaningful changed. The problem isn’t lack of data. It’s lack of signal. Real learning happens when: • Experiments run long enough to settle • Variants don’t break edge cases • Failures are detected early, not post-launch • Teams know why something worked, not just that it did In India, small frictions amplify fast. So false positives are expensive. The best D2C teams won’t run more experiments. They’ll run cleaner ones. They’ll reduce noise before chasing uplift. They’ll trust outcomes only after systems are stable. In chaotic markets, clarity compounds faster than speed. #IndianD2C #CRO #Experimentation #Troopod #AICRO
15
9 Comments -
Anshuman Jha
Aon • 24K followers
H-1B Shockwave: Impact on Indian AI Startups The $100k H-1B fee presents a tough choice for Indian AI startups with global ambitions. While challenging, it might just be the push needed to double down on local talent and innovation, keeping brilliant minds home to build the next wave of Indian tech unicorns. #AIStartups #H1B
5
-
Himanshu Vishwakarma
Covasant • 4K followers
Can India Become a Net AI Exporter? 🇮🇳 India's economic story has long been shaped by dependency. The colonial era marked the beginning of a painful transition — from a thriving exporter to a net importer of British goods. Decades of wealth were drained, industries hollowed out, and self-sufficiency eroded. 75 years after Independence, we are still largely a net importer of goods and services. The pattern persists — different era, different products, same dynamic. But history doesn't have to repeat itself. The AI revolution presents India with a rare and defining window — one that doesn't come twice. For the first time, the rules of a technological order are still being written. Infrastructure, talent, and scale are not yet locked in by legacy players. India has the engineers. India has the data diversity. India has the hunger. The question is no longer can India build world-class AI — it's whether we will move fast enough to become the ones the world depends on, rather than the other way around. A nation that exports AI models, platforms, and solutions doesn't just grow its economy — it reclaims its agency on the global stage. The colonial chapter was written for us. This one, we write ourselves. It's time for India to stop importing the future — and start exporting it. #AIImpactSummit2026 #AIInIndia #BharatMandapam #FutureOfAI #TechSummit #ArtificialIntelligence #DigitalIndia #AIEcosystem #StartupIndia #GlobalAI #Infosys #TCS #PMO
1
-
Shubham Vedi
Build Fast with AI • 3K followers
An Indian startup just open-sourced a 105B model that outperforms DeepSeek R1 on math and agentic benchmarks. Trained entirely in India. On Indian government GPUs. With $53M in funding. Not $5B. And the weights are live on Hugging face right now: https://lnkd.in/djPhwvEB Here's why Sarvam 105B deserves your attention: The benchmarks that matter: → 88.3 on AIME 2025 (DeepSeek R1: 87.5) → 96.7 on AIME with tools → 49.5 on BrowseComp (DeepSeek R1: 3.2 — not a typo) → 68.3 on Tau2 agentic benchmark (beats o4-mini, DeepSeek R1, and Claude Sonnet 4) → 98.6 on Math500 These aren't cherry-picked comparisons against small models. They're head-to-head with DeepSeek R1, Gemini 2.5 Flash, o4-mini, and Claude Sonnet 4. The backstory makes this even more impressive. Last year, Sarvam caught heavy criticism for fine-tuning Mistral's model and calling it sovereign AI. Downloads were underwhelming. People questioned whether India could build foundational models at all. So they went back and trained from scratch. 12 trillion tokens for the 105B. 16 trillion for the 30B. 128 experts in a Mixture-of-Experts architecture. Custom tokenizer, custom kernels, custom RL pipeline. They threw out KL-divergence regularization entirely — broke from the textbook approach and it worked. The 30B model is equally wild — only 2.4B active parameters. It matches models 10x its active size on coding and reasoning benchmarks. This is what efficient scaling actually looks like. What makes this a signal, not just a model drop: ✅ Trained on IndiaAI Mission compute — sovereign infrastructure, not rented from US hyperscalers ✅ Supports all 22 official Indian languages natively ✅ Already in production — Samvaad (conversational agents) and Indus (reasoning/agentic workflows) ✅ Open-sourced under Apache License on Hugging face ✅ Demoed running on a feature phone with physical buttons That last one hit differently. A 105B-parameter model. On a dumbphone. The AI race isn't just US vs China anymore. A team of ~114 people in Bengaluru just shipped a globally competitive reasoning model on government GPUs, at a fraction of frontier lab costs. And they gave the weights away for free. Download it. Fine-tune it. Ship something with it. Hugging face link: https://lnkd.in/djPhwvEB What's your take — can sovereign AI models compete long-term with frontier labs? 👇 🔄 Repost if your network needs to see what India is building. #AI #OpenSource #SarvamAI #indi
189
4 Comments -
Ruben Kostandyan
Amazon • 1K followers
Last few days have been running ForgeCAD CAD-as-code system as a benchmark on most major AI models with same prompt to build an AC system. The results were somewhat surprising. While many new models compete on existing benchmarks and show results matching or beating the strongest models, here the results were less saturated. Only a few models continuously performed good. Opus 4.5, Opus 4.6, GPT-5.2-Codex, GPT-5.3-Codex, and surprisingly Gemini-3-Flash (for not hard models or low complexity changes). Most other models I tried had difficulties with spatial reasoning. With GPT-5.3-Codex I took it further, asking for harder and harder models, so far the capabilities are very strong. I shared the results of all tested models in the GitHub repo: https://lnkd.in/dm5SmCYT
17
-
MD Fazal Mustafa
Heva AI • 12K followers
Introducing BharatMLStack by Meesho: India's Production-Ready ML Infrastructure Platform. Meesho is democratizing machine learning at scale with a comprehensive platform that's already powering million-scale deployments across India and beyond. 🎯 Why BharatMLStack? ✅ Handle 1M+ feature vector retrievals per second ✅ Sub-10ms latency for real-time decisions ✅ 99.99% uptime with auto-scaling ✅ Petabyte-scale feature processing ✅ Cloud-agnostic Kubernetes deployment 🔧 Key Components: Horizon Control Plane - Orchestrates your entire ML ecosystem Trufflebox UI - Modern web console for ML management Online Feature Store - Real-time feature serving at scale 🇮🇳 India-First, Global Standards Built with Indian market needs in mind while maintaining enterprise-grade reliability. The platform is battle-tested in production environments, serving organisations from startups to enterprises. ⚡ Anyone can get Started in Minutes With Docker setup, sample data, and comprehensive documentation, you can deploy your first ML pipeline today. Link for the repo in comment.
30
5 Comments -
Debayan Mitra
redBus • 2K followers
Wrote an article on Byte Pair Encoding (subword Tokenization) and how it works which was potentially used in the the earlier versions of GPT / traditional Language Learning model paradigms. Took a sample of Hindi Language corpus text data to showcase the same. Article below - https://lnkd.in/gCu7Bgpf
33
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreView Divay’s full profile
-
See who you know in common
-
Get introduced
-
Contact Divay directly