top of page

Letting Data Speak, AI Act!

Case Study

RAG-Based Retrieval System for Spiritual Content

A non-profit spiritual and educational organization dedicated to teaching Vedic knowledge and promoting spiritual awareness in society. The organization focuses on spreading eternal knowledge in accessible formats while organizing spiritual camps and retreats. Their core mission involves guiding spiritual seekers toward God-realization through practical devotion and charitable activities.

About the Client

A non-profit spiritual and educational organization dedicated to teaching Vedic knowledge and promoting spiritual awareness in society. The organization focuses on spreading eternal knowledge in accessible formats while organizing spiritual camps and retreats. Their core mission involves guiding spiritual seekers toward God-realization through practical devotion and charitable activities.

Untitled design - 2024-09-27T104509.589.png

Challenge

The foundation faced significant challenges in making vast spiritual teachings accessible and interactive for devotees across diverse backgrounds and languages. They possessed thousands of hours of video content, audio files, and books containing invaluable spiritual knowledge, but lacked an effective methodology to process and retrieve information from these unstructured data sources.

  • Content Accessibility Crisis: Manual searching through thousands of hours of spiritual content took hours, creating barriers for seekers needing immediate guidance

  • Unstructured Data Processing: Diverse multimedia content formats (videos, audio, transcripts) require sophisticated processing to become searchable and retrievable

  • Language Barriers: Hindi content needed translation and multilingual support to reach broader audiences

  • Technical Implementation Gap: While recognizing the potential of RAG-based solutions, they lacked a proper methodology to ingest massive volumes of diverse content into vector databases for effective querying

Without a solution, the organization risked losing engagement from spiritual seekers who needed instant access to relevant teachings, potentially limiting their mission's reach and impact.

Untitled design - 2024-09-27T105551.128.png

Key Results

  • Processed 3,000+ hours of spiritual content into a searchable format with complete metadata preservation

  • Reduced content discovery time from hours of manual searching to seconds of intelligent retrieval

  • Achieved 99%+ system reliability through robust fallback systems across multiple LLM providers

  • Decreased manual curation effort by 70% through an automated content processing pipeline

  • Enabled exact document chunk retrieval with complete source attribution and timestamp accuracy

Solution


ree

Content Processing Pipeline Implementation: JashDS developed a comprehensive ingestion system to handle video, audio, and transcript files from S3 or local storage. The solution converted videos to audio format using FFMpeg for optimized downstream processing, then generated accurate transcripts using AssemblyAI's sentence API for enhanced quality. LLM-based translation converted Hindi transcripts to English for broader accessibility, while multiple chunking strategies, including fixed sentence chunks with overlap and semantic chunking, ensured optimal information retrieval.

Advanced Indexing and Metadata Management: The system built an indexing pipeline to store rich metadata alongside vector embeddings in Pinecone, including start/end timestamps, file names, batch information, and downloadable S3 links for video/audio/SRT files. This comprehensive metadata preservation enabled precise source attribution and seamless user experience.

Intelligent RAG Query Pipeline: JashDS implemented query translation to Hindi using LLM for multilingual support, integrated OpenAI content moderation to filter irrelevant queries, and developed a multi-index data retrieval system with similarity search, keyword search, and hybrid search capabilities. Advanced reranking using Cohere Rerank, GTE-base, Cross-encoder, and LLM-based options ensured high-quality results, while configurable threshold-based filtering maintained response relevance and quality.

Reliability and Fallback Systems: The solution incorporated LLM fallback mechanisms across OpenAI GPT, Claude, and Gemini models to ensure consistent system availability and performance, guaranteeing spiritual seekers could access guidance regardless of individual model availability.


Untitled design - 2024-09-27T104509.589.png

Technologies Used

  • AWS S3 - Cloud storage and content management

  • FFmpeg - Video to audio conversion and media processing

  • AssemblyAI Sentence API - High-quality speech-to-text transcription

  • Pinecone - Vector database for embeddings and retrieval

  • Cohere embed-multilingual-v3.0 - Multilingual embedding generation

  • OpenAI GPT, Claude, Gemini - Language models with fallback architecture

  • Cohere Rerank, GTE-base, Cross-encoder - Advanced reranking systems

  • OpenAI Content Moderation API - Query filtering and content safety

Other Case Study Items

Revolutionizing Personal Loans with AI-Driven Underwriting

Revolutionizing Personal Loans with AI-Driven Underwriting

A leading Indian personal loan provider revolutionized their underwriting process by leveraging AI and machine learning to automate 80% of loan decisions. By integrating social and financial data into a sophisticated predictive algorithm, the company drastically reduced decision times to seconds expanded access to underserved segments, and achieved lower default rates compared to human underwriters.

Artificial Intelligence - Powered Tyre Dimension Extraction System

Artificial Intelligence - Powered Tyre Dimension Extraction System

JashDS developed an AI-powered computer vision system for a leading automotive e-commerce platform, enabling accurate extraction of tire dimensions from images. The solution, which increased conversion rates by 25% and reduced customer support inquiries by 80%, utilized advanced technologies such as YoloV8 for instance segmentation and custom-designed augmentation techniques to simplify the online tire purchasing process.

Enhanced Jira Data Analysis for Strategic Insights

Enhanced Jira Data Analysis for Strategic Insights

JashDS developed a flexible framework for analyzing Jira project data that is capable of handling varying export structures and custom fields. The solution leveraged GenAI and LLM technologies to provide actionable insights, identify productivity trends, and uncover potential risks across diverse software projects, resulting in a ___% improvement in team efficiency and a ___% increase in successful project outcomes.

bottom of page