Spaces:
Running
A newer version of the Gradio SDK is available:
5.29.0
π§ Intelligent Financial Document Q&A App
Built with Langchain, Hugging Face, and RAG-powered LLMs
I developed a robust, AI-driven Q&A application that reads and comprehends financial reports in PDF format to deliver insightful, accurate answers along with reasoning. This project showcases the fusion of document intelligence, vector search, and large language models to simplify financial data exploration and analysis.
1. β Project Overview:
The application is built to process and answer questions from quarterly financial reports of Titan Company Ltd and Hindalco Industries Ltd, covering a period from Q1 FY2024 to Q3 FY2025. These documents include a mix of narrative audit reports and structured financial tables, such as balance sheets and profit & loss statements.
2. π Core Workflow and Architecture:
Document Ingestion and Processing Leveraged Langchain's UnstructuredPDFLoader to ingest PDF documents. This component efficiently extracts both textual narratives and tabular data, enabling a unified representation of structured and unstructured financial content.
3. Semantic Embeddings & Vectorization
Transformed the extracted data into high-dimensional embeddings using two top-tier Hugging Face models: sentence-transformers/gtr-t5-large BAAI/bge-large-en-v1.5 These embeddings were stored and indexed using FAISS, enabling fast and accurate similarity-based retrieval.
4. Retrieval-Augmented Generation (RAG) Chatbot
Built an intelligent chatbot using Langchainβs HuggingFace RAG pipeline, powered by the cutting-edge mistralai/Mixtral-8x7B-Instruct-v0.1 model. This allows the chatbot to fetch relevant document fragments and generate contextual, reasoned responses to user queries.
5. Deployment
The entire application was deployed seamlessly on a Hugging Face Space, offering an intuitive chat interface for users to explore financial insights in real-time.
π‘ Key Features:
- Handles both narrative and tabular data from PDFs
- Provides contextual answers with reasoning and document references
- Blazing-fast response time using FAISS vector search
- Deployed and accessible via a web-based interface on Hugging Face Spaces
π Use Cases:
- Financial analysts and investors querying specific metrics
- Internal audits and compliance teams validating figures
- Executive summaries derived from dense quarterly reports
- Rapid due diligence and report comparison across companies