Skip to content
View svpathak's full-sized avatar
  • Mumbai

Block or report svpathak

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
svpathak/README.md

Hi, I'm Sanchit Pathak πŸ‘‹

Typing SVG

About Me

  • πŸŽ“ MTech in Data Science, IIT Guwahati (CPI: 9.61) Β |Β  BTech in Electrical Engineering, VJTI Mumbai
  • πŸ’Ό ML Engineer / Data Scientist with ~2.5 years of ML/AI experience β€” NLP pipelines, LLM-based RAG systems, and cloud-scale analytics on Azure & GCP
  • πŸ”¬ ACM Published β€” S-VQA at ICVGIP 2023
  • πŸ† LeetCode Knight β€” Top ~5% globally
  • πŸš€ Passionate about LLMs, RAG, PEFT/LoRA fine-tuning, Agents, and NLP evaluation
  • 🌱 Currently exploring: systematic ML evaluation, long-document QA, and agent orchestration patterns

πŸ› οΈ Tech Stack

Skills

  • Core areas: LLMs Β· RAG Β· PEFT/LoRA Β· NLP Β· Computer Vision Β· Transformers Β· ML Evaluation
  • Cloud: Azure (Databricks, AI Search, AI Foundry) Β· GCP (BigQuery, GCS)
  • Languages: Python Β· SQL Β· C/C++

πŸš€ Projects

Project Description Links
Diagnosing RAG Failure Modes Systematic evaluation of RAG on long-document QA (QASPER dataset). 4 stress-test experiments, custom Evidence Coverage Score (ECS) metric. Live Demo Β· GitHub
S-VQA Sentence-based Visual Question Answering β€” TDIUC-SVQA dataset construction and multi-task multimodal modeling. ACM Β· ICVGIP 2023. GitHub Β· Paper
Business Analytics Chatbot Conversational analytics agent built with Google ADK + Gemini. Natural language queries returning concise answers and charts. GitHub
PEFT LoRA Fine-tuning Parameter-efficient fine-tuning of LLMs using LoRA β€” with a live interactive demo on HuggingFace Spaces. Live Demo Β· GitHub
Movie Recommendation System Collaborative filtering on the MovieLens dataset with recommendation-specific evaluation including Long Tail analysis. GitHub
Article Bias Prediction (LSTM) Multi-approach similarity-based political bias detection in news articles using LSTM. GitHub
Image Caption Generator Automatic image captioning model inspired by the Show and Tell architecture. GitHub

πŸ“„ Publication

S-VQA: Structured Visual Question Answering
ACM Β· ICVGIP 2023
πŸ“– View Paper β†’ Β |Β  πŸ’» View Paper Summary β†’

loT Based Real-Time Harmonic Monitoring System for Distributed Generation
IEEE Β· I2CT 2018
πŸ“– View Paper β†’

🎯 Currently

  • πŸ” Open to MLE / Applied Scientist / Senior Data Scientist roles at product-based companies
  • πŸ› οΈ Building: systematic RAG evaluation frameworks and agentic AI workflows
  • πŸ“¬ Best way to reach me: LinkedIn

πŸ“Š GitHub Stats

🌐 Connect with Me

Β  Β  Β  Β 


Pinned Loading

  1. rag-failure-modes rag-failure-modes Public

    Systematic evaluation of RAG on long-document QA (QASPER dataset). Four experiments, custom Evidence Coverage Score (ECS) metric.

    Python

  2. svqa-sentence-based-vqa svqa-sentence-based-vqa Public

    Sentence-based Visual Question Answering with TDIUC-SVQA dataset construction and multi-task multimodal modeling (ICVGIP 2023, ACM).

  3. agenticai-business-analytics-chatbot agenticai-business-analytics-chatbot Public

    An Agentic AI solution to ask business/market share/finance related question to an AI ChatBot and get answeres in conise bullet points and illsutrative chart. Developed on Google's ADK and Streamli…

    Python 3

  4. peft-lora-finetuning-dialogsum peft-lora-finetuning-dialogsum Public

    Parameter-efficient fine-tuning of LLMs using LoRA with a live interactive demo on HuggingFace Spaces.

    Jupyter Notebook

  5. Collaborative-Filtering-based-Movie-Recommendation Collaborative-Filtering-based-Movie-Recommendation Public

    A collaborating filtering based system on Movie Lens dataset to recommend user specific movie suggestions. The model was evaluated with recommedation specific metrics including Long Tail plot using…

    Jupyter Notebook

  6. Multi-Approach-Similarity-Based-Article-Bias-Prediction-using-LSTM Multi-Approach-Similarity-Based-Article-Bias-Prediction-using-LSTM Public

    Multi-approach similarity-based political bias detection in news articles using LSTM.

    Python