Résumé · Open to full-time roles

Neeraj Bhargav Rondla

AI Engineer · RAG & Agentic Systems · Full-Stack AI Products

01

Profile

AI engineer building production-grade retrieval, agents, and LLM evaluation, plus the streaming front end, data layer, and infrastructure underneath. I ship full products end to end, from ChromaDB schemas and FastAPI microservices to WCAG-compliant React interfaces deployed on AWS with CI/CD.

02

Experience

Jan 2025 – Present

Remote · USA

AI Software Engineer — RAG, Agents & Eval

T-Mobile (via Wipro)

  • Shipped a production RAG pipeline over 10K+ enterprise documents — LangChain + ChromaDB + FastAPI, sub-2s p95 retrieval, hybrid BM25 + dense with cross-encoder re-ranking
  • Built an autonomous ReAct agent on GPT-4 function-calling — 5–10 tool calls per query across CRM, billing, and knowledge APIs, deflecting Tier-1 support load
  • Stood up an LLM evaluation framework with RAGAS + LangSmith to measure faithfulness, context recall, and hallucination rate; cut the assistant's hallucination rate by ~60%
  • Authored 50+ prompt templates with safety guardrails, tone alignment, escalation logic, and PII redaction — reviewed and version-controlled like code
  • Built the streaming chat UI in Vue 3 + React + Tailwind — token-level rendering, retry/resume, full WCAG 2.1 AA keyboard and screen-reader support
  • Deployed services as Dockerized FastAPI microservices on AWS — zero-downtime rollouts via GitHub Actions, structured logs to CloudWatch, health/readiness probes

Aug 2023 – Present

Remote

AI Engineer — Open Source & Contract

Independent

  • Built and open-sourced VIBETTER — an MCP server that gives AI coding IDEs (Claude Code, Cursor) richer project context via stateful tool use
  • Built Context Refinery — a LangGraph multi-agent prompt optimizer with a Tauri desktop UI; iteratively rewrites prompts against eval rubrics
  • Operate a personal RAG over my resume + projects + writing — Claude primary, GPT-4o + Gemini 1.5 fallback, citation-grounded responses
  • All systems Dockerized on AWS EC2 + Nginx with CI/CD, structured logging, and error budgets — production-grade, not demoware

Aug 2021 – Jul 2023

Hyderabad, India

Software Engineer — NLP & Full-Stack

Wipro Ltd.

  • Built an AI code-review assistant on Python + OpenAI Codex — cut PR cycle time 25%, adopted by 30+ engineers across 4 product teams
  • Designed an NLP ticket-classification pipeline (BERT-based) hitting 91% accuracy on 5,000+ tickets/month — reduced manual triage by 40%
  • Embedded an LLM chatbot into 3 enterprise portals serving 10,000+ users — deflected ~40% of repetitive support queries
  • Built a reusable React component library adopted across 3 products, accelerating delivery by 30%
  • Led a jQuery → React SPA migration that cut page load time by 45%; tuned MySQL schemas and indexes on tables of 5M+ records (~55% faster queries)

03

Selected projects

NYC Agentic Underwriter & Feasibility

CRE Intelligence Platform

Built an autonomous CRE underwriting platform resolving NYC street addresses via DCP GeoSearch. Integrates parallel DoITT PLUTO & ACRIS Deed Sales queries and a Zoning RAG vector database. Features a pure CSS 3D buildable massing visualizer, interactive pro forma calculator, and a Claude-powered 'Ask the Agent' chat drawer.

React · FastAPI · Python · Claude Sonnet · Leaflet

Open-Source MCP Server for AI Coding IDEs

VIBETTER

Built and maintained an open-source MCP server that brings deep codebase intelligence to AI coding IDEs — git-diff explanations, source-grounded Q&A with file:line citations, dependency graph visualization, and audio code walkthroughs.

Python · FastMCP 3.0 · Gemini API · Vue Flow · Watchdog

LangGraph Multi-Agent Context Orchestration Engine

Context Refinery

Built a cross-platform desktop app (Tauri v2 + Vue 3 + FastAPI) with a LangGraph multi-agent pipeline — intent analysis, hybrid retrieval (ChromaDB dense + BM25 sparse + Reciprocal Rank Fusion), cross-encoder reranking, and iterative self-refinement — with multi-provider LLM support including local models via Ollama.

Python · LangGraph · FastAPI · Vue 3 · Tauri v2

Enterprise Knowledge Retrieval & LLM Evaluation at T-Mobile

Production RAG System

Built a production RAG pipeline for customer support knowledge retrieval — from document ingestion and recursive chunking to embedding, vector indexing, and semantic search with metadata filtering. Paired with an LLM evaluation framework that measurably reduced factual errors.

Python · LangChain · ChromaDB · FastAPI · RAGAS

04

Capabilities

LLM Systems

GPT-4 / 4o / o-series · Claude (Sonnet / Opus) · Gemini 1.5 Pro · Llama 3 / Mistral (vLLM) · Prompt Engineering · Structured Outputs · JSON Mode · Streaming · SSE · Function / Tool Calling · Token & Cost Optimization · Fine-tuning · LoRA / QLoRA

Retrieval & RAG

LangChain · LlamaIndex · ChromaDB · Pinecone · FAISS · pgvector / Postgres · Hybrid (BM25 + Dense) · Cross-Encoder Re-ranking · Semantic / Recursive Chunking · Document Pipelines · Unstructured.io

Agents & Orchestration

ReAct / Plan-and-Execute · LangGraph · Multi-Agent Systems · Tool Use · Function Calling · Memory · Short / Long Term · MCP / FastMCP Servers · Anthropic & OpenAI SDKs · DSPy · Workflow State Machines

Evaluation, Safety & Observability

RAGAS · LangSmith · LangFuse · Weights & Biases · LLM-as-Judge · Hallucination Detection · Guardrails · NeMo / LlamaGuard · Prompt Injection Defense · PII Redaction · Red-Teaming · Eval Harnesses

Frontend & AI UX

React 18 · Next.js 14 · TypeScript · Vue.js 3 · Pinia · Tailwind CSS · Streaming Chat Interfaces · Server Components · RSC · WCAG 2.1 / AA · Framer Motion · Design Systems · Figma

Backend, Data & Infrastructure

Python · FastAPI · Pydantic · Node.js · Express · PostgreSQL · Redis · MongoDB · Docker · Kubernetes (basics) · AWS · EC2 / S3 / Lambda / ECS · Vercel · Render · GitHub Actions · CI/CD · Nginx · Observability (CloudWatch)

05

Education

2025

M.S. Computer Science

New Jersey Institute of Technology

Machine Learning · Deep Learning · NLP · Human-Computer Interaction · Database Systems

B.E. Computer Science

JNTU, Hyderabad, India

Updated Jun 2026