Jan 2025 – Present
Remote · USA
AI Software Engineer — RAG, Agents & Eval — T-Mobile (via Wipro)
- Shipped a production RAG pipeline over 10K+ enterprise documents — LangChain + ChromaDB + FastAPI, sub-2s p95 retrieval, hybrid BM25 + dense retrieval with cross-encoder re-ranking
- Built an autonomous ReAct agent on GPT-4 function-calling — 5–10 tool calls per query across CRM, billing, and knowledge APIs, deflecting Tier-1 support load
- Stood up an LLM evaluation framework with RAGAS + LangSmith — measured faithfulness, context recall, and hallucination rate; cut the hallucination rate by ~60% across the assistant
- Authored 50+ prompt templates with safety guardrails, tone alignment, escalation logic, and PII redaction — reviewed and version-controlled like code
- Built the streaming chat UI in Vue 3 + React + Tailwind — token-level rendering, retry/resume, full WCAG 2.1 AA keyboard and screen-reader support
- Deployed Dockerized FastAPI microservices on AWS — zero-downtime rollouts via GitHub Actions, structured logs to CloudWatch, health/readiness probes