NR* — 001 · Jersey City, NJOpen to Forward Deployed roles

● Issue 01 · an introduction

Hi, I’m Neeraj.

I take ambiguous problems from zero to production AI systems — owned end-to-end, model to interface to infrastructure, with zero handoffs.

The engineer you hand the ambiguous, end-to-end problem to. I sit with it, run the technical discovery, decompose it, and ship the whole stack — retrieval, agents, evaluation, the streaming UI, and the infrastructure that runs it. Currently sole AI engineer at Landair Advisors, owning five production systems zero-to-one in NYC commercial real estate, where a hallucination costs real money. Depth where the problem is hard; range where shipping demands it. Based in Jersey City, NJ — targeting Forward Deployed Engineer roles.

See the work

Get in touch

Scroll · 02 · About

About the engineer

One vertical — retrieval, agents, evaluation, and the interfaces that ship them.

Issue 02

One engineer, the entire AI stack.

One person carrying a feature from the data layer to the pixels — so nothing gets lost in a handoff, because there isn't one.

“I’m the full vertical — retrieval, agents, evaluation, streaming UI, and the AWS pipeline that ships it. Fewer tickets. Faster iteration.”

Currently the sole AI engineer at Landair Advisors (NYC commercial real estate) — five production systems, zero-to-one. Open-source on the side: VIBETTER, Context Refinery, OmniContext, and the CRE Intelligence Platform. Previously Innovcentric (RAG, agents, ~60% hallucination cut) and Wipro NLP. Targeting Forward Deployed Engineer roles.

Own it end-to-end.

From discovery to deployment to the 2 a.m. page — no handoffs, no excuses.

ii.

Sit with the problem first.

Technical discovery before code. Framing the right system is the hard part.

iii.

Ship systems, not demos.

Deployed, monitored, behind CI/CD — or it doesn't count.

iv.

Evaluation is the product.

If you can't measure it, you can't put it in front of a customer.

Outside the terminal

Photography
Gym
Gaming
Reading
Travel
Anime
Motorcycles
Cars
Building things end-to-end

Selected work

Seven AI products — drag the rail, sector lights track your progress.

Issue 03

Personal & Open-Source.

Alongside my enterprise work, I independently build these tools and platforms to stay hands-on with the rapidly evolving agentic AI stack. Drag or scroll to explore.

live

Full-Stack Developer

SupportIQ

AI Customer Support Platform

Next.jsReact 19TypeScript

2026Case study

live

Creator

CRE Intelligence Platform

NYC Agentic Underwriter & Feasibility

ReactFastAPIPython

2026Case study

Creator & Maintainer

OmniContext

Universal State Manager & Handoff Bridge

Next.jsReactTailwindCSS

2026Case study

Creator & Maintainer

Context Refinery

LangGraph Multi-Agent Context Orchestration Engine

PythonLangGraphFastAPI

2026Case study

Creator & Maintainer

VIBETTER

Open-Source MCP Server for AI Coding IDEs

PythonFastMCP 3.0Gemini API

Feb 2026Case study

AI Software Engineer

Production RAG System

Enterprise Knowledge Retrieval & LLM Evaluation at Innovcentric

PythonLangChainChromaDB

Nov 2025Case study

AI Engineer

Autonomous AI Agent

Multi-Step Reasoning with Tool Use & Cited Reports

PythonLangChainGPT-4 Function Calling

2025Case study

All work index

Background & stack

Experience timeline plus the full toolkit — always visible, no tabs.

Issue 04

History & stack.

Three roles, one habit — take a fuzzy AI problem, ship a system you can measure, then keep shipping. Here's the toolkit behind it.

Experience

Applied AI Engineer — Owns 5 Production AI Systems
2026 – Present
Landair Advisors · New York, NY · On-site
- Applied AI Engineer at a NYC commercial real-estate brokerage — inherited and own five production AI systems end-to-end on NYC property data (ACRIS, PLUTO, DOB, NY DOS)
- Skiptracer — LLC-chain owner resolution via a NY DOS → ACRIS → IDI waterfall with confidence scoring; ≥95% owner + phone accuracy target. Python · Supabase · Claude Haiku
- CompScope — Supabase database of 9,391+ comparable sales with PLUTO + ACRIS asset-type classification, powering broker deal analysis
- BuyerScope — ranks buyers per asset type from ACRIS transfers + DOB permits into top-30 outreach lists; 2,456 scored, 109 Tier-A. Python · Supabase
- P1 Generator — webhook MMS owner outreach: CallTools → Flask → Twilio MMS + Cloudflare R2 + SendGrid
- Call Notes Worker — inbound-call transcription + scoring: CallTools audio → Whisper → GPT-4o → structured output. Node.js · Linode
- ABV — agentic real-estate valuation tool on the same data stack, demoed to management and live at realestate-agentic.vercel.app
AI Software Engineer — Full Lifecycle, Zero Handoffs
Nov 2025 – Jun 2026
Innovcentric LLC · Remote · USA
- Owned the complete Gen AI strategy and implementation lifecycle for enterprise customer-experience platforms — technical discovery with product owners and operations managers, problem decomposition from ambiguous business needs into production AI architectures, pilot validation before full rollout, and post-deployment optimization — one engineer, model to interface to infrastructure, zero handoffs
- Built a production-grade RAG pipeline (LangChain + ChromaDB + FastAPI + Pydantic) over 10,000+ pages of support documentation — PDF/DOCX ingestion, recursive chunking (512-token windows, 50-token overlap), OpenAI embeddings, and metadata filtering — delivering sub-2s semantic search for 100+ support agents and replacing manual document lookup entirely; benchmarked 4 chunking strategies and 3 embedding models to tune retrieval
- Deployed an autonomous ReAct agent on GPT-4 function calling with MCP-style tool-use chains — knowledge retrieval, account-context lookup, multi-step reasoning, and sub-agents for specialized tasks — running 5–10 tool calls per query with self-correction loops, dual memory (conversation buffer + vector store), and Human-in-the-Loop escalation for billing-sensitive edge cases
- Built the eval harness that proved it worked: RAGAS (faithfulness, relevance, hallucination scoring), LangSmith observability, drift monitoring that held the line through the GPT-4 → GPT-4o migration, and prompt versioning — cutting factual errors ~60% and codifying the results as repeatable deployment artifacts
- Engineered 50+ prompt templates (chain-of-thought, few-shot, instruction tuning) with safety guardrails, prompt-injection defense, tone alignment, and a version-controlled lifecycle — reviewed and shipped like production code, not tuned ad hoc
- Owned the full stack with zero handoffs: a streaming chat UI (real-time token rendering, accessible, mobile-responsive) in Vue 3/React + Tailwind, a FastAPI service layer with structured error handling and Redis caching, and Docker + Nginx + GitHub Actions CI/CD — shipped to production with zero-downtime deployments across environments
- Designed the UX in Figma (WCAG 2.1 AA, component libraries, responsive breakpoints) and validated it with real users; ran white-glove delivery — L1–L2 support, weekly office hours, and stakeholder runbooks — backed by OpenTelemetry instrumentation, structured logging, and health checks
Software Engineer — NLP, AI Tools & Full-Stack
Aug 2021 – Jul 2023
Wipro Ltd. · Hyderabad, India
- Built and deployed an AI-powered code-review assistant (Python + OpenAI Codex) with custom prompt engineering for logic-error and code-smell detection — cutting PR merge cycles 25% and adopted by 30+ engineers
- Designed an NLP ticket-routing pipeline (Scikit-learn + HuggingFace BERT, Pandas, NumPy) hitting 91% accuracy on 5,000+ tickets/month and reducing manual triage 40%; delivered a conversational AI chatbot with intent detection, FAQ automation, conversation memory, and escalation logic across 3 enterprise portals — 40% fewer repetitive queries for 10,000+ users
- Built and maintained enterprise React applications at 99.5% production SLA — interactive components, form validation, Chart.js dashboards, and Lighthouse/code-splitting performance work; created a reusable component library that accelerated delivery ~30% and led a jQuery → React SPA migration for 45% faster page loads
- Engineered event-driven REST and GraphQL services (Java + Python, PostgreSQL + SQLAlchemy + Alembic) with query optimization across tables of 5M+ records (~55% faster); trained supervised ML models (Random Forest, Logistic Regression, Gradient Boosting) with automated hyperparameter tuning and cross-validation, and resolved 200+ defects across 4 release cycles

Stack

Seven domains, end to end — the tools I reach for to take an AI problem from prototype to production.

Forward-Deployed Delivery

End-to-End Gen AI OwnershipZero-to-One Production DeploymentTechnical Discovery & ScopingProblem DecompositionAmbiguity NavigationPilot → ProductionStakeholder CommunicationWhite-Glove Onboarding · L1–L2Runbooks · Office HoursPost-Deployment Optimization

LLM Systems

GPT-4 / 4oClaude Sonnet 4.5 / OpusGemini (2M context)Llama 3 / Mistral (Ollama)Prompt Engineering · VersioningStructured Outputs · JSON ModeStreaming · SSEFunction / Tool CallingSFT · LoRA / QLoRAToken & Cost Optimization

Retrieval & RAG

LangChainLlamaIndexChromaDBPinecone · FAISSpgvector · WeaviateHybrid (BM25 + Dense + RRF)Cross-Encoder Re-rankingSemantic / Recursive ChunkingAgentic RAGDocument Ingestion Pipelines

Agents & Orchestration

ReAct · Plan-and-ExecuteLangGraphMulti-Agent SystemsMCP · FastMCP 3.0Sub-Agents · Tool RegistryMemory (buffer + vector)Human-in-the-LoopDSPyasyncio · Event-DrivenWorkflow State Machines

Evaluation & Observability

RAGAS · DeepEvalLangSmith · AgentOpsArize PhoenixLLM-as-JudgeHallucination DetectionFaithfulness ScoringDrift MonitoringSafety GuardrailsPrompt Injection DefenseOpenTelemetry

Machine Learning & NLP

HuggingFace TransformersScikit-learnTensorFlow · PyTorchText Classification · NERSummarizationTransfer LearningCNNs · ResNetGrad-CAMPandas · NumPyMLOps

Frontend & AI UX

React 18/19 · Next.js 16TypeScriptVue.js 3 · Vue FlowTailwind CSS v4Streaming Chat UIsServer Components · RSCWCAG 2.1 AAFramer MotionTauri v2 · LeafletFigma · Lighthouse

Backend & Infrastructure

Python · FastAPI · PydanticNode.js · NestJSJava · GraphQL · WebSocketsPostgreSQL · Prisma · SupabaseMongoDB · RedisDocker · Docker ComposeAWS (EC2 / S3 / Lambda / ECS)Vercel · GitHub ActionsNginx · MicroservicesHealth Checks · Structured Logging

Education

M.S. Computer Science — New Jersey Institute of Technology (NJIT)2023 – Dec 2025

B.Tech. Computer Science & Engineering — JNTUH — Hyderabad, India2018 – 2022

OPT / STEM extension eligible

Essays & notes

Production lessons, RAG architecture, and agentic systems.

Issue 05

Thinking out loud.

Notes from actually building this stuff — what worked, what I'd do differently, and the decisions that don't fit in a README.

019 min read

Forward Deployed

Forward-Deployed: Shipping Five AI Systems Zero-to-One

What it actually takes to be the only AI engineer at a brokerage — technical discovery, decomposition, and owning five production systems end-to-end with no handoffs.

Read essay

028 min read

Evaluation

An Agent You Can't Measure Is One You Can't Trust

The eval harness — not the model — is what let a production RAG agent survive a GPT-4 to GPT-4o migration and cut factual errors ~60%.

Read essay

037 min read

Agentic Systems

When Verification Is the Product

A skiptracing agent the team acts on without re-checking. The contact lookup is easy; the cross-verification that makes it trustworthy is the actual system.

Read essay

All writing

Targeting Forward Deployed Engineer roles · End-to-end Gen AI ownership · Zero-to-one production systems · Applied AI Engineer @ Landair Advisors · 5 production AI systems · NYC CRE · Model to interface to infrastructure · Jersey City · NJ · OPT + STEM eligible · 60% factual-error cut via eval harness · Targeting Forward Deployed Engineer roles · End-to-end Gen AI ownership · Zero-to-one production systems · Applied AI Engineer @ Landair Advisors · 5 production AI systems · NYC CRE · Model to interface to infrastructure · Jersey City · NJ · OPT + STEM eligible · 60% factual-error cut via eval harness ·

Start a conversation

Targeting Forward Deployed Engineer roles — tap the shutter for a vCard.

Issue 06

Have a problem worth solving?

Open to full-time AI roles, and always up for a good problem. Drop me a line — I usually reply same day.

Primary channel

rondlanbr@gmail.com

Write an email

Open to Forward Deployed roles · Jersey City, NJ

/in/neerajbhargav

GitHub

/neerajbhargav

Hi, I’m Neeraj.

I take ambiguous problems from zero to production AI systems — owned end-to-end, model to interface to infrastructure, with zero handoffs.

One engineer,One engineer, the entire AI stack.the entire AI stack.

Own it end-to-end.

Sit with the problem first.

Ship systems, not demos.

Evaluation is the product.

Outside the terminal

Personal &Personal & Open-Source.Open-Source.

SupportIQ

CRE Intelligence Platform

OmniContext

Context Refinery

VIBETTER

Production RAG System

Autonomous AI Agent

HistoryHistory & stack.& stack.

Applied AI Engineer — Owns 5 Production AI Systems

AI Software Engineer — Full Lifecycle, Zero Handoffs

Software Engineer — NLP, AI Tools & Full-Stack

Forward-Deployed Delivery

LLM Systems

Retrieval & RAG

Agents & Orchestration

Evaluation & Observability

Machine Learning & NLP

Frontend & AI UX

Backend & Infrastructure

ThinkingThinking out loud.out loud.

Forward-Deployed: Shipping Five AI Systems Zero-to-One

An Agent You Can't Measure Is One You Can't Trust

When Verification Is the Product

Have aHave a problem worth solving?problem worth solving?

One engineer, the entire AI stack.

Personal & Open-Source.

History & stack.

Thinking out loud.

Have a problem worth solving?