
Senior AI Engineer with 5+ years across backend engineering and applied GenAI, owning production-grade RAG and agentic systems in regulated legal and finance workflows. Led end-to-end development of a B2C/B2B GenAI legal research product serving ~700 daily users, architecting hybrid retrieval at scale (Milvus, TF-IDF + dense embeddings, RRF, cross-encoder reranking) to deliver high precision and low-latency performance. Strong in LLM reliability engineering (schema-safe outputs, rate-limit resilience, multi-model fallbacks, token optimization), OCR-driven document pipelines, and decision automation systems with human-in-the-loop controls. Proven technical leader who authors architecture docs, drives design reviews, mentors engineers, and ships measurable outcomes.
GenAI / Agents / RAG:Agentic RAG, LangChain, LangGraph, Tool Calling, MCP, A2A, OpenAI, Claude, Gemini, Mistral