
Senior AI Engineer with 5+ years across backend engineering and applied GenAI, owning production-grade RAG and agentic systems in regulated legal and finance workflows. Led end-to-end development of a B2C/B2B GenAI legal research product serving ~700 daily users, architecting hybrid retrieval at scale (Milvus, TF-IDF + dense embeddings, RRF, cross-encoder reranking) to deliver high precision and low-latency performance. Strong in LLM reliability engineering (schema-safe outputs, rate-limit resilience, multi-model fallbacks, token optimization), OCR-driven document pipelines, and decision automation systems with human-in-the-loop controls. Proven technical leader who authors architecture docs, drives design reviews, mentors engineers, and ships measurable outcomes.
GenAI / Agents / RAG:Agentic RAG, LangChain, LangGraph, Tool Calling, MCP, A2A, OpenAI, Claude, Gemini, Mistral
Retrieval / Search / Ranking:Milvus, pgvector, Elasticsearch, TF-IDF, Reciprocal Rank Fusion, Voyage Embeddings, Voyage Cross-Encoder Reranking
Backend / Data / Streaming:Python, Java, SQL, FastAPI, Flask, Spring Boot, REST, gRPC, Apache Kafka, Apache Flink
Cloud / Datastores / OCR / DevOps:Azure OpenAI, AI Foundry, Blob Storage, Azure Document Intelligence, AWS Lambda, AWS S3, Vertex AI, PostgreSQL, MongoDB, Neo4j, Mistral OCR, Git, Jenkins