Summary
Overview
Work History
Education
Skills
Accomplishments
Certifications & Training
Timeline
Generic

Faizan Khan

Data Engineer
Bangalore

Summary

I build reliable, scalable data solutions that drive business value. With deep expertise in Databricks, Apache Spark, Azure, and Airflow, I’ve migrated critical pipelines, engineered real-time wealth data integrations, and maintained production stability across 200+ high-impact jobs. I focus on building fault-tolerant infrastructure that accelerates delivery, simplifies monitoring, and aligns tightly with business goals

Overview

3
3
years of professional experience

Work History

Data Engineer

Stone X Group Inc
03.2023 - Current
  • Designed and deployed scalable data pipelines using Databricks, Airflow, and Python, enabling reliable market data processing for Securities Lending and Risk platforms.
  • Migrated legacy ingestion systems from StreamSets to Databricks, improving pipeline reliability by 80% and enabling Rancher decommissioning, saving $50K annually.
  • Built robust CDC and streaming architecture with Debezium and Redpanda, delivering real-time Risk data for downstream consumers using the Medallion architecture (Bronze → Silver → Gold).
  • Provided critical production support across Databricks, Airflow, and Dremio using OpsGenie, resolving high-priority incidents and contributing to platform stability.

Data Analyst Intern

Titan Company
09.2022 - 11.2022
  • Executed extensive web scraping to gather market data across diverse platforms, resulting in a dataset of 10,000+ records from 20 different sources.
  • Utilized machine learning algorithms and data visualization tools to compare products from various brands, implementing a recommendation system that led to a 15% increase in sales for top-performing products

Education

Bachelors of Engineering - Computer Science Engineering

CMR Institute of Technology
Bangalore
05.2001 -

Skills

ETL development

Accomplishments

  • Received Star Performer Award and a Fastrack Promotion for consistent high-impact contributions
  • Selected for the AI Champions group to lead GitHub Copilot onboarding and innovation in GenAI tooling.

Certifications & Training

  • Databricks Lakehouse Platform Accreditation – Covered Delta Lake, Unity Catalog, Lakehouse architecture, and pipeline development.
  • Apache Kafka Fundamentals – Pluralsight – Focused on Kafka producers, consumers, brokers, topic partitioning, and real-time streaming.
  • Current 2025 Conference – Confluent – Explored event-driven systems, Kafka innovations, and future of real-time data platforms.
  • LEGO Serious Play Workshop – StoneX – Hands-on workshop to develop creative thinking, communication, and collaborative problem-solving.
  • Generative AI – Exploring LLMs, RAG architecture, prompt engineering, and building GenAI-based tools.
  • Hackathons – Regular participant in AI and data engineering challenges focused on innovation, automation, and rapid prototyping

Timeline

Data Engineer

Stone X Group Inc
03.2023 - Current

Data Analyst Intern

Titan Company
09.2022 - 11.2022

Bachelors of Engineering - Computer Science Engineering

CMR Institute of Technology
05.2001 -
Faizan KhanData Engineer