Summary
Overview
Work History
Education
Skills
Certification
Languages
Websites
Timeline
Generic

Muzna Buden Khan

Bangalore

Summary

Senior Data Engineer with 9+ years of experience designing and building scalable batch and streaming data pipelines using PySpark, Spark, Ab Initio, and Kafka on cloud platforms like Azure and AWS. Proven track record of automating data workflows, improving system reliability, and delivering multi-million-dollar business impact. Strong expertise in distributed data systems, CI/CD, and cross-functional collaboration.

Overview

10
10
years of professional experience
1
1
Certification

Work History

Software Engineer

Microsoft
Bangalore
07.2022 - Current
  • Built and managed scalable batch and streaming pipelines using Spark, Ab Initio, Kafka, Hadoop, and Cosmos DB, orchestrated via Concourse CI, Jenkins, and Kubernetes.
  • Led the Ab Initio to Spark migration, including end-to-end testing and reconciliation to ensure complete data accuracy and integrity.
  • Delivered a Post-Auction Discount Enablement system that unlocked $50–100 million in potential ad spend and contributed $5–10 million in incremental revenue by integrating into Microsoft’s ad auction workflows.
  • Developed automated schema and binary build pipelines with Concourse CI, reducing deployment time by over one hour, improving reliability, and compliance.
  • Engineered SLA-compliant batch pipelines for MSN and Xbox Exchange BI, delivering actionable insights through cross-functional collaboration with global product and dev teams.
  • Contributed to the codebase migration from Stash to Azure DevOps, streamlining repository access and governance.
  • Enabled rewarded video support in Microsoft Casual Games, and extended Video Analytics to support monetization insights, laying the foundation for future Xbox platform integration.
  • Integrated S3 with data pipelines to ingest vendor files, trigger downstream ETL processes, and set up AWS CloudWatch alerts, and S3 event notifications to monitor file uploads and track failures or delays.

Data Engineer

AT&T
Bangalore
04.2020 - 06.2022
  • Designed a QA test data pipeline using Ab Initio and shell scripting to enable continuous regression testing of the TL data platform, reducing test cycles to under one hour.
  • Migrated legacy MapReduce jobs to Ab Initio, improving performance and maintainability.
  • Contributed to the Hadoop 2 to Hadoop 3 migration, modernizing the data platform.
  • Developed Ab Initio continuous flows to read and write Kafka streaming data, expanding real-time data use cases.
  • Worked on the performance improvement of the existing graphs in Ab Initio.
  • Actively involved in LiveSite, providing real-time monitoring, and ensuring the quick resolution of production issues.

Senior Application Engineer

Securonix
Bangalore
06.2019 - 03.2020
  • Worked on SNYPR, a big data security analytics platform built on Hadoop, Spark, and AWS.
  • Managed infrastructure, ensured SLA adherence, and debugged Spark/Hadoop pipelines for threat detection use cases.
  • Debugged and resolved Spark job failures by analyzing logs using Spark UI, YARN, and AWS CloudWatch, ensuring timely recovery, data integrity, and adherence to SLA requirements.
  • Handled deployments for version upgrades, patches, and Windows updates across servers.
  • Built a utility that scans and parses all XML files, fetches certificates of the customers, notifies us before the SSL certificate expiry, and triggers an alert or mail. Used Python scripting and the OpenSSL utility for this.

Cloud Operation Engineer

SAP Labs
Bangalore
09.2015 - 05.2019
  • Providing operational support for all HCM SuccessFactors applications.
  • Managed and maintained SAP Cloud Data Center infrastructure across Windows, Linux, and HANA environments, ensuring high availability of applications and servers.
  • Installed and configured SSL certificates for customers, ensuring secure communication and compliance with security standards.
  • Performed application deployments and major code releases across multiple servers, coordinating with development and operations teams to minimize downtime.
  • Investigated and resolved customer incidents and service requests using ticketing systems such as JIRA, ServiceNow, and SPC, ensuring timely closure and high user satisfaction.

Education

Bachelor of Engineering - Computer Science & Engineering

SDM College of Engineering & Technology
Dharwad,Karnataka
06-2015

Skills

Languages: Python, SQL, Bash, Java

Big Data: PySpark, Apache Spark, MapReduce, Hive, HDFS, Kafka

Workflow orchestration: Kubernetes, Docker, Concourse CI, Jenkins

Databases and storage: Cosmos DB, MySQL, Azure Data Lake Storage (ADLS), HDFS

Cloud platforms: AWS (S3, EMR), Azure, Kubernetes

ETL: Ab Initio

Certification

Databricks Certified Data Engineer Associate (in progress)

Languages

English
First Language

Timeline

Software Engineer

Microsoft
07.2022 - Current

Data Engineer

AT&T
04.2020 - 06.2022

Senior Application Engineer

Securonix
06.2019 - 03.2020

Cloud Operation Engineer

SAP Labs
09.2015 - 05.2019

Bachelor of Engineering - Computer Science & Engineering

SDM College of Engineering & Technology
Muzna Buden Khan