Bilingual Data Engineer with over 10 years of experience in designing and leading enterprise-scale ETL pipelines and data warehouses, specializing in large data systems and regulatory compliance solutions within the financial services sector. Expertise includes migrating legacy SAS, Ab Initio, and Teradata platforms to modern big data ecosystems such as Hadoop, Hive, PySpark, and Databricks. Proven leadership in managing client engagements across the US and LATAM, complemented by 5.5 years of onsite experience in Bogotá, Colombia. Committed to driving innovation and efficiency through data-driven solutions that enhance organizational performance.
Overview
11
11
years of professional experience
1
1
Certification
Work History
Assistant Consultant (C3A) – Data Engineer / ETL Lead
Tata Consultancy Services (TCS)
Chennai, Tamil Nadu, India
06.2015 - Current
Key Achievements & Responsibilities
Led end-to-end migration of legacy SAS, Ab Initio, and Teradata processes to Hive and PySpark, significantly improving scalability and reducing processing costs.
Designed and implemented CCPA/CPRA-compliant data-deletion frameworks for Enterprise Fraud Detection (EFD) EDW using retention logic and legal preservation rules.
Built Hive-based regulatory models for Reg Z (TILA), enabling automated dispute tracking, SLA monitoring, and customer communication workflows.
Developed and operationalized Data Quality rules across staging and standardization layers in collaboration with Data Owners and Stewards using Collibra and DQP.
Automated SLA forecasting and monitoring by integrating AutoSys jobs with Datalens dashboards.
Troubleshot Spark application performance bottlenecks and resolved job failures using Application IDs and cluster logs.
Managed requirement grooming and sprint planning with US and LATAM business teams; translated complex regulatory requirements into technical deliverables.
Owned CI/CD pipelines using Jenkins, RLM, and Bitbucket for zero-downtime production deployments.
Notable Projects
Enterprise Fraud Detection EDW & Data Lake Migration
Re-engineered Ab Initio graphs and replicated logic in Datahub (HDFS + Hive) as part of enterprise shift from data warehouse to lakehouse architecture.
US Regulatory Reporting – Reg Z / Reg E
Designed Hive + Internal Spark framework models for automated dispute management and regulatory compliance.
Vallar – Internal Fraud Monitoring
Built ingestion and standardization layers for employee activity logs using Hadoop ecosystem.
AML / KYC Testing
Performed manual testing of AML/KYC data in SAS datasets using Proc SQL, Proc Append, Proc Freq, etc.
Served as point of contact for clients regarding follow-up, meeting scheduling, and responses to questions.
Education
Bachelor of Engineering (B.E.) - Electrical & Electronics Engineering
Sri Sai Ram Institute of Technology
Chennai, India
01.2015
Skills
Data Engineering: Hadoop, Hive, HDFS, Teradata, HUE, PySpark (Intermediate)
ETL & Programming: SAS (Base, Macros), Ab Initio, SQL, HQL