AWS Data Engineer with 2.6 years of experience in maintaining and optimizing cloud-based data
engineering solutions. Skilled in AWS services like S3, Glue, Lambda,
Kinesis, and Redshift, with expertise in Veeva Network CRM for mastering HCP and HCO records.
Adept at monitoring workflows, automating processes, and collaborating with teams to deliver
robust, scalable solutions aligned with business goals.
• Provided continuous support for AWS-based data pipelines, troubleshooting issues related to Glue, S3, and Athena.
• Monitored the performance of data workflows, proactively identifying bottlenecks and optimizing resource usage in Glue and Lambda.
• Collaborated with the development team to scale S3 data storage and Lambda functions, ensuring efficient handling of increasing data volumes.
• Supported incident management and resolution, identifying root causes for data pipeline failures and implementing fixes to ensure system reliability.
• Managed data security and access permissions using AWS IAM, ensuring compliance with company policies and external regulations.
• Performed regular data integrity checks and optimized data retrieval in Athena, improving query performance.
• Supported the maintenance and enhancement of data integration systems built using AWS Glue, Lambda, and EC2, ensuring smooth day-to-day operations.
• Handled troubleshooting of data pipeline failures, provided timely resolution, and implemented improvements to reduce recurring issues.
• Conducted routine performance tuning of data processing jobs and optimized MySQL queries to enhance overall system responsiveness and resource efficiency.
Cloud Services: AWS S3, Glue, Lambda, EC2, Kinesis, SNS, IAM
Data Engineering: ETL Pipeline Optimization
Big Data Technologies: PySpark
Programming: Python
Database Management: MySQL, AWS Athena, Veeva Network
Monitoring & Automation: CloudWatch, Jenkins, Git
HCP and HCO Mastering with Veeva Network CRM
Managed and supported the mastering of Health Care Professionals (HCPs) and Health Care
Organizations (HCOs), ensuring accurate data relationships and compliance.
Real-Time Streaming Pipeline Optimization
Supported and optimized AWS Kinesis-based streaming pipelines, reducing data latency by 20%
and enhancing real-time analytics capabilities.
Centralized Data Lake Maintenance
Maintained and enhanced AWS S3 data lakes, improving data retrieval times by 30% through
optimized ETL workflows and query tuning.