Lead data engineer with 11+ years of experience in developing, implementing and optimizing data plumbing systems and ETL processes. Leading a team of developers to implement data-driven solutions, resulting in an increase in data accessibility and accuracy. Collaborated with business stakeholders, data scientists, and engineers to develop data pipelines, resulting in improved business-critical decision making.
Description:
Consumer Gold Layer (Data Mart) is an application developed to track consumers subscribed to magazines or journals, along with marketing permission consent, by integrating two different consumer data collection platforms: CDS (the legacy platform by Experian) and Martech (the in-house platform by Chargebee), for all the brands of Conde Nast. It is one of the major applications that contribute to the revenue of Conde Nast.
Tech Stack: Data Build Tool (DBT), Databricks, AWS S3, Astronomer, Github.
Description:
The EDH/US-SHARP is the application system developed to support Headquarters Commercial Analytics and Field Sales Reporting tool in AstraZeneca North America. The current reporting solution US SHARP leverages Aws technologies as the technical platform.
Contribution:
Working with Big data applications deployed on AWS (Amazon Web Services) that provide a cloud platform to build and maintain cost & time-efficient infrastructures .
AWS services include EMR, EC2 cluster instances, S3 buckets for storage, Redshift Database, Athena for querying, EMR to manage ETL processes using PySpark, Glue Crawler, IAM roles & policies, Airflow as an orchestration tool.
BAU activities include PySpark and SQL querying, resolving user issues, data validation, meeting SLA targets, and resilient communications with vendors and the Leadership Team.
Tech Stack: AWS EMR, Redshift, Apache Spark, Apache Airflow, Apache Hive, AWS Athena, Bitbucket.
Description:
The Cornerstone Reporting & Analytics environment is the current reporting system developed to support Headquarters Commercial Analytics and Field Sales Reporting Tool in AstraZeneca. Cornerstone reporting environment utilizes Micro strategy Business Intelligence Tool as the platform for delivering the needed data. The current reporting solution leverages Netezza and Informatica as
the technical platform.
Contribution:
Build and maintain data warehouse applications using Informatica, Netezza and Oracle. Implement continuous improvements as per the business needs.
Conduct peer reviews of the code developed. Prepared unit test cases, system test case for enhancements request. Facilitate UAT sessions with users.
Tech Stack: Informatica Power Center, Oracle, Unix shell scripting
Credential ID - 7KPEWPMD1NFE1WWS
Credential URL - http://aws.amazon.com/verification
Credential ID - 61141765
Credential URL - https://credentials.databricks.com/16d64ba0-6da2-43b1-84e3-ae356da70688
Credential ID - fc8e3d1f-4888-4cf4-b3d5-e10e3d1236d1
Credential URL - https://credentials.getdbt.com/fc8e3d1f-4888-4cf4-b3d5-e10e3d1236d1