Summary
Overview
Work History
Education
Skills
Certification
Awards and Recognitions
Timeline
Generic

Ayush Gupta

Gurgaon

Summary

  • A Data Engineer professional with a mix of technical and business skills with over 7 years of experience in data warehousing using big data technologies, sound knowledge of ETL process, building data engineering solutions to Fortune 500 Global Pharmaceutical clients.
  • Specialized in AWS Cloud platform and Solution Architecture for Big Data. Experience of setting up AWS EMR, EC2, S3, Redshift, Lambda, VPC services.
  • Good hand in Pyspark and SQL for data manipulation.
  • Experience in Agile/Scrum and waterfall methodology. Experience in working with bitbucket, confluence , Jira tools. Actively involved in both development and operations parts of projects.
  • Worked on solving Hadoop, HDFS and Linux server issues single handedly. Tuned the services like Tez and Spark for optimized performance.
  • Currently working with Axtria as Manager assuming role of Data Engineer and Architect.

Overview

8
8
years of professional experience
1
1
Certification

Work History

Manager

Axtria India Pvt. Ltd.
06.2017 - Current

Projects

Medical Affairs

  • Identified medical unmet needs and patient cohort for three disease areas based on business rules and generated patient physician mapping to be targeted by field reps.
  • Used Kedro and pyspark to develop the data engineering pipelines.
  • Managed the project in both development and support phases.

APLD Processed Claims

  • Processed raw claims data in-house (on client environment) replacing processed data purchase from IQVIA.
  • Offered flexibility of customization, speed, efficiency, accuracy and pursued cost savings of $3M annually.
  • Processed claims data consumed by field sales team to identify patient activity, physician demographic details and treated patient claims.
  • Used AWS Glue, Lambda, Athena, S3 and Pyspark to build the solution.
  • Managed the engagement starting from requirement gathering to development and support.

AWS Architect

  • AWS infrastructure setup and support role using services like Redshift, EMR, IAM, Lambda, RDS, EC2, S3 and Cloudwatch.
  • Formulated an extensive cost optimization plan and successfully reduced the infrastructure cost by $800 per month.

JET Wave 2 Development

  • JET is a data science model designed to enhance the experience of sales rep and healthcare professional's partnership.
  • The model generates personalized suggestions for the sales rep to target the HCP based on various attributes such as specialty to make HCPs aware of the latest products that will eventually increase revenue.
  • Responsible as a data engineer lead to communicate with different stakeholders to gather requirements, design and develop the pipelines for data engineering processes using Python, Pyspark and dataiku.

JET Operations

  • Delivered the call and email suggestions generated using machine learning models based on the historical connects with the physicians and the resultant sales.
  • Represented the team in the client meetings for regular deliverables, issues and enhancements.
  • Created different features for the Health Care Professionals based on different attributes.
  • Used Pyspark, Dataiku, AWS S3 and Redshift to meet the objective.

Omnichannel Development

  • Developed a common platform 'Omnichannel' for marketing and sales data for pharmaceutical client to get the trends and insights on a single platform.
  • Gathered requirements and business rules from respective data scientists and created triggers for sales reps.
  • AWS Redshift, Hive, Dataiku and Python were used in the development.

Hadoop and Linux Server Administration (AWS)

  • Worked as Hadoop Administrator in data warehousing project. Created and maintained more than 10 AWS EMR clusters with up to 120 nodes.
  • Optimized AWS Infrastructure used in the project with Auto-scaling and HDFS, Yarn, Tez and Spark tuning significantly by almost 38%.

Dataiku Infrastructure Implementation and Support

  • Created infrastructure platform for Data Science and Engineering projects to be developed on Dataiku.
  • Docker containers have been deployed to offload the python jobs on dedicated machines and keep the environments uniform on each container.
  • EMR clusters have been introduced to cater Pyspark jobs.
  • Involved users and access control management for current users and new onboarding.

Talend and EMR Upgrade

  • Worked on upgrading Talend ETL tool and AWS EMR, consisting of Hive & Spark Technologies.
  • Solved various compatibility issues like snappy compression version conflict, enforced removal of redundancy in Tez framework.

Data Warehouse Design and Development

  • Developed a Hadoop based data warehouse using the ETL tool Talend, Pyspark and Hive query language.
  • Migrated the traditional “Oracle” based data warehouse for a US pharma major client.

Education

B.Tech - Computer Science And Engineering

Maulana Azad NIT Bhopal
Bhopal, India
06-2017

Senior Secondary -

Board of Secondary Education Rajasthan
Kota
05-2012

Skills

  • AWS services: EMR, EC2, S3, Redshift, Lambda, VPC etc
  • Data Warehousing, MLOps and Data Engineering
  • ETL tools such as Talend
  • Hadoop, Hive, Pyspark, SQL, Shell scripting, Python, Kedro

Certification

  • Dataiku Advanced Designer (Dec 2021 – Dec 2023)
  • Dataiku Core Designer (Jan 2021 – Jan 2023)
  • Business Analytics Program from S P Jain
  • AWS Developer Associate, AWS - (Sept 2018 - Dec 2021)
  • PERFECT: Manager Excellence Program at Axtria

Awards and Recognitions

  • Received Bravo Award from Axtria Inc. for the excellence in project tasks
  • Served as vice president of Organizing Committee of "Tooryanaad" India’s Largest Hindi Literary Fest
  • Received Governor Award from Hon. Pratibha Patil for social works at Bharat Scout Guide
  • Secured 99.9 percentile in 12th Rajasthan board exam in 2012
  • Secured All India Rank 8934 in JEE-Mains 2013

Timeline

Manager

Axtria India Pvt. Ltd.
06.2017 - Current
  • Dataiku Advanced Designer (Dec 2021 – Dec 2023)
  • Dataiku Core Designer (Jan 2021 – Jan 2023)
  • Business Analytics Program from S P Jain
  • AWS Developer Associate, AWS - (Sept 2018 - Dec 2021)
  • PERFECT: Manager Excellence Program at Axtria

B.Tech - Computer Science And Engineering

Maulana Azad NIT Bhopal

Senior Secondary -

Board of Secondary Education Rajasthan
Ayush Gupta