Ashish Sharma

Summary

Senior Data Engineer with 6.5 years of experience building data-intensive applications, known for high productivity and efficient task completion. Specialized in ETL development and cloud computing solutions. Strong in problem-solving, teamwork, and communication, collaborating effectively with cross-functional teams to deliver successful project outcomes.

Overview

6 years of professional experience

Work History

Senior Data Engineer

Decision Culture
08.2022 - Current
  • Client: Home Depot. Company Overview: The largest home improvement retailer in the United States.
  • Worked extensively with property datasets, implementing incremental data refresh strategies in BigQuery based on composite key columns (e.g., CLIP, tax_year, assessed_year) to ensure only updated records are processed.
  • Refactored Jupyter Notebook workflows into Airflow DAGs using Cloud Composer, enabling modular pipeline orchestration and better dependency management across GCP services.
  • Utilized BigQuery and Cloud Functions to analyze user behavior from email contact history datasets, identifying device patterns during website logins.
  • Implemented cost-efficient backup solutions by converting frequently accessed tables to Parquet format, leveraging Cloud Dataflow and Cloud Storage (GCS) for scalable and compressed storage.
  • Set up proactive monitoring using Stackdriver (now Cloud Monitoring) to detect failures in daily/weekly pipelines; used Pub/Sub for alerting and Cloud Logging for root cause analysis and fast issue resolution.
  • Designed and built Looker Studio dashboards pulling data from BigQuery to visualize pipeline runtimes and SLA adherence, improving transparency for stakeholders.
  • Maintained CI/CD pipelines using Cloud Build and Cloud Source Repositories, streamlining deployments for development and production environments.
  • Authored and maintained technical documentation in Confluence and GCS buckets, ensuring all GCP workflow steps and dependencies were traceable and auditable.
  • Designed scalable ELT pipelines in Dataform, integrating with BigQuery to modularize SQL transformations and enable version control via Git.
  • Led migration of Quality Control (QC) processes into Dataplex, enabling centralized data governance and automated metadata classification.
  • Received three Caught Orange Hand awards in recognition of cross-functional collaboration and cloud-first innovations.

Software Development Engineer I

Precisely Software and Data India
Noida
04.2021 - 08.2022
  • Company Overview: Precisely, a leader in data integrity, rebranded from Syncsort Incorporated in May 2020. Contributed to big data integration, quality assurance, and high-performance data pipelines across AWS and Snowflake platforms.
  • Independently managed the production data ecosystem hosted on AWS (S3, Lambda, EC2, and Glue), performing extensive standardization across ingestion, transformation, and reporting pipelines.
  • Achieved 95%+ automation by redesigning previously manual tasks using a combination of AWS Step Functions, Lambda functions, and scheduled Glue jobs, significantly reducing operational overhead.
  • Reduced AWS costs by 15% through resource optimization: consolidating redundant EC2 jobs, introducing S3 lifecycle policies, and improving Glue job configurations for efficient data processing.
  • Performed efficient data scraping and mining using Python's BeautifulSoup.
  • Received 5 spot awards for innovation, performance tuning, and proactive ownership in critical data initiatives.

Associate Software Engineer

Exertia Consultancy
Noida
12.2019 - 03.2021
  • Company Overview: The Pitney Bowes software business was acquired by Syncsort, which subsequently became Precisely.
  • Worked on ETL pipelines using Amazon Redshift on AWS.
  • Built and managed data pipelines on AWS-hosted Databricks to process data using PySpark.
  • Tuned Spark jobs for optimal performance on large datasets.
  • Created data quality checks and alerts within Databricks workflows, integrating with CloudWatch for monitoring and logging.

Associate Software Engineer

Tata Consultancy Services
Chennai
01.2019 - 12.2019
  • Company Overview: Tata Consultancy Services is an Indian multinational information technology services and consulting company.
  • Independently managed production workloads in an on-premise Hadoop ecosystem, ensuring data standardization across ingest, transform, and serve layers using tools like Hive, Sqoop, and Shell scripts.
  • Used Python (with BeautifulSoup) to scrape external data sources, then structured and bulk-loaded the data into HDFS and Hive tables for downstream analytics.
  • Worked with HBase to store semi-structured data requiring low-latency lookups and integrated it with Hive for batch analytics.

Education

B.Tech - Information Technology

Guru Gobind Singh Indraprastha University
Delhi, India
06.2018

Skills

  • Python
  • SQL
  • PySpark
  • Airflow
  • GCP
  • Hive
  • AWS
  • Snowflake
  • Git
  • Azure
  • BigQuery
  • S3
  • GCS
  • Looker Studio
  • Dataform
  • Athena
  • Glue
  • AMI
  • EBS
  • Redshift
  • IAM
  • Composer
  • Dataplex
  • ADLS
  • Databricks
  • Streaming data processing

Timeline

Senior Data Engineer

Decision Culture
08.2022 - Current

Software Development Engineer I

Precisely Software and Data India
04.2021 - 08.2022

Associate Software Engineer

Exertia Consultancy
12.2019 - 03.2021

Associate Software Engineer

Tata Consultancy Services
01.2019 - 12.2019

B.Tech - Information Technology

Guru Gobind Singh Indraprastha University