Summary

Overview

Work History

Education

Skills

Timeline

Ashish Sharma

Summary

Data Engineer with 6.5 years of experience in building data intensive applications. Insightful Senior Data Engineer known for high productivity and efficiency in task completion. Possess specialized skills in ETL development, and cloud computing solutions. Excel in problem-solving, teamwork, and communication, ensuring successful project outcomes and effective collaboration with cross-functional teams.

Overview

years of professional experience

Work History

Senior Data Engineer

Decision Culture

08.2022 - Current

Client- Home Depot-Company Overview: It is the largest home improvement retailer in the United States
Worked extensively with property datasets, implementing incremental data refresh strategies in BigQuery based on composite key columns (e.g., CLIP, tax_year, assessed_year) to ensure only updated records are processed.
Refactored Jupyter Notebook workflows into Airflow DAGs using Cloud Composer, enabling modular pipeline orchestration and better dependency management across GCP services.
Utilized BigQuery and Cloud Functions to analyze user behavior from email contact history datasets, identifying device patterns during website logins.
Implemented cost-efficient backup solutions by converting frequently accessed tables to Parquet format, leveraging Cloud Dataflow and Cloud Storage (GCS) for scalable and compressed storage.
Set up proactive monitoring using Stackdriver (now Cloud Monitoring) to detect failures in daily/weekly pipelines; used Pub/Sub for alerting and Cloud Logging for root cause analysis and fast issue resolution.
Designed and built Looker Studio dashboards pulling data from BigQuery to visualize pipeline runtimes and SLA adherence, improving transparency for stakeholders.
Maintained CI/CD pipelines using Cloud Build and Cloud Source Repositories, streamlining deployments for development and production environments.
Authored and maintained technical documentation in Confluence and GCS buckets, ensuring all GCP workflow steps and dependencies were traceable and auditable.
Designed scalable ELT pipelines in Dataform, integrating with BigQuery to modularize SQL transformations and enable version control via Git.
Led migration of Quality Control (QC) processes into Dataplex, enabling centralized data governance and automated metadata classification.
Received three Caught Orange Hand awards in recognition of cross-functional collaboration and cloud-first innovations.

Software Development Engineer I

Precisely Software and Data India

Noida

04.2021 - 08.2022

Company Overview: Precisely rebranded from Syncsort Incorporated in May 2020, a leader in data integrity, where I contributed to big data integration, quality assurance, and high-performance data pipelines across AWS and Snowflake platforms.
Independently managed the production data ecosystem hosted on AWS (S3, Lambda, EC2, and Glue), performing extensive standardization across ingestion, transformation, and reporting pipelines.
Achieved 95%+ automation by redesigning previously manual tasks using a combination of AWS Step Functions, Lambda functions, and scheduled Glue jobs, significantly reducing operational overhead.
Reduced AWS costs by 15% through resource optimization—consolidating redundant EC2 jobs, introducing lifecycle policies in S3, and improving Glue job configurations for efficient data processing.
Performed efficient data scraping and mining using Python's BeautifulSoup.
Received 5 spot awards for innovation, performance tuning, and proactive ownership in critical data initiatives.

Associate Software Engineer

Exertia Consultancy

Noida

12.2019 - 03.2021

Company Overview: Pitney Bowes Software business has been acquired by Syncsort and formed precisely
Worked on ETL pipeline using Redshift, AWS.
Built and managed data pipelines on AWS-hosted Databricks to process data using PySpark.
Tuned Spark jobs for optimal performance on large datasets
Created data quality checks and alerts within Databricks workflows, integrating with CloudWatch for monitoring and logging.

Associate Software Engineer

Tata Consultancy Services

Chennai

01.2019 - 12.2019

Company Overview: Tata Consultancy Services is an Indian multinational information technology services and consulting company.
Independently managed production workloads in an on-premise Hadoop ecosystem, ensuring data standardization across ingest, transform, and serve layers using tools like Hive, Sqoop, and Shell scripts.
Used Python (with BeautifulSoup) to scrape external data sources, then structured and bulk-loaded the data into HDFS and Hive tables for downstream analytics.
Worked with HBase to store semi-structured data requiring low-latency lookups and integrated it with Hive for batch analytics.

Education

B.Tech - Information Technology

Guru Gobind Singh Indraprastha University

Delhi, India

06-2018

Skills

Python
SQL
PySpark
Airflow
GCP
Hive
AWS
Snowflake
GIT
AZURE
Bigquery
S3
GCS

Looker Studio
Dataform
Athena
Glue
AMI
EBS
Redshift
IAM
Composer
Dataplex
ADLS
Databricks
Streaming data processing

Timeline

Senior Data Engineer

Decision Culture

08.2022 - Current

Software Development Engineer I

Precisely Software and Data India

04.2021 - 08.2022

Associate Software Engineer

Exertia Consultancy

12.2019 - 03.2021

Associate Software Engineer

Tata Consultancy Services

01.2019 - 12.2019

B.Tech - Information Technology

Guru Gobind Singh Indraprastha University

Ashish Sharma

Summary

Overview

Work History

Senior Data Engineer

Software Development Engineer I

Associate Software Engineer

Associate Software Engineer

Education

B.Tech - Information Technology

Skills

Timeline

Senior Data Engineer

Software Development Engineer I

Associate Software Engineer

Associate Software Engineer

B.Tech - Information Technology

Similar Profiles

Rama Krishna PallerlaRama Krishna Pallerla

Narasimha AdhikarlaNarasimha Adhikarla

Raj DebRaj Deb

Sandeep KumarSandeep Kumar

Chaitanya AnnediChaitanya Annedi