Summary
Overview
Work History
Education
Skills
Projects
Domain Specific Knowledge
Technical Skills
Certification
Languages
Timeline
Generic
TUSHAR SRIVASTAVA

TUSHAR SRIVASTAVA

Bengaluru

Summary

Accomplished Data Engineer with extensive experience at Titan Company Limited, specializing in the design and optimization of ETL pipelines. Proven proficiency in SQL, Python, and data analysis, resulting in a 15% increase in customer retention through actionable insights. Skilled in stakeholder management, delivering scalable solutions that enhance business success and operational efficiency.

Overview

4
4
years of professional experience
1
1
Certification

Work History

Deputy Manager (Data Engineer)

Titan Company Limited
Bengaluru
05.2024 - Current
  • Designed, built, and optimized 100+ ETL pipelines to process large-scale data efficiently, handling over 10 million records daily, with improved throughput by 40%.
  • Ingested real-time transactional data from POS systems, enabling timely insights into sales, inventory, and customer behavior.
  • Conducted data analysis to uncover trends, customer behaviors, and actionable business insights, contributing to a 15% increase in customer retention.
  • Partnered with business stakeholders to translate requirements into scalable, data-driven solutions that supported key decision-making processes.
  • Developed interactive dashboards and reports in Tableau, delivering near real-time KPIs to business users, and reducing data-to-decision time by 50%.
  • Automated data workflows improve operational efficiency and reduce manual reporting time by 30%.
  • Leveraged machine learning models to automate ETL and data modeling tasks, improving data classification accuracy by 20%, and reducing development time.

Senior Engineer (Data Engineer)

Titan Company Limited
Bengaluru
08.2021 - 04.2024
  • Developed and maintained scalable data pipelines for efficient data processing and storage.
  • Developed 150+ ETL pipelines across domains such as Retail, CRM, and Merchandising.
  • Designed and optimized database models to improve query performance and data accessibility.
  • Worked closely with business analysts to define data requirements, ensuring data integrity.
  • Implemented ETL solutions to extract, transform, and load data from multiple sources.
  • Created technical documentation and conducted knowledge-sharing sessions for the team.

Education

Bachelor of Technology - Computer Science and Engineering

SRM Institute of Science And Technology
Delhi NCR
07.2021

High School - ICSE Board

St. Dominic Savio College
Lucknow, UP
05.2017

Skills

  • SQL
  • Python
  • PySpark
  • Apache Airflow
  • AWS Glue
  • AWS Lambda
  • IBM DataStage
  • Amazon Redshift
  • MS SQL Server
  • Data modeling
  • Tableau
  • Excel
  • Descriptive and exploratory data analysis
  • Performance tuning
  • Pandas
  • NumPy
  • Regression analysis
  • NLP
  • AWS CodeCommit
  • GitHub
  • AWS CodePipeline
  • Requirement gathering
  • Data storytelling
  • Stakeholder management
  • Data analysis
  • ETL development
  • Process automation
  • Problem solving

Projects

International Business Data Warehouse, Led the end-to-end implementation of a robust data warehouse for an international business, enabling seamless data integration, advanced analytics, and informed decision-making., Collaborated with stakeholders from various business divisions to gather requirements and understand business needs, ensuring the data warehouse design aligned with organizational goals., Developed efficient ETL processes using tools like AWS Glue and IBM DataStage to extract data from multiple sources, transform it to meet analytical needs, and load it into the data warehouse (Redshift)., Integrated data from various systems including POS, CRM, ERP, Marketing, and external market data, ensuring data consistency and accuracy across the business., Designed a scalable and flexible data warehouse and datalake architecture using industry-leading technologies (Amazon Redshift and AWS S3) to support diverse data sources, high query performance and cost effectiveness., Implemented optimization techniques such as indexing, partitioning, and caching to ensure high performance and quick access to critical business data., Established data governance practices including data cataloging, metadata management, and role-based access control to ensure data quality, security, and compliance across global business units., Implemented encryption protocols for sensitive customer data (e.g., mobile numbers and addresses) across the U.S., Singapore, and GCC regions, ensuring compliance with major data protection laws including CCPA/CPRA (U.S.), PDPA (Singapore), and regional GCC privacy frameworks (e.g., UAE PDPL), thereby strengthening data governance and minimizing privacy risk., Developed comprehensive documentation and conducted training sessions for end-users to maximize the adoption and effective use of the new data warehouse. AI-Driven Review Insights Engine, Built an OpenAI-powered NLP pipeline to process Google My Business and other review data, extracting key features, sentiment, and recurring issues., Improved insights accuracy by 30% and reduced manual analysis time by 70%, enabling faster business decision-making. Implementation of CI/CD for ETL Jobs, Contributed significantly to the implementation of Continuous Integration and Continuous Deployment (CI/CD) pipelines for AWS Glue and IBM DataStage., Collaborated with cross-functional teams to identify current challenges and define requirements for the CI/CD pipeline., Acted as a single point of contact (SPOC) to integrate AWS tools such as AWS CodePipeline, AWS CodeBuild, and AWS CodeDeploy to automate build, test and deployment processes for AWS Glue and IBM DataStage., Leveraged AWS CodeCommit for version control, ensuring efficient management of code changes, collaboration, and tracking of development progress., Conducted training sessions and created detailed documentation to ensure the development and operations teams were proficient in using the new CI/CD processes. Metadata & DDL Automation with Schema Validation, Automated metadata and DDL generation for database tables, incorporating an auto-validation mechanism to ensure schema consistency and prevent pipeline failures.

Domain Specific Knowledge

  • Proficient in designing efficient, scalable data schemas using dimensional and normalized modeling techniques.
  • Familiar with data governance frameworks to ensure data quality, lineage, security, and regulatory compliance.
  • Foundational understanding of machine learning algorithms and their application in data pipelines and production systems.
  • Strong domain knowledge in domestic and international business operations, particularly in Merchandising, Marketing, Retail, and CRM processes.

Technical Skills

SQL, Python, Apache Spark- PySpark, Apache Airflow, AWS Glue, AWS Lambda, Amazon API Gateway, IBM DataStage, Tableau, Excel, Descriptive & Exploratory Data Analysis, A/B Testing, Amazon Redshift, MS SQL Server, Amazon DynamoDB, Data Modeling, Performance Tuning, Pandas, NumPy, Regression Analysis, Natural Language Processing, AWS CodeCommit, GitHub, AWS CodePipeline, Requirement Gathering, Data Storytelling, Stakeholder Management, Translating business requirements into data solutions

Certification

  • AWS Educate Machine Learning Foundations
  • Databricks Fundamentals
  • Databricks Generative AI Fundamentals

Languages

Hindi
First Language
English
Proficient (C2)
C2

Timeline

Deputy Manager (Data Engineer)

Titan Company Limited
05.2024 - Current

Senior Engineer (Data Engineer)

Titan Company Limited
08.2021 - 04.2024

Bachelor of Technology - Computer Science and Engineering

SRM Institute of Science And Technology

High School - ICSE Board

St. Dominic Savio College
TUSHAR SRIVASTAVA