Suman Singh

Big Data Engineer
Pune, MH

Summary

14+ years of experience in software development and data warehousing. Skilled across data engineering, SRE leadership, and project delivery. Proven ability to deliver successful projects and optimize processes for peak production efficiency.

Overview

15 years of professional experience

Work History

Data Engineering | Data Analytics Lead

Cognizant Technologies (Client: NSDL)
04.2023 - Current
  • Led the migration of RDBMS data into a Hadoop data lake, enabling efficient processing and analysis of massive datasets (2-2.5 TB daily).
  • Optimized data extraction and transformation pipelines using Spark (Python DataFrames and RDDs) for faster consumption by data visualization tools (e.g., Power BI).
  • Reduced processing time through performance optimization techniques in HDFS, Spark, and Impala.
  • Developed and implemented data processing pipelines to support ad-hoc business requests, fostering data-driven decision making.
  • Mentored and managed a Hadoop developer team, ensuring successful project delivery and fostering team growth.

Key Achievements:

  • Successfully designed and built a scalable data ingestion mechanism for the NSDL data lake, consolidating data from various sources into a unified Common Data Model.
  • Optimized data processing pipelines, resulting in significant improvements in data availability and time-to-insights for business users.
  • Established a DevOps culture within the team, leveraging tools like GitHub, Rundeck, and Ansible scripting to automate deployments and streamline workflows.

Site Reliability Engineering | SRE Lead

Schlumberger
03.2019 - 03.2023
  • Led the implementation of Site Reliability Engineering (SRE) practices to improve data platform uptime, reliability, and scalability.
  • Designed and developed monitoring dashboards (Power BI and Microsoft Data Studio) on GCP (Google BigQuery) to visualize key SLI metrics against defined SLOs, enabling proactive identification and resolution of potential issues.
  • Established and maintained coding best practices, including documented SOPs and knowledge bases in Confluence, ensuring code quality and maintainability.
  • Managed data flow automation (DataLake to DataPond) using tools like Informatica, Big Data, SQL, Unix, MiNiFi, NiFi, and Rundeck.
  • Championed knowledge sharing by introducing Confluence as a central knowledge repository.
  • Implemented a data platform monitoring and communication process to ensure timely issue identification and resolution.
  • Owned the definition and implementation of a RACI matrix to clearly define roles and responsibilities within the SRE team.

Data Engineering | Senior Developer

NTT DATA Services (Client: UBS)
08.2015 - 02.2019
  • Led a high-performing team of 9 data engineers responsible for building and maintaining data pipelines. This ensured timely and accurate data flow for the Prime Brokerage project.
  • Developed automation scripts (Shell) to streamline critical tasks including data extraction, performance reporting, and table space management. These scripts significantly reduced manual effort and improved overall data processing efficiency.
  • Built interactive dashboards (Power BI) to visualize daily rejection records, enabling stakeholders to quickly identify and address data quality issues.
  • Implemented an hourly status monitoring system through automated emails, providing regular visibility into data pipeline health and proactively alerting teams to potential problems.
  • Leveraged expertise in Hadoop, Shell scripting, and SQL to troubleshoot and resolve complex data loading and performance issues. This ensured data integrity and maintained optimal system performance.
  • Proactively identified opportunities for automation and developed custom Shell scripts to streamline data processing tasks and minimize ongoing support requirements. This focus on continuous improvement resulted in a more efficient and scalable data management environment.

Application Support & Project Delivery | Technical

Cognizant Technologies (Client: Credit Suisse)
09.2012 - 08.2015
  • Led a team of application support specialists in providing L2 support and technical guidance for the LCDB (Legal and Compliance Database).
  • Ensured application uptime through 24/7 support, addressing performance, resource, software, and application problems.
  • Implemented production changes, documented processes, and performed root cause analysis to improve application stability.

Application Support & Project Delivery | Software

IBM (Client: Vodafone)
12.2009 - 09.2012
  • Supported daily data loads for prepaid and post-paid systems, ensuring accurate and timely processing.
  • Developed shell scripts and PL/SQL queries on UNIX to meet specific reporting needs.
  • Developed a scalable batch framework to efficiently process partial Usage Detail Records (UDRs).

Education

Bachelor of Engineering - Computer Science

Bapurao Deshmukh College of Engineering
Wardha, India
04.2001 -

Skills

  • Data Processing: Hadoop/MapReduce, Spark (RDD/DataFrame/Python), Hive
  • Data Streaming: Sqoop, Apache Kafka
  • DevOps: GitHub, Rundeck (Commit/Build/Pipeline)
  • GCP Cloud: BigQuery, Logging and Monitoring, Storage Bucket, IAM
  • Scheduling: Control-M/Rundeck/Autosys
  • Data Preparation/Visualization: Power BI/Microsoft Data Studio
  • Methodology: SCRUM (Agile)
  • SQL Databases: Oracle, MySQL, MSSQL
  • NoSQL Databases: HBase
  • Domain: Banking and Finance
