Subash Roshan

Summary

Highly skilled Data Engineer with over 2 years of experience in designing and optimizing data pipelines while collaborating with cross-functional teams to deliver streamlined data solutions and business insights. Proficient in SQL, Python, Hadoop, Hive, and Microsoft Azure services (Data Factory, Databricks, Synapse Analytics) with expertise in data integration, cleaning, processing, and real-time analytics. Currently engaged with Axis Bank as a Manager in the Business Intelligence Unit (BIU).

Overview

2 years of professional experience

Work History

Manager - Data Engineer, BIU

Axis Bank
Mumbai
04.2024 - Current
  • Continuously improved the existing KRA (Key Result Area) and incentive program data pipeline by implementing logic enhancements and optimizing existing SQL code, and fulfilled ad-hoc data requests for business insights by creating customized datasets in the Oracle SQL database.
  • Designed and built automated data pipelines in Informatica for tracking Premium Segment Relationship Managers (RMs) and Customer Daily Connect, which will be sent as a daily push to RMs' calendars to enhance the customer experience delivered by the RMs.
  • Implemented control tables and set up automated data quality checks on multiple datasets owned by the team to ensure high data quality and timely data availability for stakeholders.
  • Serve as Scrum Master, using Jira to manage sprints, track progress, provide guidance, and remove obstacles in order to achieve project goals and improve individual and team performance.
  • Communicate regularly with stakeholders to ensure transparency on sprint progress, priorities, and potential challenges.

Deputy Manager - Data Engineer, BIU

Axis Bank
Mumbai
07.2022 - 03.2024
  • Designed and built automated data pipelines in Informatica for the KRA and quarterly Incentive Program of Relationship Managers in the Premium Segment of the Retail Liabilities vertical; under the program, a percentage of the revenue generated by each Relationship Manager is paid as variable pay.
  • Worked closely with cross-functional teams, including ABR (Axis Business Reporting) and the Business team, to address several data quality issues, identifying the root causes and implementing fixes that resolved them.
  • Developed and automated the migration of customer-level masked datasets from Oracle Database to Azure Data Lake using Informatica Cloud Service for integration with Power BI dashboards.

Education

Bachelor of Technology - Computer Science And Engineering

National Institute of Technology
Trichy, India
07-2023

Skills

  • Programming Languages: SQL, Python, PySpark, Shell Scripting
  • Big Data Technologies: Hadoop, Spark, Hive
  • Tools: Microsoft Azure (Data Factory, Databricks, Synapse Analytics, Data Lake, Event Hubs, Microsoft Fabric), Informatica, Power BI
  • Version Control: Git, Bitbucket

Projects

Credit Risk Mitigation Using Apache Spark

  • Designed and implemented a distributed data processing pipeline using Apache Spark, processing a 1.7GB CSV file stored in HDFS.
  • Conducted comprehensive data cleaning and processing to remove bad data (e.g., duplicate member IDs, missing records), ensuring high-quality datasets and storing the final cleaned data back in HDFS for reliable access.
  • Facilitated multi-team collaboration by creating external tables for secure SQL-based access and a dynamic, auto-refreshing view to provide consolidated, up-to-date loan scores every 24 hours.

End-to-End Azure Data Engineering Pipeline

  • Built an end-to-end pipeline that ingests tables from an on-premises SQL Server database using Azure Data Factory and stores the raw data in Azure Data Lake Storage (ADLS) Gen2.
  • Transformed the raw data using Azure Databricks, loaded the cleaned data into Azure Synapse Analytics, and integrated it with Microsoft Power BI to build an interactive dashboard.
  • Used Azure Active Directory (AAD) and Azure Key Vault for identity management, secrets management, and governance.

End-to-End Real-Time Streaming Azure Data Engineering

  • Built an end-to-end real-time streaming pipeline that ingests data from a weather report API using Azure Data Factory and Azure Functions.
  • Used Event Hubs for streaming and Microsoft Fabric for real-time intelligence, loaded the data into Kusto DB, and created an interactive Power BI dashboard.
  • Set up real-time alerts for extreme weather using Data Activator.

Technical Course

  • Course Name: Trendy Tech's Ultimate Big Data Masters Program (Cloud Focused) [6 months]
  • Topics: Hadoop, MapReduce, Hive, Apache Spark, Azure Data Factory, Azure Data Lake, Azure Databricks, Azure Synapse, Git, and CI/CD
