Summary
Overview
Work History
Education
Skills
Certification
Technical Summary
Accomplishments
Timeline
Generic
Sandipan Ghosh

Sandipan Ghosh

Vice President
Bengaluru,KA

Summary

Over 20 years of diverse data engineering experience, including 8.5 years in Banking. Extensive hands-on expertise in ETL tools such as AB INITIO, Datastaged, Informatica, HADOOP, SPARK, Python, DB2, ORACLE, UNIX, and Shell scripting. Certified as a Hadoop developer (CCDH) and MAPR HBase developer (MCHBD). Proficient in data warehouse design using the Ab Initio framework. Skilled in team leadership, mentoring, and workshop facilitation. Deep expertise in the Hadoop ecosystem and column-family based Databases like HBase for delivering complex projects with geographically dispersed teams and establishing best practices for reliable solution design.

Overview

21
21
years of professional experience
2004
2004
years of post-secondary education
3
3
Certifications
2
2
Languages

Work History

VICE PRESIDENT Apps Development Sr Manager

Citi
04.2020 - Current
  • Manage CitiKYC ETL Framework Applications Development team to accomplish established goals as well as conduct personnel duties for team (e.g., performance evaluations, hiring and disciplinary actions)
  • Utilize in-depth knowledge and skills across multiple Applications Development areas to provide technical oversight across systems and applications
  • Review and analyse proposed technical solutions for projects
  • Contribute to formulation of strategies for applications development and other functional areas
  • Develop comprehensive knowledge of how areas of business integrate to accomplish business goals
  • Provide evaluative judgment based on analysis of factual data in complicated and unique situations
  • Impact the Applications Development area through monitoring delivery of end results, participate in budget management, and handling day-to-day staff management issues, including resource management and allocation of work within the team/project
  • Ensure essential procedures are followed and contribute to defining standards negotiating with external parties when necessary
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behaviour, conduct and business practices, and escalating, managing and reporting control issues with transparency, as well as effectively supervise the activity of others and create accountability with those who fail to maintain these standards.
  • Technical Role: Reviewing and refining business requirements and creating technical design specifications/approach documents for application enhancements and releases for ETL
  • ETL Ab initio code design, review, manage build, delivery.
  • Supporting DIT, SIT, UAT and Pre-Prod testing cycles, managing data loads.
  • Developing implementation and support documentation and providing SME support
  • Working in a global team environment, providing guidance, and training to development resources.
  • Engaging with other Citi technology teams globally as well as external vendors.
  • CTS Site lead role: Providing site leadership to CTS teams, coordination with HR, local site leads, recruitment, and talent management.
  • Liaise and align with CTS Pune and global for strategic planning in demand and delivery.

Managing Consultant

Exusia
03.2017 - 04.2020
  • Lead or participate in the definition, execution and communication of ETL strategy and technical solution
  • Development of data staging, architecture and consuming internal and external application interfaces
  • Created Hive schema, tables with partitioning and bucketing to optimize the utilization.
  • Importing and exporting data into HDFS from Oracle 10.2 database and vice versa using SQOOP
  • Developed data pipeline using Sqoop to ingest customer behavioral data in to HDFS for analysis
  • Cooperate with business and operational users, other data management staff and infrastructure teams to ensure adherence to client and company guidelines, restrictions and requirements as well as industry standards
  • Efficient assessment and resolution of complex technical design issues
  • Provide project team with leadership and guidance on defining, monitoring and managing business data quality and ETL operational metrics
  • Responsible for deploying technical methodologies for client projects, such as functional/technical requirements, design topics, code review and testing procedures
  • Frequent interaction with management and support teams on project status and priority setting
  • Research new solutions, tools and methodologies that may improve the overall operation model, process and/or the result of the project or projects at hand
  • Employ extreme attention to detail and flexibility to adapt to dynamic client environments and changing business, operations and technology priorities

ETL Team Lead, Architect

VISA
01.2011 - 03.2017
  • Document the major issues, solutions and create knowledge base documents
  • Have created HIVE managed and external tables and views as part of reporting by sqoop the data from RDBMS to Hadoop cluster in the form of AVRO file format
  • Identify manual intensive activities & Automate them to reduce effort either by consolidating the tasks & automate to have one job or automating things where it involves other teams
  • Impact analysis for all downstream if there is any data issue at upstream for any code issue & then guiding support team for data Fixes
  • Involving active discussion with product owners for billing enquiries & several other queries for mismatch of key fields & thereby finalizing code changes in result of discussion
  • Involved in estimating project budget, duration and resources.
  • Vendor management for both onsite and offshore site.
  • Conflict resolution between team members.
  • Proper work distribution and team building.
  • Ensuring project quality and SDLC followed.
  • Ensure smooth transition between development and operations.
  • Organize the team, ensure that goals and objects are perceived as attainable.
  • Accomplishments: Appreciated by senior management for AB Initio CPU saving work
  • Achieved consistent SLA after redesigning in AB Initio to use bulk updates for Operational Data Store, appreciated by Senior Director and was awarded 'Go Beyond'.
  • Reduced manual effort at maintenance time and also day end by 50-man hours after adding automated shutdown to continuous graphs, appreciated by direct manager and was awarded 'Go Beyond'.
  • Reduced manual effort by 100 hours in every install, after creating AB Initio graph to make automated validation using validation extension
  • Reduced manual cleaning of files at 11th hour to control inode and storage by identifying file patterns or users which are top contributors
  • Reduced specint cost after changing design of several graphs
  • Automated script for configuration-based rollback, saving several man hours every day
  • Created graph for making install impact analysis and thereby saving a lot of production failures and data issues
  • Created custom component for SFTP with retry saving a lot of production failures

ETL Team Lead/Onsite Coordinator

IBM
06.2004 - 12.2010
  • Project: IBM, AMEX, India (04/2010-12/2010) - Environment Informatica, Oracle, Shell scripting - Technical Team Lead managing resources in diverse locations - Responsible for Design, Development, & turning over to QA Informatica mappings. Providing KT to new team members to speed up them in project
  • Project: IBM, LLOYDS, U.K (10/2009-04/2010) - Environment DataStage, Shell scripting, IBM Information Analyzer - Lloyds Banking Group has acquired HBOS and has engaged in a project to complete data analysis and conduct Data Mapping Sessions for the Commercial applications. Creation of DataStage mappings for migrating data from legacy system to new data model. Creation of data profiling to understand the data content
  • Project: IBM, RTA (Road & Transport Authority), India (02/2009-09/2009) - Environment IBM Information Analyzer, PL/SQL, SQL Server Reporting Services - As a part of Enterprise Data Modeling initiative in RTA, IBM proposed a quick Data Quality Assessment of RTA’s primary source systems. Identification of core data elements from all source tables which might contain Data Quality Issues. Preparation of Data Quality Criteria Workbook with the business rules and Data Quality Assessment result and recommendations. Preparation of custom reports for the KPI (Key Performance Indicator) in SQL Server Reporting Services. Preparation of Stored Procedure in SQL SERVER to load data from KPI staging tables to fact tables to have the reporting done.
  • Project: IBM, VISA, Loyalty, USA (01/2007-12/2008) - Sandipan was involved in Loyalty side of VISA’s application system as an Onsite Lead in Foster City. Environment AB Initio, Shell scripting, Control M - Promoted as an SME to directly engage in meetings with business. Participate in architecture and design meetings. Preparation of the critical design documents. Designing of the AB Initio graphs based on the critical design.
  • Project: IBM, AOL, India (06/2004-12/2007) - Environment AB Initio, Shell scripting, Autosys - Preparation and review of High-Level Design (Mezzo), Technical Specification and Detailed Design Document, creation of AB Initio graph. Preparation as well as Reviewing Autosys script for job scheduling. Preparation of UTP, QAR and PSD.

Education

Master of Technology (MTech) - Information Technology

Indian Institute of Engineering Science and Technology

Skills

Skilled in writing complex shell scripts

Certification

Cloudera Certified Developer for Apache Hadoop (CCDH), 100-014-388

Technical Summary

Shell Scripting, HIVE, Spark, SQL, PL/SQL, ETL AB INITIO, GitHub, HBase, DB2, Oracle, Unix, Control Center, System Development Life cycle (SDLC), Agile, Scrum

Accomplishments

    Go Beyond in VISA on 11th August, 2015

    Sandipan played a significant role in redesign of Magellan application under CMLS which was very unstable and thus impacted the expectations of the customer. The application has a critical significance and is still growing in importance in recent times as it feeds data to a group of critical strategic applications. As a result, this effort has a great impact. He also ensured that the design is made simple enough which resulted in reducing the overall effort from the application supportability standpoint. Well done.

    Go Beyond in VISA on 21st June, 2013

    For your outstanding contribution and quick and accurate turn around in developing and implementing the automated catch up and parameter based bulk load process in ODS 10min cycles. This helped to reduce the catch up time significantly and helped to reduce the manual effort and associated risk to do catch up every day.

    Bravo Award in IBM for VISA Loyalty Project on July 31st, 2008

    Outstanding Contribution on VISA Account

    Bravo Award in IBM for America Online Project on Dec 23rd, 2005

    A consistent performer; single handedly managed and completed assignment; has shown maturity by accepting more responsibility and challenges…a great coordinator of social activities

Timeline

VICE PRESIDENT Apps Development Sr Manager

Citi
04.2020 - Current

Managing Consultant

Exusia
03.2017 - 04.2020

ETL Team Lead, Architect

VISA
01.2011 - 03.2017

ETL Team Lead/Onsite Coordinator

IBM
06.2004 - 12.2010

Master of Technology (MTech) - Information Technology

Indian Institute of Engineering Science and Technology
Sandipan GhoshVice President