

15+ years of experience in Data/Solution Architect in SPARK & Abinitio ETL Tool includes Administration, Analysis, Design, Development, and Implementation of data warehouse, ETL, client/server applications and Automation testing. Leading Data Delivery and Data Architecture solutions for large-scale financial operations. Architecting and implementing scalable ETL and Big Data solutions. Managing end-to-end data pipelines, real-time data processing, and data governance frameworks. Driving automation and optimization across ETL, data ingestion, and transformation processes. Collaborating with business stakeholders to define data-driven strategies and roadmaps. Built ETL pipelines with Java and Apache Spark running on AWS EMR, storing data in AWS S3 and RDS Aurora. Experience in Big Data Stack, Data Lake, and Database Design, DWH, Data Security, Data Lineage, and Data Modeling. Experience in Abinitio software components including GDE, Co>Op, EME, CC, Express>It and Query>It. Expertise in designing and developing "ETL DATA PIPELINES" in Hadoop Ecosystems. Building ETL common components in SPARK and Abinitio (Read, Write, TDQ, SK, CDC, DBLOAD and UNLOAD of Teradata and Oracle). Having working experience in Talend Tool and building ETL components with Java JET. Expertise in ingestion of structured, semi structured and unstructured data. Implemented an end-to-end big data pipeline for real-time processing, including reading data, preprocessing, TDQ, transformations, and loading into HDFS, HIVE, and DB. Designing, creating, testing and maintaining the complete data management & processing systems. Introducing new data management tools & technologies into the existing system to make it more efficient. Working Experience of Hadoop ecosystem and different frameworks inside it – SPARK, HIVE, HDFS, YARN, Kafka, and CASSANDRA. Working Experience of designing and developing micro services and APIs with Spring Boot and JPA. Real-time processing Framework (Apache Spark with java and Scala).
Java , Micro services with Sprintboot
Spark (Pyspark/Java/Scala)
SQL
Shell Scripting
Hadoop
CASSANDRA
Abinitio
Talend
AWS