~ Having 12 years of experience in the analysis, Design, Development, Project delivery and Implementation of business applications/requirements with good knowledge in the Data Engineering with On-prem Hadoop eco system and Cloud infra with Domain exposure in Banking and Health Care .
~ Big Data-Spark developer experience of 8+ years with Spark - (Scala and Python), Hadoop Eco Systems- (Hive, HDFS, HBase) and AWS Cloud-(S3, Athena, EMR) along with Cloudera.
~ Understands the complex processing needs of Big data and have experience developing codes and modules in Spark to address those needs by performing Data Analytics.
~ Developed Apache Spark jobs using Scala for faster data processing and used Spark-SQL for querying with good exposure in Spark-Core, Spark RDDs, DataFrames, Spark-SQL and Spark-Deployment.
~ Involved in the project timeline and estimations planning to deliver the features and worked in Agile-Scrum model of software life cycle.
~ Working on the business requirements and features to add new functionalities as well as enhancements in the existing application using mainly Spark, Scala and Hive.
~ Worked on the Spark Upgrade - existing application from Spark 1.x to 2.4 and Cloudera from 5.x to 6.x
~ Involved in the all phases from requirement analysis till post production support of the business requirements and tech items.
~ Working closely with analysts, peers, QAs and upstream-downstream SPOCs for the end-to-end hassle-free deliveries and application troubleshooting.
~ Worked with the team to deliver components using agile software development principles.
~ Started working on hybrid migration of application from Cloudera to AWS
Aetna – Healthcare Service Provider
~ Worked on design and development of Hadoop rewrite of existing Oracle based data hub (DataMart) with Apache Spark and Performed one-time data migration from Oracle Database to Hadoop cluster .
~ Worked on the processing and loading the feed files into Hive tables.
~ Worked upon processing of input data and generating required report data for the business using Spark-Scala and Spark-SQL
~ Participated in requirement gathering of the project in documenting the business requirements.
~ Involved in code development and support till prod deployment for new functionalities and enhancements to the application.
~ Actively involved in troubleshooting in all the phases starting from SIT till Production bugs.
Care First Healthcare Service Provider
~ I have involved in Requirement gathering, analyzing the data model, preparing Technical design document for ETL interfaces, Designing and Development of Mappings using Informatica Power Center Designer Tool as well as involved in Application End to End ETL Testing phase. He has handled all customers raised tickets and also provided ETL monitoring support, solving logical problems & real-time business scenarios. He has demonstrated leadership skills in managing teams and uses analytical and technical skills to help clients find solutions to strategic problems and enable more effective information handling.
~ Developing Informatica mappings/sessions/workflows as per the low level ETL Design Documents and unit testing them. Involved in developing UNIX shell scripts to perform additional ETL functionality such as wrapper scripts, CDC process using automated daily update of Informatica parameter files etc. Also involved in setting dependency for the Collections Release Implementation.
~ Prepared Unit Test Cases and Handled Unit Testing and System Integration Testing and Support during User Acceptance Testing.