Dynamic software professional with over a decade of experience in the IT sector, specializing in Big Data and Analytics, with a strong foundation in Data Warehousing technologies. Expertise in the Hadoop Ecosystem, including proficiency in tools such as MapReduce, HDFS, Apache Spark, and Apache Hive, to deliver impactful data-driven solutions. Proven success in leveraging Quantexa tools for ETL processes and scenario development to effectively detect fraudulent activities within trading and trade banking contexts. Recognized for optimizing queries through partitioning and bucketing while creating robust Apache NiFi pipelines for seamless data ingestion and management across various RDBMS platforms, with a commitment to fostering team collaboration and achieving high-quality results.
Project: Trade Fraud Risk Management
Description: Trade Fraud Risk Management (TFRM) aims at reducing the Trade frauds by creating client specific Fraud & Anti-Money Laundering (AML) scenarios by leveraging big data from multiple sources.
Technologies: Quantexa, Apache Spark, Scala, Python, HDFS, Hive, Kibana and Control-M
Roles and Responsibilities:
Project: Trade Fraud Risk Management
Client: Standard Chartered GBS
Description: Trade Fraud Risk Management (TFRM) aims at reducing the Trade frauds by creating client specific Fraud & Anti-Money Laundering (AML) scenarios by leveraging big data from multiple sources.
Technologies: Quantexa, Apache Spark, Scala, Python, HDFS, Hive, Kibana and Control-M
Roles and Responsibilities:
Project: Digital Transformation & Data Engineering Initiative
Client: Myanma Posts and Telecommunications
Description: MPT is the first and leading telecommunications company in Myanmar. Providing both fixed and mobile telecommunication services to people and enterprises of Myanmar. MPT BI-IOS is collecting data from different sources and transform them as per the business logic and storing into Hadoop File System. From Hadoop storage layer, generating reports, down-streams and up-streams by analyzing and computing data.
Technologies: HDFS, YARN, Apache Spark, Apache NiFi, Apache Hive, Ranger, Superset, Presto, Apache Kafka, Sqoop, PostgreSQL and Git Lab.
Roles and Responsibilities:
Project: Healthcare Data Analytics
Client: Syncrasy Labs
Description: Syncrasy Labs is a Health Care based company which collected the data from all the nearby hospitals, patients and doctors related data across US. The purpose of this project is providing the physician details to the user based on their search queries and their previous health transactions and nearby locations.
Technologies: HDFS, Apache Hive, Apache Spark, StreamSets, Apache Ranger
Roles and Responsibilities:
Client: CheckBac Inc.
Description: CheckBac is a low-cost alcohol monitoring product, where it produces the user data in daily basis in the form of JSON and this data will land in HDFS initially. Once the data is available in HDFS we load into Hive tables.
Technologies: HDFS, Apache Hive, HBase, Ranger, Java, MySQL, Tomcat, Eclipse, Maven
Roles and Responsibilities: