Cloud and Big Data Developer with 6+ years of extensive technical experience in data-related technologies. Proficient in Python, SQL, and Big Data Development, with a strong background in data migration and cloud services (GCP/AWS). Adept at working in Agile software development environments. Passionate about handling structured and semi-structured data, consistently delivering high performance. Skilled in GCP and AWS cloud services, with hands-on experience in Sqoop, HDFS, NiFi, Spark, Hive, MapReduce, Oozie, HBase, and Airflow
Project: Healthcare Data Migration
Technologies:
Description: Migrated healthcare data from Epic, Cerner, and HL7 systems to GCP for enhanced data analysis and processing.
Responsibilities:
Project: Data Migration
Industry: Education
Technologies: GCP (Dataproc, DataFlow, BigQuery, Composer, Kubernetes Engine), BigData (Sqoop, Spark, SQL), Python, Shell Script
Migrated an educational client's data from MS-SQL to BigQuery on GCP to analyze grades, classroom activities, and counseling call recordings for student progress assessment and retention improvement.
Project: Cloudera to Google Databricks Migration
Technologies: GCP, Python, Scala, Databricks, Spark, Airflow
Data Flow:
Roles and Responsibilities:
Project: Data Migration
Industry: Insurance
Technologies: GCP, Python, DataFlow (Apache Beam), BigQuery, Data Fusion, Google Map API, Cloud Composer, Jupyter Notebook
Overview:
Key Pipelines:
Responsibilities:
Project: Data Lake Ingestion Framework
Industry: Insurance Tech
Technologies: Apache NiFi 1.2, Python 2.7, Java 1.7, AWS Cloud (NiFi, HDP infrastructure), GitHub, Jenkins, Bamboo
Google Cloud Platform(GCP)
Amazon Web Services(AWS)
Python
Scala,Java
Apache Spark
Airflow
GCP tech stack: BigQuery, Dataflow,Dataproc
Build Automation Tools: Maven, Gradle, SBT