

Highly skilled Data Engineer with strong expertise in Python, SQL, and scalable ETL/ELT pipeline development on AWS and GCP. Proven experience building batch and streaming data pipelines, integrating Snowflake with cloud-native data services, and implementing CDC, SCD, and data quality frameworks. Adept at working across multiple client environments and collaborating with cross-functional teams to deliver reliable, production-grade data solutions. Proficient in data warehousing, orchestration (Airflow), CI/CD automation, and cloud security/IAM, with foundational experience in machine learning (classification, regression) and GenAI integrations. Committed to building high-quality, scalable, and cost-efficient data platforms.
Assisted in developing and maintaining data processing pipelines using Python and PySpark for batch and near-real-time ingestion across various data layers.
Assisted in building and maintaining Python-based data processing pipelines for ingesting, cleansing, and transforming healthcare datasets for search and analytical applications.
Role: Backend Python Developer. Applied machine learning.
Technologies: Python, AWS (S3, Lambda, EC2), PostgreSQL, Jenkins, Scrapy, Selenium, and Machine Learning (Classification, Regression).
Technologies: Python, Pandas, Machine Learning (Classification, Regression), SQL, Django, HTML, JavaScript, and Data Analysis.