Data engineer with over 4 years of experience in software design, development, and implementation, including integration, maintenance, and web applications, as well as database management and machine learning.
Built data pipelines to move data from marketing platforms, Google Analytics, and other sources to Redshift using Matillion and Fivetran, ensuring seamless integration and efficient data processing.
Led the migration from DynamoDB to Redshift using Kinesis, S3, and Lambda.
Designed and managed data warehouses on Snowflake, optimizing data storage and performance for large-scale analytics and reporting tasks.
Automated ETL processes handling billions of rows of data, reducing manual workload by 30 percent.
Ingested data from multiple sources using a combination of SQL, Python, Google Analytics, big data tools, Redshift, and S3 to create data views for BI tools such as Tableau and Power BI.
Built CI/CD pipelines using Jenkins to streamline deployments.
Utilized Snowflake to design and implement scalable data warehouses, optimizing data pipelines for efficient processing and real-time analytics.
Built a more efficient ETL pipeline using Airflow and Redshift.
Created monitoring alerts for data pipelines.
Created data pipelines that ingested streaming and transactional data and wrote cleaned data to Redshift.
Performed data annotation on generative AI data.
Handled data uploads to AWS.
Handled process documentation, configuration, and resolution of production issues.
Python
SQL
PySpark
Snowflake
Git
Data Warehousing
Machine learning
Apache Airflow
Jenkins (CI/CD)
AWS cloud
Matillion, Fivetran
Flask
Docker