Project #1:
Client: Major Food distribution service provider in the US
Roles & Responsibilities:
1. Led the seamless migration of both structured and unstructured data from legacy data architecture to a modern DataLake infrastructure.
2. Spearheaded the development of an intuitive analytics tool atop the DataLake, facilitating user access and comprehensive data analysis.
3. Successfully integrated and migrated data from diverse client databases, consolidating them into a centralized data repository hosted on AWS.
4. Implemented AWS Lambda functions using Python, automating data transformations and analytics operations within EMR clusters, ensuring efficient processing of large datasets.
5. Developed and configured performance metrics to monitor system performance and utilization, leveraging SQS and SNS for timely notifications and alerts.
Project #2:
Client: Major Food distribution service provider in US
Roles & Responsibilities:
1. Developed and maintained an intuitive business analytics tool for interactive data analysis.
2. Enhanced data exploration by enabling users to select dimensions, metrics, and apply filters seamlessly.
3. Implemented custom calculation metrics using SQL functions to offer flexible analysis options.
4. Streamlined report creation, saving, and sharing across multiple users for easy access.
5. Supported download of reports in various formats (xlsx, CSV) to cater to different user needs.
6. Conducted metric comparisons for Year-over-Year (YoY) trending analysis and drillable insights.
7. Enabled scheduling features to automate report generation on a daily, weekly, or monthly basis.
8. Designed and implemented Python parser code leveraging AWS Lambda, SQS, SNS, DynamoDB, and Redshift.
9. Transformed JSON requests from the UI into SQL queries for efficient data processing.
10. Engineered a highly concurrent, cost-effective, and responsive platform with minimal latency and downtime.
Project #3:
Client: Latentview Analytics
Roles & Responsibilities:
1. Developed interactive dashboards for real-time monitoring and tracking of data pipelines(ETL).
2. Developed QA dashboard to provide immediate insights into scheduled report statuses and data quality assessments.
3. Developed an assets page functionality showcasing detailed information on AWS resources.
4. Provided search functionality to quickly locate specific data packages within the repository.
5. Ensured optimal user experience through responsive design and efficient resource utilization.
Programming Languages : Python, JavaScript, SQL
AWS Services : Lambda, DynamoDB, S3, SQS, SNS, Glue
Frameworks : Angular, Bootstrap
Styling Languages : HTML, CSS
Version Control : GitHub