Java
Experienced software engineer with a track record of designing and building robust big data pipelines. Proficient in Apache Flink and Spark, writing efficient code in Java and Python, and quick to pick up new languages as a project requires. Strong foundation in data structures and algorithms, applied to optimizing system performance and delivering high-quality solutions.
Project: Services, Interaction, and Subscriptions Data Models
Role: Big Data Engineer (2024 - Present)
About Chamberlain Inc.:
Chamberlain is a B2B company providing IoT-based services to clients across diverse industries such as deliveries (Amazon, Walmart, etc.), automotive (Tesla, Hyundai, Honda, etc.), and smart home solutions. The IoT devices collect vast amounts of data, which are analyzed to generate actionable business insights.
Roles and Responsibilities:
Technologies Used:
Databricks, Data Build Tool (DBT), Azure Data Factory, Azure DevOps, PySpark
Project: Providence
Role: Flink Developer (2023 - Present)
Project Overview:
The Providence project focuses on collecting and analyzing data from Warner Bros. Discovery’s applications used by clients. The data captures user interactions with the apps, along with any errors encountered during normal operations. This data helps improve app performance and enhances the user experience.
Roles and Responsibilities:
Technologies Used:
Apache Flink, Kubernetes, Terraform, GitHub Actions, JFrog, Maven, AWS Managed Flink, AWS Timestream, AWS RDS, Grafana (visualization), Prometheus (monitoring), Kafka (streaming)
Programming Language:
Java
Deployment Details:
The Flink application is deployed on AWS Managed Flink, the AWS managed service for Apache Flink, which handles scaling and availability of the job.
Project: OpenSearch Analytics
Role: Data Engineer (2022 - 2023)
Project Overview:
This project focused on identifying and addressing suboptimal OpenSearch query practices that impacted performance and response times. Additionally, it analyzed heavily used indexes to optimize scaling of OpenSearch clusters based on usage patterns.
Roles and Responsibilities:
Technologies Used:
AWS Lambda, AWS Athena, AWS SQS, AWS S3, AWS Glue Crawler, CloudFormation, Jenkins, Grafana
Programming Language:
Python
Project: Observability
Role: Data Engineer (2022 - 2023)
Project Overview:
This project aimed to enhance monitoring, incident reporting, and metering of Discovery’s infrastructure and services. By tagging resources (e.g., code repositories, AWS services, alerts) with operational metadata, the project facilitated quick resolution of failures and accurate cost metering for individual teams.
Roles and Responsibilities:
Technologies Used:
PySpark, GitHub Actions, PostgreSQL, Grafana, AWS Athena
Programming Languages:
Python, Java
Project: Stream Metrics as a Service
Role: Data Engineer (2022)
Project Overview:
The project involved real-time analysis of data stream events and visualization of metrics on Grafana dashboards. These analytics supported critical business decisions for Discovery.
Roles and Responsibilities:
Technologies Used:
Apache Flink, AWS Kinesis Data Analytics, Grafana
Programming Language:
Java
Project:
Role: Data Engineer (Jan 2020 - Dec 2021)
Project Overview:
The project focused on the extract, load, transform (ELT) processing of data from diverse sources, including Oracle databases, flat files, and JSON, CSV, Avro, and Parquet files, into a data warehouse. The processed data was then loaded into Hive tables for generating reports and supporting business intelligence activities.
Roles and Responsibilities:
Technologies Used:
Spark, Hive, Python, Unix, Sqoop, MySQL
Tools:
GitHub, PyCharm, Eclipse, MobaXterm, PuTTY
Project:
Role: Data Engineer (Aug 2019 - Jan 2020)
Project Overview:
The project involved analyzing IPL (Indian Premier League) datasets to extract insights and generate reports on player and team performance. Key analyses included identifying the best batsman, bowler, and fielder for a season and predicting future events.
Roles and Responsibilities:
Technologies Used:
Power BI, Apache Pig
Tools:
Power BI, PuTTY
Apache Flink
Apache Spark
AWS
Python
Java
C
MySQL
Java
Python
Maven
Kubernetes
Docker
Git
GitHub
Databricks
DBT
Grafana
Singing
Sketching
Painting
Fitness
Core Java
I have a deep passion for singing, which allows me to express myself creatively and unwind after a busy day. Whether it’s practicing classical pieces, exploring contemporary genres, or simply humming my favorite tunes, singing has always been a source of joy and inspiration. It also serves as a great way to connect with others who share a love for music.
Python Developer