Charan V

Summary

Over 3.2 years of combined experience in Azure and Aws Data Engineering.
Expertise on Azure cloud platform tools like Azure Data Factory, Azure Databricks, ADLS Gen2, Azure Synapse, Azure SQL DB, Azure Key Vault, Logic Apps,Data Lake.
Expertise on AWS cloud platform tools like AWS S3, AWS Data Lake, AWS RDS, AWS Glue, AWS EMR, AWS Athena, AWS Lambda, AWS Redshift, AWS Kinesis, AWS DynamoDB, AWS DMS, AWS Step Functions, AWS Batch, Databricks on AWS.
Experiences on tools like Git, GitHub, Docker, PySpark, Azure CI/CD, Python, SQL, Power BI, Apache Spark.
Utilizing AWS services for ETL pipelines, data warehousing, and data processing, leveraging Python and PySpark.
Extensive experience in leveraging Azure Data Bricks for large-scale data processing, analytics.
Demonstrated proficiency in orchestrating complex data workflows and ETL processes using Azure Data Factory pipelines.
Proficient in building and tuning Apache Spark clusters within Azure Data Bricks for high-performance data processing and analysis.
Expertise in integrating diverse data sources including on-premises and cloudbased systems for seamless data movement and transformation.
Proficient in building and tuning Apache Spark clusters within Azure Data Bricks for high-performance data processing and analysis
Developing serverless workflows and event-driven architectures using Azure Logic Apps and Azure Functions.
Automated routine data processing tasks using SQL, R, and Python, reducing manual intervention and increasing operational efficiency.
Expertise in architecting and implementing data lake solutions on Azure Data Lake Storage Gen2, enabling scalable storage and analytics for structured and
unstructured data.
Implementing data governance and security protocols, ensuring compliance and safeguarding sensitive data within SQL databases.
Strong understanding of security best practices and data protection mechanisms utilizing Azure Key Vault.
Skilled in designing and implementing scalable and cost-effective storage solutions using Azure Blob Storage.
Automating data processes and optimizing query performance, reducing processing times and operational costs through advanced technical strategies.
Experience in utilizing Designed and implemented data ingestion pipelines from on-premise systems to AWS S3 using AWS Glue, Lambda, and Kinesis.
Built PySpark ETL scripts on AWS EMR to transform and process large datasets for analytics teams.
Currently utilizing SQL for data manipulation and querying within Aws environments, alongside working with databases like MySQL.
Extensive proficiency in database management, encompassing SQL databases such as SQL Server, MySQL, and PostgreSQL, complemented by a deep understanding of NoSQL database like DynamoDB.
Created Athena queries for ad-hoc analysis and reporting, improving business insights delivery by 40%.
Developed and optimized Redshift data warehouse schemas, improving query performance by 50%.
Integrated AWS CloudWatch alerts for data pipeline health monitoring and performance tuning.
Experience with an in - depth level of understanding in the strategy and practical implementation of AWS Cloud-Specific technologies including EC2, EBS, S3, VPC, RDS, SES, ELB, EMR, ECS, Cloud Front, Cloud Formation, Elastic Cache, Cloud Watch, Red Shift, Lambda, SNS, Dynamo DB, Kinesis.
Designed Glue Crawlers to automate schema discovery and catalog updates.
Collaborated with analysts to build Power BI dashboards connected to data source of Azure and AWS.
Detail-oriented Data Analyst with expertise in SQL and Python (NumPy, Pandas, Matplotlib, Seaborn), strong proficiency in Power BI, MySQL, and Advanced Excel Automated ETL workflows using Step Functions, reducing manual intervention by 80%.
Hands-on experience using Jupiter Notebook for data analysis and visualization.
ETL Developed KPI dashboards and performance metrics for senior management and stakeholders Integrated data from various sources including Teradata, SQL Server, Oracle, DB2, Netezza, and flat files for consolidated reporting.

Overview

years of professional experience

Work History

Data Engineer

Resource Pro Operational Solutions Pvt. Ltd

04.2025 - Current

Project Title: Insurance Data Processing System and Market Analytics & Prediction

Client: Wipro
Technologies: AWS (S3, Glue, EMR, Lambda, Kinesis, Redshift, Athena, Databricks) Power BI, and Azure Data Factory, Azure Databricks, ADLS Gen2, Synapse

RESPONSIBILITIES:

Designed end-to-end ETL pipelines using Azure Data Factory and Databricks.
Built real-time and batch processing pipelines for high-volume data.
Integrated multiple data sources into Azure Data Lake and Azure SQL DB.
Optimized pipelines using parallel processing and caching techniques.
Implemented data quality frameworks and validation checks
Developed CI/CD pipelines for automated deployment.
Monitored pipelines using Azure Monitor and Log Analytics.
Developed Spark/PySpark-based ETL pipelines for large-scale data migration.
Processed structured and semi-structured data into enterprise data lake architecture.
Optimized Spark jobs using partitioning, caching, and query tuning techniques.
Built data models and ETL workflows based on business requirements.
Integrated data pipelines with Power BI dashboards for reporting.
Developed Python-based data processing scripts and automation frameworks.
Performed data cleansing, transformation, and validation.
Designed and built real-time data pipelines using AWS Glue, Lambda, and Kinesis for streaming data ingestion.
Developed PySpark ETL jobs on EMR and Databricks for large-scale data processing.
Built S3-based data lake with partitioned Parquet datasets, improving query performance by 60%.
Designed and optimized Redshift schemas, improving query performance by 50%.
Implemented event-driven architecture using Lambda for real-time processing.
Created Athena queries for ad-hoc analysis and reportin.
Built streaming solutions using Kinesis + Lambda + Redshift for near real-time analytics.

Data Engineer

Sixth Energy Technologies Pvt Ltd

11.2024 - 03.2025

Project Title: Data Integration and Migration Platform

Client: Unisys
Technologies: Azure Data Factory (ADF), Azure Databricks, Azure Key Vault, Azure Logic Apps, Azure Data Lake Storage, Azure SQL Database

RESPONSIBILITIES:

Migrated on-prem data to Azure Data Lake and Azure SQL Database.
Integrated Databricks notebooks with ADF pipelines.
Implemented secure data handling using Azure Key Vault.
Automated workflows using Logic Apps and event-driven architecture.
Built Power BI dashboards for business insights.
Designed and developed Azure Data Factory (ADF) pipelines, Linked Services, and Datasets for seamless data migration and transformation.
Tailored ADF pipelines to meet specific project requirements, ensuring efficient data
movement and transformation.
Implemented Azure Logic Apps for real-time email notifications to monitor data migration progress.
Collaborated with senior team members to understand complex data migration requirements and develop effective solutions.

Data Engineer

VISIONQUEST SOLUTIONS PRIVATE LIMITED

01.2023 - 10.2024

Project Title: Inventory Management - E commerce

Client: US
Role: Internship
Technologies: S3, Glue, Redshift, EMR, Lambda, Kinesis, Athena, RDS, DynamoDB, Step Functions and Azure Data Factory, Azure Databricks, ADLS Gen2, Azure Synapse, Azure SQL DB, Azure Key Vault, Logic Apps,Data Lake.

RESPONSIBILITIES:

Implemented data quality checks, improving accuracy of reporting outputs for business intelligence.
Utilized SQL for querying databases, enhancing data retrieval efficiency across projects.
Assisted in the design and implementation of data models to support analytical needs.
Monitored system performance, troubleshooting issues to minimize downtime and enhance reliability.
Streamlined complex workflows by breaking them down into manageable components for easier implementation and maintenance.
Optimized data processing by implementing efficient ETL pipelines and streamlining database design.
Migrated legacy systems to modern big-data technologies, improving performance and scalability while minimizing business disruption.
Increased efficiency of data-driven decision making by creating user-friendly dashboards that enable quick access to key metrics.
Documented technical processes and workflows, facilitating knowledge sharing among team members.
Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
.Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and reporting.
Enhanced data quality by performing thorough cleaning, validation, and transformation tasks.
Designed and developed scalable data pipelines for ingesting, processing, and transforming large datasets using Python and SQL.
Utilized Microsoft Azure services including Azure Data Factory, Azure Data Lake, Synapse Analytics, and Databricks for data integration and transformation.
Developed and optimized data models and schemas to support business intelligence and reporting needs.
Ensured data quality, validation, and governance using monitoring and logging frameworks.
Tuned and optimized query performance and data processing jobs for cost and efficiency.

Education

Bachelor of Engineering - E.C

Visvesvaraya Technological University

Bengaluru, India

07-2024

Skills

Programming: Python, PySpark, SQL
Big Data: Apache Spark, PySpark, Databricks
Azure Cloud: Azure Data Factory, Azure Databricks, ADLS Gen2, Azure Synapse, Azure SQL DB, Azure Key Vault, Logic Apps,Data Lake
AWS Cloud: S3, Glue, Redshift, EMR, Lambda, Kinesis, Athena, RDS, DynamoDB, Step Functions
Data Warehousing: Redshift, Azure Synapse
Databases: MySQL, PostgreSQL, SQL Server, Azure SQL,Amazon RedShift
Visualization: Power BI

VCS Tools: Git, GitHub
Operating Systems: Windows, Linux
Data Transformation: Apache Spark and PySpark
SDLC: Agile & Scrum
IDEs: VS Code, PyCharm
Others: Data Modeling, ETL Pipelines, CI/CD

CAREER OBJECTIVE

Data Engineer with 3.2 years of experience in designing, building, and optimizing scalable data pipelines across AWS and Azure cloud platforms. Strong expertise in ETL/ELT workflows, big data processing, data warehousing, and real-time streaming. Proficient in Python, PySpark, SQL, and Apache Spark, with hands-on experience in Azure Databricks, Azure Data Factory,ADLS Gen2, Azure Synapse, Azure SQL DB, AWS Glue, Redshift,Kinesis, Athena and EMR. Skilled in building data lakes, implementing data governance, and optimizing performance for data-driven decision-making.

Timeline

Data Engineer

Resource Pro Operational Solutions Pvt. Ltd

04.2025 - Current

Data Engineer

Sixth Energy Technologies Pvt Ltd

11.2024 - 03.2025

Data Engineer

VISIONQUEST SOLUTIONS PRIVATE LIMITED

01.2023 - 10.2024

Bachelor of Engineering - E.C

Visvesvaraya Technological University