Analysed user requirements and designed and developed ETL processes to load enterprise data into the Data Warehouse
Provided global thought leadership in analytics solutions to benefit customers; analysed the product's key metrics and used PySpark and DBSCAN clustering to improve the product, curbing discrepancies flagged by end customers
Identified, protected, and leveraged existing data using KPIs provided by end-users; migrated the project from Hadoop to GCP using services such as Dataproc (with autoscaling and Serverless) to meet business requirements
Scheduled jobs with Airflow for better orchestration and created an end-to-end ETL pipeline from source to BigQuery tables as per business needs; streamlined the customer data fetching process
Created complex SQL queries and database objects to pull and manage data; recommended improvements based on data analysis, which reduced runtime for production jobs by 25%
Received a certificate of appreciation from Ford Motor Company (GDIA) for contributing to the completion of the GCP migration, which signified the adoption of a cloud strategy and accelerated the reduction of technical debt
Data Engineer Intern
Ford
Chennai
02.2022 - 07.2022
Analysed complex datasets to identify trends and insights using Hive and Spark; resolved bugs in the existing code, further improving accuracy
Collaborated with a team of approximately 8 members across departments to ensure data capture was accurate and reporting needs were met; developed automated tests to verify the quality of ETL job output
Performed quality assurance checks on incoming datasets, identifying and resolving issues precisely, which helped businesses provide accurate data to the government for building EV charging grids
Utilised SQL to query large Datamart datasets drawn from multiple sources; documented process flows from the source system through delivery channels; participated in sprint planning meetings with developers and product owners
Created ETL scripts to automate manual data loading processes, saving 30% of the time required; implemented data models to support business requirements; developed strategies for collecting relevant data from additional sources for metric analysis
Project Management Intern
Nobel Explorers
07.2021 - 09.2021
Assisted in coordinating project activities to ensure successful completion within established deadlines; created an Excel sheet to track the timeline of project stages and used micro to support team collaboration
Supported planning by documenting, developing, and refining concepts to align strategies with stated project objectives
Applied 3 different design specs while developing the website; maintained up-to-date records on all tasks associated with assigned projects; enhanced the website with unique features as part of frontend development
Campus Ambassador Intern
Hello Study Global
05.2020 - 07.2020
Developed relationships with college officials, professors, and student club presidents to share information and collaborate on business-building endeavours; organised and promoted campus events to increase student engagement
Education
Bachelor of Technology - Information Technology
Vellore Institute of Technology
Jul 2022
Skills
Data Migration
Performance Tuning
SQL and Databases
MS Office
GCP (Dataproc, BigQuery, Looker)
Airflow
Documentation Support
Data Extraction
Apache Spark
Learning Strategies
Production Support
Accomplishments
Certifications: User Experience (IxDF, 2023), Security Analyst - SS/Q0901 (Nasscom, 2022), AWS Cloud Practitioner (Amazon Web Services, 2021)