Summary
Overview
Work History
Skills
PROJECTS:
Education
Accomplishments
Timeline
Generic

ARUNA MURUGESAN

Summary

Seeking a challenging opportunity in an organization wherein, I utilize my skills in the Information Technology Industry. Experienced, result-oriented, resourceful, new implementation and problem-solving Data engineer with leadership skills. Adapt and met challenges of tight release dates. Over 8+ years of diverse experience in data science.

Overview

8
8
years of professional experience

Work History

Senior Data Engineer

Virtusa software services
Bengaluru
03.2016 - Current
  • • Developed Pig Latin scripts in the areas where extensive coding needs to be reduced to analyze large data sets.
    • Involved in extracting huge volumes of data from relational databases to HDFS using Apache Sqoop.
    • Experienced in creating Hive schema, external tables and dynamic partitions.
    • Using various performance tuning technologies to handle very large set of data.
    • Worked on other tools like WinSCP for transferring the files from Windows to UNIX and vice-versa, Putty for connecting to UNIX and run the scripts.
    • Used several optimization techniques to efficiently manage and consume cluster space.
    • Handling customer escalation in day-to-day basis and giving suggestions to the support team for better resolution.
    • Involved in the complete lifecycle (SDLC) of the project i.e. Design, Development, Implementation, Unit testing and Support.
    • Used data ingestion techniques to integrated millions of data to the repositories in daily basis.
    • Worked on loading and transformation of large sets of structured data into Hadoop system.
    • Mentored and Managed team of four to develop in Hadoop applications
    • Design, develop and implement data processing and analytics solutions using Apache Spark
    • Converting SQL queries, MapReduce programs into Spark transformations
    • Using Oozie for workflow management and crontab for scheduling jobs.
    • Implemented data validation checks resulting in less errors and enhance data reliability
    • Analyze large datasets using PySpark’s DataFrame transformations and actions. This includes filtering, aggregating, joining, and performing complex data transformations.

Skills

  • Apache Hadoop
  • Apache Pig
  • Apache Hive
  • Oozie
  • Apache Sqoop
  • Pyspark
  • Shell Scripting
  • Spark

PROJECTS:

● Involved in the development of migrating the expensive tool to Hadoop from the scratch of the project.

● Finished a Spark project for a reputed client well ahead of the deadline.

Education

Bachelor of Engineering - Electronics And Communications

KSR College of Technology
Thiruchengode, TN
01-2015

Accomplishments

  • Recognized with RAVE awards for exceptional work from the Project Senior managers for complete dedication and highly rated feedback directly from the procurement director of a client company.
  • Received certificate as a token of appreciation from client for best deliverables.

Timeline

Senior Data Engineer

Virtusa software services
03.2016 - Current

Bachelor of Engineering - Electronics And Communications

KSR College of Technology
ARUNA MURUGESAN