Tarun Nagalla

Hyderabad

Summary

Results-driven IT professional with over 4.5 years of experience, including 2+ years specializing in Big Data. Expertise in data migration and transformation using Azure Databricks and Spark, coupled with a proven track record of enhancing data workflows through tokenization and data masking. Strong analytical skills and effective collaboration within cross-functional teams contribute to successful project outcomes. Aiming to leverage these skills in challenging environments to drive further innovation in data management.

Overview

5 years of professional experience
1 Certification

Work History

Big Data Developer

Tata Consultancy Services (TCS)
08.2024 - Current
  • Implement data discovery and protection using Protegrity, applying tokenization and data masking techniques at the cluster level to safeguard sensitive information across the data pipeline.
  • Develop and manage protected tables and views in Hive and Impala, enabling applications to query data securely while maintaining compliance with data protection standards.
  • Implement Role-Based Access Control (RBAC) and Attribute-Based Access Control (ABAC) at the column level, ensuring granular control over sensitive data access.
  • Collaborated with the framework team to develop and integrate the Ingestion Framework for protected tables, ensuring seamless integration and compatibility.
  • Technologies: Spark, Hive, Impala, HDFS, Tokenization, Data Masking, OneTrust (discovery tool), Protegrity.

Big Data Developer

Tata Consultancy Services (TCS)
06.2023 - 08.2024
  • Migrated transformation jobs from on-premises to Azure Databricks, ensuring compatibility with Databricks clusters, and optimizing execution to minimize processing time for large datasets in the cloud.
  • Developed Spark-based transformation scripts in Azure Databricks, ensuring seamless integration with Azure Data Factory pipelines for automated orchestration.
  • Built, optimized, and monitored ADF pipelines to automate data movement from on-premises databases to Azure Data Lake Storage Gen 2 (ADLS Gen 2).
  • Transformed data in Azure Databricks and loaded it into Azure SQL for downstream BI reporting and analytics.
  • Resolved issues within the pipeline to ensure smooth data flow and maintain data integrity.
  • Ensured data quality and accuracy by conducting comprehensive testing and validation of transformed data.
  • Collaborated with the Production Support team to assist in production deployments and ensured smooth operation of data workflows in the cloud.
  • Technologies: Azure Databricks, ADF, Spark, HDFS, ADLS Gen 2, Azure SQL, Scala, Python, SQL, Oracle.

QA Manual Test Engineer

Capgemini India Pvt Limited
07.2020 - 05.2023
  • Created and developed detailed manual test cases for various functionalities, ensuring comprehensive coverage of the product.
  • Conducted thorough regression testing to validate the functionality of existing features after updates and bug fixes.
  • Identified and raised defects in JIRA, documented them with clear steps to reproduce, and worked closely with developers to ensure prompt resolution.
  • Participated in defect triage meetings to prioritize and discuss critical issues.
  • Led the AU UAT team to perform regression testing, ensuring a smooth deployment.
  • Provided sign-offs for deployment after confirming that all necessary testing and defect resolution were completed.
  • Technologies: Manual Testing, Functional Testing, Defect Tracking, JIRA.

Education

Bachelor of Science - Electronics And Communication Engineering

B.V. Raju Institute of Technology
Hyderabad, Telangana
04-2019

Board of Intermediate Education

Sri Chaitanya Junior College
Hyderabad
03-2015

High School Diploma

Kendriya Vidyalaya
Hyderabad
04-2013

Skills

  • Hadoop
  • HDFS
  • Spark
  • Sqoop
  • Hive
  • Data processing
  • Data migration
  • Apache Airflow Workflow Management
  • Azure
  • Azure Data Factory
  • Azure Databricks
  • AWS
  • Cloudera
  • S3
  • SQL
  • PostgreSQL
  • Oracle
  • Scala
  • Python
  • Linux
  • Windows

Certification

  • PSM I: Professional Scrum Master (Scrum.org)
  • Azure Databricks and Spark for Data Engineers (Udemy)
  • Azure Data Factory and Spark for Data Engineer (Udemy)

Timeline

Big Data Developer

Tata Consultancy Services (TCS)
08.2024 - Current

Big Data Developer

Tata Consultancy Services (TCS)
06.2023 - 08.2024

QA Manual Test Engineer

Capgemini India Pvt Limited
07.2020 - 05.2023

Bachelor of Science - Electronics And Communication Engineering

B.V. Raju Institute of Technology

Board of Intermediate Education

Sri Chaitanya Junior College

High School Diploma

Kendriya Vidyalaya