Tarun T

Bengaluru

Summary

With 7.9 years of IT experience, I specialize in Big Data, Spark, Google Cloud, Core Java, and SQL Server, with hands-on expertise across the Hadoop ecosystem (HDFS, Hive, Impala, Sqoop, Oozie, Ambari, Spark Core/SQL, Shell Scripting, HBase, Pig, MapReduce). I have developed end-to-end Hive/Impala masking applications, optimized data pipelines, and worked extensively with HiveQL, Spark RDDs/DataFrames/Datasets, Cloudera/Hortonworks, and Sqoop import/export operations. I have also built scalable pipelines using GCP Dataflow, Kubernetes, Apache Beam, Azure Databricks, and PySpark.

Overview

9 years of professional experience
1 Certification

Work History

Senior Software Engineer

Publicis Sapient [Client - Lloyds Bank]
Bangalore
07.2023 - Current
  • Built data pipelines using Google Cloud Dataflow, Kubernetes, Apache Beam, Azure Databricks, and PySpark to ensure smooth and scalable data processing.
  • Performed data cleaning and transformation in BigQuery and managed storage using GCS, BigQuery, and Spanner.
  • Used Terraform to manage and maintain cloud infrastructure as code.
  • Created JUnit test cases and implemented CI/CD pipelines with Jenkins and Spinnaker for automated deployments.
  • Monitored and optimized data pipelines for performance, cost, and security, and maintained data dictionaries in Collibra.

Senior Software Engineer

Technosoft [Client - PayPal]
Chennai
01.2022 - 03.2023
  • Analyzed the existing Hadoop/Big Data project and mapped source datasets, tables, and columns to GCP/BigQuery.
  • Converted the Hadoop/Big Data code to GCP/BigQuery and deployed it into the development framework.
  • Updated configuration files, performed dry-run testing, and validated output tables using sample data.
  • Migrated the project to QA, updated configs, ran both the Hadoop and GCP versions on the same data to compare results, and prepared test cases and the runbook.

Senior Software Developer

TEKsystems [Client - Ford]
Chennai
11.2020 - 12.2021

  • Gathered requirements from the Product Owner and requested database access as needed.
  • Prepared the Technical Design Document and obtained approval.
  • Wrote HQL statements to extract and transform data from source to staging and target.
  • Reviewed code with the Tech Anchor and uploaded HQL files into the GDIA framework.
  • Created Oozie bundles, ran the jobs, and prepared unit test cases after batch completion.
  • Reviewed results with the Product Owner, obtained approval for production launch, and prepared the runbook.
  • Migrated the application from QA to Production, performed basic checks, and monitored initial job runs.

Senior Software Developer

Mage Data (Mentis) [Client - Credit Suisse]
Chennai
08.2018 - 11.2020
  • Designed the Hive and Impala architecture for Mentis masking applications.
  • Planned product features, obtained approvals, and guided the team's development according to the plan.
  • Coordinated daily with managers, clients, and internal teams to resolve issues and gather information.
  • Developed Discovery, Static Masking, and Dynamic Masking products, including Hive/Impala UDFs and Spark UDFs.
  • Created masking views, handled runtime and performance issues, and processed Hive tables using Spark.
  • Connected Hive with Spark SQL to perform transformations and analysis.
  • Monitored and debugged jobs from the Mentis UI and handled deployments with the required client privileges.
  • Supported testing, Kerberos and Sentry setup, and delivered end-to-end masking products, including documentation and client sign-off.

Software Developer

VC ERP Consulting Pvt Ltd [Client - British Petroleum]
Bangalore
08.2016 - 07.2017
  • Analyzed functional requirements and created Hive tables for data processing.
  • Used partitioning, bucketing, and complex data types, and wrote HQL queries tuned for performance.
  • Imported and exported large datasets between Hadoop and relational databases using Sqoop.
  • Developed Hive DDLs and converted SQL queries into Hive queries.

Education

M.Tech Computers

JNT University
Kakinada
12-2014

Skills

Big Data Ecosystem:
HDFS, Hive, Impala, Sqoop, MapReduce, Oozie, Spark Core, Spark SQL, HBase, Pig, YARN, Zookeeper, Hue, Ambari

Cloud Platforms & Services:

  • Google Cloud Platform (GCP): BigQuery, Spanner, Airflow, Pub/Sub, Dataflow
  • Azure: Azure Databricks

Programming Languages:
Core Java, Scala, Python, Shell Scripting

Databases:

  • RDBMS: SQL Server
  • NoSQL: HBase

Application Server:
Apache Tomcat

Certification

Google Cloud Professional Data Engineer
