Summary
Overview
Work History
Education
Skills
Timeline
Generic

REENA THOMAS

Kochi

Summary

Accomplished Lead Data Engineer at Tata Consultancy Services, specializing in Big Data technologies and performance tuning with over 20 years of experience, of which 6 years were in big data. Successfully migrated VISA's Hadoop ecosystem, enhancing data load times and analytical capabilities. Proven mentor with strong skills in data modeling and project planning, driving significant improvements in data processing efficiency.

Overview

25
25
years of professional experience

Work History

Lead Data Engineer

Tata Consultancy Services
Kochi
07.2022 - Current
  • Achieved significant performance gains by improving data load times by 30% for historical data, directly enhancing reporting and analytical capabilities.
  • Drove substantial performance improvements by refactoring complex Hive queries into highly efficient Spark SQL, delivering over 60% reduction in execution time for critical data processing tasks.
  • Architected and implemented high-performance data pipelines for sophisticated financial data analysis within the VMR use case, enhancing decision-making capabilities.
  • Optimized MROI (Marketing Return on Investment) models by integrating new metrics using Spark SQL, directly contributing to improved marketing decisions and maximized ROI.
  • Worked in the critical migration of VISA's expansive Hadoop ecosystem to the Tusker environment, ensuring seamless data continuity and operational integrity for a major fintech client.
  • Mentored junior engineers and led technical discussions, fostering team growth and ensuring adherence to best practices in data engineering.

Senior Oracle Developer/ Data Engineer

Omegacube Techsystems
Bangalore
12.2016 - 06.2022

Client: NFM (Non Food Marketing)

  • Engineered robust data injection processes by meticulously analyzing diverse OLAP and OLTP system data natures, ensuring reliable data ingestion and optimizing data flow.
  • Designed and implemented Hive external tables leveraging HDFS for scalable storage and efficient processing of large datasets.
  • Utilized Spark DataFrame and Dataset APIs to perform complex data processing and transformations, improving data readiness for analytics.
  • Facilitated efficient report generation by registering DataFrames into temporary tables via SQLContext, enabling rapid querying and analysis.
  • Streamlined data ingestion using Oracle UTL Files and Sqoop, with Hive handling subsequent data processing and transformation, often involving query fine-tuning for optimal performance.

Client: Sumatech Inc

  • Developed and optimized data injection pipelines, meticulously assessing data characteristics from various OLAP and OLTP sources.
  • Established Hive external tables on HDFS to effectively store and process results from large-scale data operations.
  • Leveraged Spark DataFrame and Dataset APIs for advanced data manipulation and processing, ensuring data integrity and efficiency.
  • Enabled rapid report generation through SQLContext integration, allowing direct SQL queries on DataFrame-backed temporary tables.
  • Implemented efficient data ingestion strategies using Oracle UTL Files and Sqoop, integrating with Hive for comprehensive data processing and transformation workflows, including performance tuning of data processes.

Client: Lakeside Equipment Corporation, Hensa Manufacturing Company

  • Led the technical design and development of custom reports and interfaces based on functional specifications, significantly enhancing business functionalities.
  • Optimized database objects and schemas to align with evolving application requirements, ensuring high performance and scalability.
  • Executed a complex cross-database migration, converting over 120 Oracle packages, procedures, functions, views, and reports to PostgreSQL over 9 months.
  • Utilized pgAdmin and custom SQL scripts to facilitate seamless deployment across development, staging, and production environments, minimizing downtime.

Oracle PL/SQL Developer

Transworld Computers
03.2007 - 03.2015
  • Engaged directly with clients for comprehensive requirements gathering, translating business needs into robust back-end changes for existing products and new database designs.
  • Designed and developed scalable Oracle database solutions, contributing to the architecture and implementation of core systems.
  • Conducted application and database tuning initiatives, fine-tuning complex SQL and PL/SQL code to significantly improve system performance and responsiveness.
  • Developed critical screen formats and detailed report designs to meet specific user and business requirements.

Oracle PL/SQL Developer

Ministry of Industry and Commerce
05.2000 - 02.2007
  • Extensively utilized Oracle SQL, PL/SQL
  • Loader for advanced data manipulation, scripting, and system management.
  • Optimized complex SQL queries for performance, resulting in faster data retrieval and improved application efficiency.
  • Created and maintained database objects including Tables, Views, Indexes, Synonyms, and Sequences to support evolving data models.
  • Defined and generated comprehensive screen formats and report designs, ensuring user-friendly interfaces and clear data representation.
  • Managed end-to-end data migration from legacy systems to new platforms using SQL Loader, ensuring data accuracy and integrity.
  • Collaborated and brainstormed with Ministry stakeholders to define and gain consensus on rigorous test and acceptance criteria.
  • Developed and executed a comprehensive test plan and schedule, including detailed test cases and scenarios with expected results, ensuring solution functionality and quality.

Education

Master of Computer Applications (MCA) -

Dr. GRD College of Science, Bharathiar University
Coimbatore
01.1996

Skills

  • Big Data Technologies: Hadoop, Hive, HBase, Sqoop, Apache Spark, Scala, Apache Airflow, On-prem HDFS
  • Database Technologies: Oracle (11g, 10g Forms/Reports, APEX Reports), PostgreSQL
  • Programming Languages: SQL, PL/SQL, Java (occasional development), Python (occasional development)
  • Tools & Methodologies: ETL, Data Migration, Performance Tuning, Query Fine-Tuning, Data Modeling, Database Design, Requirements Gathering, Project Planning, Technical Design, System Optimization, Data Quality, Agile Methodologies (implied through project work)

Timeline

Lead Data Engineer

Tata Consultancy Services
07.2022 - Current

Senior Oracle Developer/ Data Engineer

Omegacube Techsystems
12.2016 - 06.2022

Oracle PL/SQL Developer

Transworld Computers
03.2007 - 03.2015

Oracle PL/SQL Developer

Ministry of Industry and Commerce
05.2000 - 02.2007

Master of Computer Applications (MCA) -

Dr. GRD College of Science, Bharathiar University
REENA THOMAS