I have 13.6 years of experience in IT industries, I have dedicated 6.6 years to hands-on work in Big Data analytics and ETL development projects. My expertise lies in data engineering, where I have extensively worked with data processing frameworks such as Apache Spark, Hadoop, Hive, and SQL. I have successfully migrated data to Big Data environments, handling both big and small data files efficiently. My role involved building and maintaining data pipelines to collect, transform, and load data from various sources. I have a strong understanding of data modeling techniques and have optimized data for analysis through ETL processes. Additionally, I have experience in ensuring seamless integration of data from different sources, focusing on data quality, consistency, and reliability.
Overview
14
14
years of professional experience
1
1
Certification
Work History
Specialist in Data Engineering
LTIMindtree Pvt Ltd
Chennai
10.2020 - 08.2024
Scala (with a focus on the functional programming paradigm)
JUnit, Mockito, Scala test (Embedded Cassandra)
Research data acquisition opportunities and new uses for existing data
Imported a large set of data into Hive tables using Sqoop fromvarious sources such as Oracle, DB2, Teradata
Managing and structuring data
Developing data modeling, mining, and productionprocesses Involved in creating Hive tables and loading and analyzing data using hive queries developed
Ensure data securityand compliance with required regulations Java code using both Data frames/SQL and RDD/MapReduce in Spark for Data Aggregation, queries and writing data back into HDFS
Fixed the data issues onthe Hive tables and solved performance issues in Hive
Ensuring they meet business needs and industry standards
They create data management systems to integrate, centralize, protect, and maintain the data sources Optimizing existing algorithms in Hadoop using Spark Context, Spark-SQL, Data Frames and Pair RDD's
Implementation knowledge like how to create user stories, how to link with an EPIC, what is sprint, project release etc
Team Lead
Syntel International Private Ltd
02.2017 - 08.2020
FedEx Global Sales Reports The scope of testing is based on onsite/offshore process model
According to this model the analysis, design of performance test, the script development and review and Performance test execution will be performed at offshore
Knowledge transition and script review, execution and Data requirement will be carried out at onsite
Performance tests are based on the assumption that the entire applications are stable and ready for PT
The scope of testing is Sales IT
Performance testing includes Load testing, Endurance testing to evaluate the stability, reliability and availability of the application
Roles & Responsibilities: Gathering requirements and ensuring the required functionalities are delivered as per the requirements
Implemented Spark jobs using Pyspark
Involvedin PROD deployment activities
Involved in Hive data analysis
Resolved application vulnerabilities occurred in PROD black duck scan report
??
Resolved code smells occurred in sonar scan report
Identifying and resolving performance bottlenecks in data processing and storage systems to ensure optimal performance and efficiency Design ETL pipelines to transform raw data into a format suitable for analysis Involves data cleansing, aggregation, and enrichment, ensuring the data is usable for data scientists and analysts Leading a team of data engineers, providing technical guidance, mentoring, and fostering a collaborative and innovative work environment Knowledge of software development lifecycle; preferably with understanding of Agile Kanban/Scrum Involved in preparing Performance test strategy and Load Test Execution Plan document.Technology / Tools Used: Pyspark, Spark, Hive.
Software Test Engineer
Hewlett Packard Enterprise
09.2016 - 02.2017
Eprime & Elite This is a web based application for online shopping of HP products
EPrime is a business-to-business e-commerce solution targeted to meet the needs of Corporate, Enterprise, and Global Accounts on Global
HP.com Business to Business is the easiest, most efficient way to manage your business relationship with HP worldwide
Adapted to each customer's unique needs, HP.com Business to Business provides complete capabilities and access to key information and e-commerce tools required by enterprise customers
A secure, customized HP.com B2B website provides single sign-in access from virtually any location in the world
Recognized at your log-on, your customer profile and role within your organization determines your content and online experience
An intuitive user interface provides instant access to the tools you need and use most often, such as specific product lifecycle information and order status
Roles & Responsibilities: Understanding and analysis the requirements Taking the overall responsibility for the project deliverables
Implemented BQ scripts to validate data after exporting from HDFS to GCP Fixed the data issues on the Hive tables and solved performance issues in Hive Optimizing of existing algorithms in Hadoop using Spark Context, Spark-SQL, Data Frames and Pair RDD's Involved in preparing Performance test strategy and Load Test Execution Plan document
Having good experience in Work Load modeling, analyzing and preparing performance test reports for load test results Technology / Tools Used: Windows, Java, Oracle, Load Runner, Performance Center, Newrelic
QA Engineer
DynamicBiz Software Solution Pvt. Ltd.
01.2013 - 07.2016
Caliber Information Inc Caliber Information Systems offers maintenance and repair (M&R) cost control and compliance solutions to the Transportation Industry
Caliber focuses on the Intermodal Sector, specializing in inspection and equipment readiness services
We combine our industry M&R experience with our applied technologies
Our core strength is using our Intermodal experience to develop services and applied technologies that meet our customers cost control goals, while ensuring compliance with corporate and governmental regulations
Calibers value is the skill of our field auditors and management team
Our team brings considerable experience and new technologies to our service offerings, which have benefited our clients
Roles & Responsibilities: Analyzed Business Requirements and clarified the business needs from the Business users
Supported for workload profile efforts
Created Scenarios based on load model to simulate the production load
Enhanced the Load Runner Scripts by inserting checkpoints, rendezvous points, parameterization, correlation, and error handling
Build the Scenarios in Load Runner Controller and add the performance counters to monitor the App, and Database servers
Executed the Scenarios and Captured the Test Results by Analyzing Available Graphs from the Load Runner Controller
Developed Load Runner scripts in VuGen to perform load and stresstesting to find out the Application behavior under load
Developed the performance monitoring scripts of all the servers involved in testing
Attended daily Scrum and weekly status meetings for the project status
Technology / Tools Used: Windows, .NET frame work 4.5, Web services, SQL Server 2008, Loadrunner 11.52.
Test Engineer
Openwave Computing Services
12.2010 - 12.2012
Analyzing the requirement specification documents, functional specification documents and creating test plan and prepare the test metrics accordingly
Designing manual creating test cases/scenarios as per requirement & Functional Specification Documents
Establishing the test environment and configuring testsystems
Performing system and integration testing as per the testcases
Updating Quality Center for Bug log and preparing Bug report documentation
Appraising the performance of the application by using various testing methods such as system & integration, functional, usability, regression, retesting & back-end (database) testing Escalating the bugs/issues to concerned team or higher level
Preparing ?Weekly Status Report
And communicating with onsite team
Participating in weekly status calls with onsite team
Co-coordinating with the development team and onsite team for providing inputs on issues or Bugs.