
● Over 6.6 years of IT experience in Big Data
● Approximately 3 years of experience as a Big Data Developer
● Experience in data warehousing in a Spark, Scala, Hive, Sqoop, and HDFS environment
● Basic knowledge of Databricks
● Hands-on experience with tools such as PuTTY, Control-M, Hue, WinSCP, and Git.
● Experience importing and exporting data between relational databases and a Hadoop cluster using Sqoop.
● Experience writing Hive queries to analyze data in the Hive warehouse using Hive Query Language (HQL).
● Extensive work on extracting, transforming, and loading data from flat files and MySQL using Sqoop
● Prepared test data for testing flows to validate positive and negative cases
● Experienced in working with large datasets using partitions, Spark in-memory capabilities, efficient joins, and optimizations during the ingestion process.
● Experience in resolving defects, presenting defect status reports, clarifying requirements, identifying design inconsistencies, and documenting recurring defects along with their resolutions and related business comments.
● Coordinated with all the teams, representatives, and downstream users involved in testing for sign-off approvals.
● Implemented test logic by writing SQL/HQL queries.
● Experience in data analysis, data validation, data cleansing, data verification, and identifying data mismatches.
CLIENT : NOVARTIS
PROJECT : ATLAS ENHANCEMENT
RESPONSIBILITIES :
Understand new enhancements/requirements requested by the client and create Spark/Scala-based code with Hive as data storage on the HDFS platform. The Atlas4A program interfaces via the EVICO platform to provide enriched HCP and HCO information to different consumers within Novartis. Utilize the BRR framework (a metadata-driven tool) to implement different inbound and outbound interfaces.
· Played a key role in analyzing the requirements of the alignment data interface.
· Understanding the overall functionality of the project and its interaction with different downstream systems.
· Implementing the incremental logic in the ODS layer.
· Scheduling jobs in Control-M.
· Actively involved in code deployment to higher environments.
· Providing hypercare support post-production.
· Imported data in CSV format into Hive tables and performed data validation.
· Created multiple tables and views in Hive, updated Parquet files, DDL files, and Scala code, and pushed them to Git.
· Implemented new ad hoc requirements following the SDLC.
CLIENT : NOVARTIS
PROJECT : VDW REBUILD
RESPONSIBILITIES :
The objective was to migrate the existing VDW (Vantage Data Warehouse) system to the new big data platform, EVICO. The ODS houses data captured in the Customer Relationship Management (CRM) application, 'Vantage'. End users (sales representatives) log their calls, sampling, and all other activities in 'Vantage'; the ODS (through DICE) extracts that data and stores it in a database on the EVICO platform in the Novartis network. This data is then provisioned to various other systems within Novartis for easy access.
CLIENT : NOVARTIS
PROJECT : COGNOS MIGRATION
RESPONSIBILITIES :
The objective was to compare 150+ tables containing information on various drugs in MySQL. Data was loaded from the RDBMS into HDFS and pushed from DEV to QA and finally to Production. We had to validate the structure, delimiter, and data of every file.
· Prepared test cases based on the business requirements and data mapping documents.
· Tested Sqoop scripts used to transfer data between RDBMS and HDFS.
· Tested HiveQL queries with partitioning, dynamic partitions, and buckets.
· Compared the metadata, data, and structure of each file in each environment.
· Raised bugs in JIRA/Proton as per the defect lifecycle.
· Executed various test cases in HP Proton and captured the necessary test evidence.
CLIENT : Union Bank of Switzerland
PROJECT : UBS DataLake
RESPONSIBILITIES :
The objective was to validate data and functional logic in tables containing customer-related data.
· Go through the report requirements and make sure that all clarifications are resolved with the respective BAs.
· Complete test scenario/case writing activities during the development stage within the assigned timeline.
· Actively follow up with Leads/BAs/Developers on test case review and sign-off, defect closure, etc.
· Ensure that test cases completely cover the functional and non-functional requirements of reports and are mapped to the requirements through the RTM for the assigned modules.
· Provide regular status updates to the Test Lead in the agreed format.
· Report issues in JIRA, track them, and interact with the Developers for bug fixes.
· Re-test resolved defects and bring the defects to a logical closure within the agreed timelines.
· Report work progress and any problems faced to the Test Lead.
· Provide knowledge transfer (KT) to new joiners in the testing team and guide them technically.
· Execute the test cases in HP Proton and capture test evidence.