
SATISH KARUTURI

Chennai

Summary

Dynamic software professional with over a decade of experience in the IT sector, specializing in Big Data and Analytics, with a strong foundation in Data Warehousing technologies. Expertise in the Hadoop Ecosystem, including proficiency in tools such as MapReduce, HDFS, Apache Spark, and Apache Hive, to deliver impactful data-driven solutions. Proven success in leveraging Quantexa tools for ETL processes and scenario development to effectively detect fraudulent activities within trading and trade banking contexts. Recognized for optimizing queries through partitioning and bucketing while creating robust Apache NiFi pipelines for seamless data ingestion and management across various RDBMS platforms, with a commitment to fostering team collaboration and achieving high-quality results.

Overview

10 years of professional experience
1 Certification

Work History

Big Data Engineer

Standard Chartered GBS
12.2023 - Current

Project: Trade Fraud Risk Management

Description: Trade Fraud Risk Management (TFRM) aims to reduce trade fraud by creating client-specific Fraud & Anti-Money Laundering (AML) scenarios that leverage big data from multiple sources.

Technologies: Quantexa, Apache Spark, Scala, Python, HDFS, Hive, Kibana and Control-M

Roles and Responsibilities:

  • Developed ETL processes using Apache Spark for efficient data transformation, including extracting and parsing data from various internal and external sources.
  • Developed scenarios to identify high-risk customers and invoices for fraud analysis.
  • Collaborated with cross-functional teams to optimize data architecture and storage solutions.
  • Played an integral role in designing and developing the ingestion of MANTAS India data from Standard Chartered's private Azure data lake into the on-premises environment.
  • Played a key role in the ECDP -> Client Central and Hogan -> EBBS migration deliveries. Leveraged Quantexa’s ETL and scoring pipeline for the solution. Liaised with cross-functional teams on requirements analysis and documentation.
  • Played a critical role in designing and developing multiple new interfaces for the existing fraud detection framework built with Quantexa, improving the bank’s ability to identify trade fraud.
  • Orchestrated a data pipeline via on-premises Control-M jobs.
  • Conducted performance tuning and troubleshooting of big data applications for enhanced efficiency.
  • Led initiatives to ensure compliance with data governance and security standards across projects.
  • Analyzed complex datasets to derive insights that informed strategic business decisions.
  • Supported Business as Usual (BAU) initiatives in the bank by delivering Data Quality Excellence on top of the existing framework.
  • Troubleshot project-related issues and prioritized project activities to reduce impact.

Senior Associate

Synechron Technologies Pvt Ltd
09.2021 - 12.2023

Project: Trade Fraud Risk Management

Client: Standard Chartered GBS

Description: Trade Fraud Risk Management (TFRM) aims to reduce trade fraud by creating client-specific Fraud & Anti-Money Laundering (AML) scenarios that leverage big data from multiple sources.

Technologies: Quantexa, Apache Spark, Scala, Python, HDFS, Hive, Kibana and Control-M

Roles and Responsibilities:

  • Developed ETL processes using Apache Spark for efficient data transformation, including extracting and parsing data from various internal and external sources.
  • Developed scenarios to identify high-risk customers and invoices for fraud analysis.
  • Collaborated with cross-functional teams to optimize data architecture and storage solutions.
  • Played an integral role in designing and developing the ingestion of MANTAS India data from Standard Chartered's private Azure data lake into the on-premises environment.
  • Played a key role in the ECDP -> Client Central and Hogan -> EBBS migration deliveries. Leveraged Quantexa’s ETL and scoring pipeline for the solution. Liaised with cross-functional teams on requirements analysis and documentation.
  • Played a critical role in designing and developing multiple new interfaces for the existing fraud detection framework built with Quantexa, improving the bank’s ability to identify trade fraud.
  • Orchestrated a data pipeline via on-premises Control-M jobs.
  • Conducted performance tuning and troubleshooting of big data applications for enhanced efficiency.
  • Led initiatives to ensure compliance with data governance and security standards across projects.
  • Analyzed complex datasets to derive insights that informed strategic business decisions.
  • Supported Business as Usual (BAU) initiatives in the bank by delivering Data Quality Excellence on top of the existing framework.
  • Troubleshot project-related issues and prioritized project activities to reduce impact.

Senior Data Engineer

innData Analytics Pvt Ltd.
04.2019 - 06.2021

Project: Digital Transformation & Data Engineering Initiative

Client: Myanma Posts and Telecommunications

Description: MPT is the first and leading telecommunications company in Myanmar, providing both fixed and mobile telecommunication services to the people and enterprises of Myanmar. MPT BI-IOS collects data from different sources, transforms it according to the business logic, and stores it in the Hadoop File System. Reports and downstream and upstream feeds are then generated from the Hadoop storage layer by analyzing and computing the data.

Technologies: HDFS, YARN, Apache Spark, Apache NiFi, Apache Hive, Ranger, Superset, Presto, Apache Kafka, Sqoop, PostgreSQL and GitLab.

Roles and Responsibilities:

  • Designed and implemented scalable data pipelines to enhance data accessibility and reliability.
  • Collaborated with business analysts and technical leads to analyze business requirements and technical specifications.
  • Led cross-functional teams in developing data models that improved analytics capabilities.
  • Designed Apache NiFi pipelines to process data from different sources and store it in the Hadoop File System.
  • Optimized ETL processes, reducing data processing time and improving system performance.
  • Designed data pipelines using Apache Spark to consume and transform data from Apache Kafka topics.
  • Involved in writing Spark applications to generate Call Data Record related KPIs.
  • Involved in creating Sqoop jobs to import data from Greenplum (GP) to the Hadoop File System.
  • Communicated with internal and external clients to discuss specific requirements, managing client expectations as an indicator of quality.

Data Engineer

InnData Analytics Pvt Ltd.
05.2018 - 03.2019

Project: Healthcare Data Analytics

Client: Syncrasy Labs

Description: Syncrasy Labs is a healthcare company that collects data related to nearby hospitals, patients, and doctors across the US. The purpose of this project is to provide physician details to users based on their search queries, previous health transactions, and nearby locations.

Technologies: HDFS, Apache Hive, Apache Spark, StreamSets, Apache Ranger

Roles and Responsibilities:

  • Collaborated with cross-functional teams to define data requirements and ensure alignment with business objectives.
  • Designed StreamSets data pipelines for efficient data transformation, including extracting and parsing data from RDBMS sources.
  • Involved in creating Hive Tables and Views to store the processed data.
  • Involved in creating Ranger policies and applying row-level filtering and masking to Hive data.
  • Involved in maintaining log messages.

Data Engineer

InnData Analytics Pvt Ltd.
10.2015 - 04.2018

Client: CheckBac Inc.

Description: CheckBac is a low-cost alcohol monitoring product that produces user data daily in JSON format. The data initially lands in HDFS and, once available there, is loaded into Hive tables.

Technologies: HDFS, Apache Hive, HBase, Ranger, Java, MySQL, Tomcat, Eclipse, Maven

Roles and Responsibilities:

  • Developed web page interfaces such as user registration, login, and login-based access control for registered users, using HTML, JSP, and JavaScript.
  • Collaborated with business analysts and technical leads to analyze business requirements and technical specifications.
  • Involved in creating Hive tables and views.
  • Involved in writing cron jobs to automatically remove or move files from HDFS.
  • Involved in creating Ranger policies to restrict user permissions.

Education

Master of Computer Applications (MCA) - Computer Applications Development

Koushik College of Engineering
Visakhapatnam, India
06-2013

B.Sc. - Computer Science

Sasi Degree College
Velivennu
06-2010

Intermediate - Intermediate

Vidya Sagara Junior College
Tadepalligudem, India
05-2007

Skills

  • Big Data Tools: Hadoop, MapReduce, HDFS, Apache Spark, Apache Hive, Apache NiFi, Apache Sqoop, Apache Ranger and Apache Superset
  • Programming Languages: Java, Scala & Python
  • Frameworks: Quantexa
  • Scripting Languages: Bash Scripting
  • Platform: Windows, Ubuntu
  • Databases: MySQL, PostgreSQL
  • Data warehousing & modeling
  • Cloud Technologies: Azure
  • Web Services: REST
  • Agile methodology
  • Git version control

Certification

  • Quantexa User Foundations Program by Quantexa Academy
  • Databricks Fundamentals by Academy Accreditation
  • Generative AI Fundamentals by Academy Accreditation

Languages

  • English: Advanced (C1)
  • Telugu: Bilingual or Proficient (C2)
  • Tamil: Elementary (A2)

Personal Details

  • Date of Birth: 02-07-1990
  • Nationality: Indian
  • Marital Status: Married
