Summary
Overview
Work History
Education
Skills
Additional Information
Certification
Timeline
Generic

UTTAM KEDIA

Data Scientist
Bangalore,KA

Summary

Highly analytical and process-oriented data engineering lead and scientist focusing on data cleaning, processing, applying data mining techniques, doing statistical analysis, and building high-quality prediction systems to discover the information hidden in vast amounts of data.

Overview

13
13
years of professional experience
4
4
years of post-secondary education
2
2
Certifications

Work History

Senior Manager

CAPGEMINI SE
09.2021 - Current

Data Scientist

Client - Vangaurd : FAS
Bangalore, IN
09.2021 - 01.2022
  • Provided comprehensive analysis and recommend solutions to address complex business problems and issues using data from internal and external sources and applied advanced analytical methods to assess factors impacting growth and profitability across product and service offerings.
  • Developed quarterly roadmaps based on impact, effort, and test coordination, working with stakeholders to achieve short-term and long-term goals.
  • Evaluated processes and data, identifying productivity gains and sales growth within digitization segment.
  • Build and train production-grade ML models on large-scale datasets to solve various business use cases.
  • Partner with data engineers on data quality assessment, cleansing, and analytics
  • Leveraged mathematical techniques to develop engineering and scientific solutions.
  • Tested and validated models for accuracy of predictions in outcomes of interest.
  • Developed polished visualizations to share results of data analyses.

Technical Solution Lead Engineer

Client - DB : EAP
Bangalore, IN
01.2020 - 09.2021
  • Design and implement product features in collaboration with business and technology stakeholders
  • Drive the implementation of new data management projects and re-structure of the current data architecture
  • Cleaned, prepared, and optimized data at scale for ingestion and consumption
  • Delivering the application module including analytics solution on AWS cloud
  • Build the data pipeline from feed area to lake area using all Hadoop technologies along with Python, PySpark, and hive/impala.
  • Helping the team to resolve the issues, deploying the solution, and supporting the go-live.
  • Built product feature lists with customers and internal stakeholders.
  • Evaluated trends to understand competitive environments and assess current strategies.
  • Mentor and develop other data engineers in adopting best practices

Assistant Consultant

TATA CONSULTANCY SERVICES
05.2008 - 08.2021

Senior Data Engineer

Rolls Royce : Data Lab
Bangalore, IN
01.2019 - 01.2020
  • Designed and built reusable components, frameworks, and libraries at scale to support analytics products
  • Drive collaborative reviews of design, code, test plans and dataset implementation performed by other data engineers in support of maintaining data engineering standards
  • Identified and solved issues concerning data management to improve data quality
  • Cleaned, prepared, and optimized data at scale for ingestion and consumption
  • Troubleshoot complex data issues and perform root cause analysis to proactively resolve product and operational issues
  • Ran statistical analyses within the software to process large datasets.

Data Engineer

Client - JPMC : AWM
Bangalore,, US
01.2013 - 01.2019
  • Project: “AWM Strategic
  • Analyzed complex data and identified anomalies, trends, and risks to provide useful insights to improve internal controls.
  • Collaborated with the team on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Employed data cleansing methods, significantly enhancing data quality.
  • Authored specifications for data processing tools and technologies.
  • Compiled, cleaned, and manipulated data for proper handling.
  • Developed polished visualizations to share results of data analyses.

ETL Developer

JP Morgan Chase
Bangalore, Delaware
01.2011 - 01.2013
  • Designed integration tools to combine data from multiple, varied data sources such as RDBMS, SQL and big data installations.
  • Designed and created ETL code installations, aiding in transitions from one data warehouse to another.
  • Collaborated with business intelligence staff at customer facilities to produce customized ETL solutions for specific goals.
  • Wrote and optimized in-application SQL statements.
  • Interpreted data models for conversion into ETL diagrams and code.
  • Designed data models for complex analysis needs.

Developer

BT Wholesale Calls
Kolkata, IN
01.2008 - 01.2011
  • Developed programs from the ground up using a measured, market-focused approach to eliminate waste and streamline the implementation cycle.
  • Collaborated with other developers to identify and alleviate software errors and inefficiencies.
  • Revised, modularized, and updated old code bases to modern development standards, reducing operating costs and improving functionality.

Education

Bachelor of Technology - Information Technology

Sikkim Manipal Institute of Technology, SMUHMTS University
06.2003 - 06.2007

Skills

    Python

Machine learning

Data analysis

Big Data technologies

AWS Cloud technologies

PySpark

Additional Information

  • Experience in working with customer-centric algorithm models and tailoring them to each customer as required and constructing ETL processes and extracting actionable insights from large databases.
  • Formulating statistical, machine learning, or mathematical solutions to test and optimize automated solutions to business problems Provide easy-to-understand visualization of complex data sets using descriptive and predictive technologies.
  • Worked on PySpark for big data processing and transformation on a high volume of data to build the solution for business analytic.
  • Good knowledge of statistics & complex mathematics including vectors, matrix operations, etc.
  • Understanding of data collection pipeline creation using Stream sets and migrating the data into Snowflake.
  • Exposure to distributed data/computing tools: Map/Reduce, Hadoop, Hive, Spark, or related Big Data technologies
  • Have exposure in Docker and Kubernetes Technologies and have deployed the Container in OpenShift Container Enterprise Platform PaaS in Fabric
  • Experience/knowledge in BI tools such as Tableau and Visualization using python libraries matplotlib, plotly, and seaborn
  • Have designed and deployed large-scale and distributed system software in a cloud environment
  • Good understanding of probability and statistics and good in interpreting and analyzing the results using statistical techniques
  • Analyze complex data elements and systems, data flow, dependencies, and relationships to contribute to conceptual physical and logical data models
  • Ave hands-on skills in sourcing, manipulating and analyzing large volumes of data including SQL and NoSQL databases
  • Passionate about generating insights from data to solve business problems
  • Flexibility to wear many hats and adapt to a fast-changing environ-ment.

Certification

AWS Certified Solutions Architect – Associate

Timeline

Senior Manager

CAPGEMINI SE
09.2021 - Current

Data Scientist

Client - Vangaurd : FAS
09.2021 - 01.2022

AWS Certified Solutions Architect – Associate

08-2021

Google Associate Cloud Engineer

03-2021

Technical Solution Lead Engineer

Client - DB : EAP
01.2020 - 09.2021

Senior Data Engineer

Rolls Royce : Data Lab
01.2019 - 01.2020

Data Engineer

Client - JPMC : AWM
01.2013 - 01.2019

ETL Developer

JP Morgan Chase
01.2011 - 01.2013

Assistant Consultant

TATA CONSULTANCY SERVICES
05.2008 - 08.2021

Developer

BT Wholesale Calls
01.2008 - 01.2011

Bachelor of Technology - Information Technology

Sikkim Manipal Institute of Technology, SMUHMTS University
06.2003 - 06.2007
UTTAM KEDIAData Scientist