Padmashri Kokkalaki

Data Engineer
Bangalore

Summary

An enthusiastic Data Engineer with 2+ years of experience in data engineering. Responsible for gathering and evaluating business needs and objectives, and for collecting and storing data. Worked with Apache Hive, PySpark, Oozie, Informatica Intelligent Cloud Services (IICS), Oracle SQL, MS SQL, and Azure Data Lake Storage. Has worked on projects that follow Agile and Scrum processes. Looking forward to working as a data scientist, data engineer, or machine learning engineer.

Overview

3 years of professional experience
7 years of post-secondary education
3 languages

Work History

Data Engineer

IBM India
BANGALORE
03.2022 - Current
  • Experience in data warehousing, covering requirements gathering, data modelling, effort estimation, ETL development, unit and integration system testing, and implementation
  • Responsible for development, support, and maintenance of ETL (Extract, Transform, Load) processes using Informatica Intelligent Cloud Services (IICS)
  • Developed, implemented and maintained data analytics protocols, standards and documentation
  • Worked on complex mappings and workflows to meet business needs, ensuring transformations are reusable to avoid duplication
  • Extensively used ETL to extract data from sources (flat files, DB2, Oracle, SQL Server) and load it into targets (flat files, Parquet files, SQL Server)
  • Responsible for performance tuning at the mapping, session, source, and target levels for Slowly Changing Dimension (SCD) Type 1 and Type 2 data loads. Responsible for data quality analysis to determine cleansing requirements
  • Used ETL to extract data, mainly from Salesforce, and deliver it as Parquet files
  • Developed an ETL pipeline to move data from Reference 360 to MS SQL Server through Azure Data Lake Storage (ADLS) layers.

Data Engineer

IBM India
BANGALORE
03.2020 - 03.2022
  • Worked as a Data Engineer for a USA-based automotive manufacturing client
  • Experienced in creating and managing data infrastructure using Oracle SQL, Apache Hive, and Spark
  • Developed datasets using Hadoop Hive, transforming and analyzing data required for business purposes, and documented the work with data flow diagrams
  • Provided full support to clients from initial client interaction and requirements analysis through design documentation, coding, bug fixing, and testing
  • Performed operations and monitored jobs in YARN, handling 5 operations per day
  • Automated various time-consuming data engineering operations using PySpark scripts and Oozie workflows and schedules, eliminating manual effort
  • Interacted with the client to understand requirements and produce the technical solution architecture.

Education

Bachelor of Engineering - Electrical And Electronics

K. L. E Institute of Technology
Hubballi, Karnataka, India
06.2015 - 06.2019

12th Grade - Science

Vidyaniketan PU College
Hubballi, Karnataka, India
06.2013 - 04.2015

10th Grade (ICSE Board)

Parivarthan Gurukul School
Hubballi, Karnataka, India
06.2012 - 03.2013

Skills

Machine learning

Additional Information

Training and Certification:

Advanced Program in Computational Data Science, IISC, March 2022 - Present

  • Sound knowledge of Machine Learning and Deep Learning fundamentals
  • Supervised learning models: Linear Regression, Logistic Regression, Decision Trees, Random Forest, and classification modeling
  • Unsupervised learning models: clustering
  • Knowledge of machine learning tools and libraries such as TensorFlow and scikit-learn
  • Sound knowledge of exploratory data analysis to analyze and summarize datasets.

Academic Projects:

  • Predicted loan defaulters using a Logistic Regression model on credit risk data and calculated credit scores
  • Built a model to recognize emotion from speech using ensemble learning
  • Performed customer segmentation for an online retailer using an unsupervised clustering technique (K-Means)
  • Predicted hourly bike-sharing counts based on features including weather, day, time, humidity, wind speed, and season.

Soft Skills

  • Excellent communication skills in English (spoken and written)
  • Open-minded, independent problem-solver and highly motivated team player
  • Decision making, planning and leadership capabilities
  • Quick learner, willing to adapt to new technologies
  • Well trained to work in large teams