Summary
Overview
Work History
Education
Skills
Timeline
Personal projects
Githublink
Generic

Nakul Soni

New Delhi

Summary

Detail-oriented Data Engineer designs, develops and maintains highly scalable, secure and reliable data structures. Accustomed to working closely with system architects, software architects and design analysts to understand business or industry requirements to develop comprehensive data models. Proficient at developing database architectural strategies at the modeling, design and implementation stages.

Overview

5
5
years of professional experience

Work History

Data Engineer

Australian Red Cross Society
2022.03 - 2024.03
  • Created pipelines, and data flow in Azure data factory to connect data source to Azure storage container
  • Created Azure synapse link to move data from power apps to Azure blob storage
  • Create triggers in Azure data factory to run pipelines
  • Set up alerts in ADF to ensure failure of pipeline
  • Developed code in Pyspark for data transformation, data cleansing, and ETL process
  • Perform source system data analysis to manage source-to-target data mapping.

Data Science Intern

Truuth
2021.06 - 2021.11
  • Implement Machine learning technique that will detect text from image document
  • Apply Bounding box technique to draw box on features
  • Apply RBG method in Python to check whether black-and-white document is original
  • Extract features and match them with source feature
  • Optimize current image falsification process and make current process more efficient in detecting false image
  • Improve model accuracy and make it more efficient in detecting text information from image documents
  • Automate process of document collection and storage
  • Define and test approach to handle classification errors.

Data Analyst

R1RCM Pvt limited
2019.06 - 2020.02
  • Automated moving of Data from FTP and loading them into the database using SSIS and scheduled it as a daily job
  • Created an SSIS package for loading the data and used multiple transformations in SSIS to extract data from various sources
  • Move data from the old server to the new server using SSIS Packages in Business Intelligence Development Studio
  • Build reports in Power BI to change the data format or add new reports in the existing dashboard of the Power BI desktop
  • Connect Power BI to SQL server to fetch user insurance information and to check for which treatment users are insured
  • Create Power BI reports to convert medical codes into explanations and then combine them with other data to generate user-specific reports for the hospitals
  • Design, create, automate, and maintain sophisticated programs that extract, convert, and load data.
  • Utilized data visualization tools to effectively communicate business insights.

SQL Developer

Conduent Business service LLP
2018.08 - 2019.06

Created SQL Server objects such as tables, views, stored procedures, functions, triggers along with other SQL developers

  • Created and Modified tables, views, Indexes, User Defined Functions, Stored Procedures, cursors in SQL Server 2016
  • Designed Logical and Physical models of Relational Database
  • Created indexes and views, stored procedures to enhance database/application performance
  • Created stored procedures to extract data and stored in the data warehouse
  • Involved in Performance Optimization of Queries & Stored Procedures by exploring Query Plans, blocking queries, Identifying missing indexes etc
  • Monitored query using query optimizer and tuned queries and procedures to boost database performance.

Business Technology Analyst

Technodata analytics services Pvt limited
2017.02 - 2018.08
  • Designed and developed (ETL) packages to extract, transform, and load data from CSV files or FTP systems to the data warehouse
  • Build SSIS packages to match users’ information and then extract their primary and secondary information from the Database
  • Match the user’s information with the Melissa data component in SSIS, to check the authentication of the users
  • Create match rules in SSIS to match the user’s information with the database information
  • Proposed solutions to improve efficiency and reduce expenses
  • Applied transformation rules to change the data types of the fields
  • Perform the ETL process to produce clean and transformed data for the stakeholders.

Education

Master of Science - Data Science

Macquarie University
Sydney, NSW
01.2022

Bachelor of Technology - Information Technology

Guru Gobind Singh Indraprastha University
Delhi, Delhi
06.2016

Skills

  • TSQL
  • Python
  • Py-spark
  • Machine learning
  • Deep learning model
  • Database creation and maintenance
  • JIRA
  • GitHub
  • Data warehousing
  • ETL Development
  • Data analytic
  • Data Architect
  • Data Modelling
  • Azure Databricks
  • SQL Server
  • Azure Data Factory

Timeline

Data Engineer

Australian Red Cross Society
2022.03 - 2024.03

Data Science Intern

Truuth
2021.06 - 2021.11

Data Analyst

R1RCM Pvt limited
2019.06 - 2020.02

SQL Developer

Conduent Business service LLP
2018.08 - 2019.06

Business Technology Analyst

Technodata analytics services Pvt limited
2017.02 - 2018.08

Master of Science - Data Science

Macquarie University

Bachelor of Technology - Information Technology

Guru Gobind Singh Indraprastha University

Personal projects

  • Develop a web application in Javascript.
  • Implement a machine learning model that will predict the Genre of the book using the summary. Use Label Encoder that will convert the text genre into features and then apply TF-IDF vectorization to compute the word count in the summary and use Logistic regression with OnevsRestClassifier to predict the genre.
  • Analyse global coronavirus data track the number of cases in each country and examine countries' situation based on the data.
  • Experience in gathering and uploading data On Apache Kafka and using that data for report creation.
  • Construct a report in Imply to display the number of children studying in different NSW Schools.
  • Build a project to detect the Botnet detection using a machine-learning algorithm. Applied machine learning models like Decision Trees, KNN Classifier, and Neural Networks found the accuracy in all the models to determine the most suitable model for botnet detection based on accuracy.
  • Implement facial recognition that will detect the emotion through the image. Done feature extraction through principal component analysis applied CNN model and applied Support vector machine algorithm and find out the accuracy and R2 value.
  • Analyzed the Twitter data and did NLTK on text data to extract the keywords and then applied Named Entity recognition to find the named entity of the keywords.
  • Experience in Writing the Map Reduce function in Python for data analysis.

Githublink

https://github.com/NSoni25

Nakul Soni