Work Preference

Summary

Overview

Work History

Education

Skills

Languages

Personal Information

Websites

Certification

Timeline

Open To Work

Siddharth Shankar Saraf

Pune,Maharashtra

Work Preference

Job Search Status

Open to work

Desired start date: Flexible

Desired Job Title

Responsible AI SpecialistData ScientistSpecialist – Data EngineeringML/AI Engineer

Work Type

Full TimeConsultingGig Work

Location Preference

RemoteOn-SiteHybrid

Location: Pune, MaharashtraBengaluru, KAHyderabad, TG

Open to relocation: Yes

Salary Range

₹199000/yr - ₹200000/yr

Important To Me

Career advancementWork-life balanceHealthcare benefitsWork from home option

Summary

Dynamic Responsible AI Specialist at Accenture with expertise in Python and PySpark, driving compliance and innovation in AI systems. Achieved 85% accuracy in false positive identification, enhancing operational efficiency. Proven ability to collaborate effectively with cross-functional teams, ensuring adherence to responsible AI principles while delivering impactful risk assessment reports.

Overview

14

14

years of professional experience

8

8

Certifications

Work History

Responsible AI Specialist

Accenture

Pune

10.2024 - Current

Designed and developed LLM-based applications with integrated responsible AI safeguards throughout development lifecycle.

Reviewed over 50 technical documents and AI use cases, categorizing them per EU AI Act risk levels.
Developed automated NLP analysis pipeline utilizing sentence transformers to align technical specifications with responsible AI principles.
Identified 15 high-risk use cases needing compliance enhancements for proactive risk mitigation.
Collaborated with legal and compliance teams to validate risk assessments and ensure regulatory adherence.
Delivered more than 20 detailed risk assessment reports, informing AI project approval decisions exceeding $2M.
Established comprehensive risk severity scoring methodology incorporating technical, ethical, and regulatory dimensions.
Presented complex technical risks as actionable insights to C-suite executives and board members.

Data Scientist

UBS Business Solutions

Pune

11.2021 - Current

Responsibilities & Key Contributions

Delivered effective Big data and machine learning solutions and solve complex technical problems.
Designed and developed Machine Learning/Deep Learning applications for predictive modelling, synthetic data generation, sentiment analysis and trend analysis and deployed in Azure.
Designing & developing ML applications for one of the largest Multinational bank for- Anti Money Laundering project.
Finding pattern and trends in data sources and sharing the outlook with seniors and decision makers.
Used Spark, Databricks Azure to write ML application that leverage big data processing frameworks.
Find client/user sentiment from the emails they send for products and services offered.
Used machine Learning Algorithms such as Random Forest, SVM, Regression, XGBoost to predict carbon emission of application servers.
Involved in different stages of ML pipeline like Data cleaning, EDA, Feature selection, Model creation and Hyperparameter tuning and Model deployment.
Used Deep Learning algorithms like ANN, CNN and RNN and frameworks like Tensorflow, ScikitLearn to implement Supervised, Unsupervised and Reinforcement Learning.
Achieved ML accuracy with 92% for one of projects by experimenting with different algorithms.
Developing, designing ML application for my projects as well as other project as contribution.
Driving the interaction and partnership between the managers, developers and other stakeholders to ensure active cooperation in identifying as well as defining analytical needs and data needs.
Automating work flows via Shell Scripting, Apache NiFi and Airflow.
Monitoring Model deployed and involved in Data Engineering.
Updated existing ML application to have more metrics and 18% performance increase.
Deployed model in Google cloud platform built with Tensorflow 2.1.

Specialist – Data Engineering

Larsen & Toubro Infotech

Bengaluru

07.2020 - 11.2021

Responsibilities & Key Contributions

Designed and developed ETL production pipelines using Hive, PySpark to optimize the extraction of data from raw and disparate sources to clean, integrated analytic datasets used for predictive modeling.
Designing & developing Big data application for one of the largest Multinational bank for- Anti Money Laundering project.
Involved in data ingestion from various source systems and processing in HDFS using PySpark.
Used Spark SQL and Apache Hive to analyze, store data.
Build ETL pipeline for data to be used for predictive modelling.
Used machine Learning Algorithms such as Decision Trees, Random Forest, SVM, Regression, K-Means to classify and predict events.
Involved in different stages of ML pipeline like Data cleaning, EDA, Feature selection, Model creation and Hyperparameter tuning and Model deployment.
Used Deep Learning algorithms like ANN, CNN and RNN and frameworks like Tensorflow, ScikitLearn to implement Supervised, Unsupervised and Reinforcement Learning.
Creating Runbooks for application process flow.
Developing, designing ML application.
Communicating with different interfacing teams to create data ingestion pipeline.
Automating work flows via Shell Scripting, CA Autosys and Apache NiFi.
Monitoring Model deployed and involved in Data Engineering.
Built different models for regression, classification and clustering and tested them to highest accuracy.
Deployed model in AWS, used Transfer Learning for applications which had similar problem statement and selected from model zoo.

Solution Delivery Lead

Deloitte Touche Tohmatsu Limited (Deloitte US-India)

Bengaluru

08.2019 - 06.2020

Responsibilities & Key Contributions

Designing product which is used by hundreds of different Life Science and Healthcare clients using PySpark, Linux, Machine Learning and Python.
Developed a machine learning modelling framework in python which showed 20% improvement in sales per customer with an incremental revenue of ~5M.
Leveraged rule-based extraction learning models to predict likelihood of a customer making a purchase.
Developing applications and enhancement for proprietary product.
Migrating existing python (pandas) code into PySpark.
Processed TBs of data using Spark SQL and Apache Hive.
Build ETL pipeline from data to be used for predictive modelling.
Used machine Learning Algorithms such as Decision Trees, Random Forest, SVM, Regression, K-Means to classify and predict events.
Application development with automation and scheduling.
Developed application in pyspark, pandas, hive.
Reporting mechanism of all working scripts and interface connections e.g. Hadoop (Hive)-Tableau interface.
Involved in building machine learning models, testing them and deploying them in production.
Integrating with cloud services like AWS, GCP and Azure.

Data Engineer

Accenture

Mumbai

09.2018 - 08.2019

Responsibilities & Key Contributions

Designing and developing applications which can handle high volume data processing and can integrate with other technologies e.g. Talend, SAP and SQL.
Developing Big data application for FMCG client.
Implementing Big data ingestion mechanism with Apache Kafka and scheduling with NiFi.
Processed TBs of data using Spark SQL and Spark Streaming.
Used Scala language to write Spark application with RDDs, Spark SQL and Spark Streaming concepts.
Used Spark Streaming to stream data from external sources using Kafka service.
Used machine Learning Algorithms such as Decision Trees, Random Forest, SVM, Regression, K-Means to classify and predict events.
Application development with automation and scheduling.
Reporting mechanism of all working scripts and interface connections e.g. Hadoop (Hive)-Tableau interface.
Hortonworks security implementation with Kerberos authentication.

Application Development Senior Analyst

Accenture

Mumbai

11.2017 - 09.2018

Responsibilities & Key Contributions

Developing Distributed application which can integrate with other outbound system and can drive insight from the source Data.
Developing Big data application for Health Care and Retail clients.
Implementing Big data ingestion mechanism with Sqoop and scheduling with Oozie and Control-M.
Processed TBs of data using Spark and used Optimization Techniques/Tuning like Kryo serialization and Memory Tuning, etc.
Used PySpark to write distributed code accessing Spark engine to access data from various source systems and of multiple file types e.g. JSON, TEXT, CSV, PARQUET and process and store in my project.
Used Spark Streaming to stream data from external sources using Kafka service.
Used machine Learning Algorithms such as Decision Trees, SVM, Regression, K-Means to classify and predict events.
Application development with automation and scheduling.
Reporting mechanism of all working scripts and interface connections e.g. Hadoop-R interface and SAP-HDFS interface.
Apache Sentry security implementation.

Software Engineer

Tech Mahindra Ltd

Pune

04.2012 - 09.2017

Responsibilities & Key Contributions

Developing Big Data application for UK Telecom giant and analyzing Network data to predict speed of cables based on loss and other parameters.
Extracting data from Source systems: RDBMS (Oracle Database) and HIVE (HDFS FileSystem).
Implementing Big data ingestion mechanism with Sqoop and populating data in my project HDFS.
Processing TBs of data using Spark and Hive.
Scheduled various types of jobs with interdependencies using Oozie.
Used machine Learning Algorithms such as Decision Tree, SVM, Regression, K-Means to classify and predict events.
Converting Hive based applications to Spark Framework.
Developed and designed Big data applications to overcome challenge of analyzing large amount of Network data.
Involved in automating manual processes with Shell scripts and Scheduled them with multiple applications to gain performance improvement both in terms of Time and Cost.

Education

B-Tech - Electronics and Telecommunication Engineering

Biju Pattnaik University of Technology

Bhubaneswar, Odisha

10.2011

Skills

Python and PySpark
Large language models
Prompt engineering
Retrieval-augmented generation
Model fine-tuning
Evaluation and testing
Linux scripting
Hadoop ecosystem
Apache Spark
Scikit-learn and PyTorch
Deep neural networks
Recurrent neural networks

Sqoop and Kafka
Hive data warehousing
Pandas and Keras
Oozie workflow management
NiFi data flow management
MySQL database management
Red Hat Linux and Ubuntu
Windows and Mac OS environments
Git version control
Docker containerization
Cloud platforms: GCP and Azure

Languages

Fluent

Personal Information

Hobbies: I love reading books especially Technical journals and magazines., I write poems, stories and Blogs(Technical)., I am an ever learner and learn latest technologies and tools., I learn Advanced Physics and Mathematics in my spare time., I participated in two Hackathons: Data processing Hackathon and AI Hackathon, from which Won the competition for Data processing Hackathon., My publications: https://siddharthsaraf.medium.com/

Websites

https://siddharthsaraf.medium.com/

Certification

Microsoft Certified: Azure Fundamentals
Microsoft Certified: Azure Data Fundamentals
Quantennium: Machine Learning in Finance
Microsoft Certified: Azure AI Fundamentals
AWS: Generative AI with Large Language Models
Google: Cloud Digital Leader Certification
Google: Deploy an Agent with Agent Development Kit (ADK)
Accenture: Reinvention with Agentic AI

Timeline

Responsible AI Specialist

Accenture

10.2024 - Current

Data Scientist

UBS Business Solutions

11.2021 - Current

Specialist – Data Engineering

Larsen & Toubro Infotech

07.2020 - 11.2021

Solution Delivery Lead

Deloitte Touche Tohmatsu Limited (Deloitte US-India)

08.2019 - 06.2020

Data Engineer

Accenture

09.2018 - 08.2019

Application Development Senior Analyst

Accenture

11.2017 - 09.2018

Software Engineer

Tech Mahindra Ltd

04.2012 - 09.2017

B-Tech - Electronics and Telecommunication Engineering

Biju Pattnaik University of Technology

Similar Profiles

Akash JajuAkash Jaju
Responsible AI Specialist at Accenture Solution Pvt LtdResponsible AI Specialist at Accenture Solution Pvt Ltd
Avinash YerramilliAvinash Yerramilli
Senior Data Engineer at SkidmoreSenior Data Engineer at Skidmore
Yashas Besanahalli VasudevaYashas Besanahalli Vasudeva
Data Engineer and AI Specialist at Luminy DigitalData Engineer and AI Specialist at Luminy Digital
Naveen Maranayakanahalli BeluraiahNaveen Maranayakanahalli Beluraiah
Specialist, AI Data Engineer at DefinitySpecialist, AI Data Engineer at Definity