Summary
Overview
Work History
Education
Skills
Certification
Websites
Work Availability
Work Preference
Timeline
Generic
Thomas John

Thomas John

Kochi

Summary

Data Science and Big Data Analytics expert with over 15 years of programming experience and 7 years focused on scalable solutions using python, pyspark, and machine learning. Proven success in leading high-impact projects, optimizing the performance of Google Looker as a Business Intelligence tool, and enhancing financial data analysis for better decision-making. Currently pursuing a Master's in AI and Machine Learning to further strengthen my expertise and drive continued professional growth.

Overview

15
15
years of professional experience
1
1
Certification

Work History

Senior Data Scientist

Solute GmbH
Kochi
01.2022 - Current

Traffic Partner Blacklisting Project:

  • Led online shop performance enhancement project, focusing on improving traffic quality.
  • Developed system using Python and PySpark for comprehensive traffic and profit analysis.
  • Optimized budget allocation using linear programming, enabling efficient blacklist creation.
  • Established performance-based blacklisting criteria, thus improving the overall gross profit and traffic quality.

Reporting Tool Evaluation Project:

  • Led a comprehensive 3-month project to evaluate and recommend the optimal reporting tool for the organization, focusing on future integration with cloud-based solutions and enhancing the current on-premise big data infrastructure.
  • Conducted an in-depth analysis of Looker, Power BI, and the existing on-premise cube-based reporting tool.
  • Evaluated tools based on criteria such as web-based development, version control, collaboration features, self-service capabilities for business users, and performance.
  • Collaborated with cross-functional teams to gather requirements and understand current and future reporting needs.
  • Developed and executed test cases to compare performance, usability, and integration capabilities of each tool.
  • Presented findings and recommendations to senior management, highlighting the pros and cons of each solution, and aligning them with our strategic goals.

Manager - Data Science Team

Capgemini
04.2021 - 01.2022
  • Conducted financial data analysis using Python, Hive, and SQL for accurate forecasting.
  • Utilized Apache Spark for large-scale data processing, delivering timely financial insights.
  • Followed agile methodologies and managed projects with Jira, improving collaboration and efficiency.
  • Migrated data science projects from SAS to Python, enhancing flexibility and scalability.
  • Applied Dataiku DSS for efficient data engineering, including preparation and transformation.
  • Trained team members on Dataiku DSS and Apache Spark.
  • Engaged in collaborative coding and version control with GitHub.
  • Presented financial forecasts using Tableau, aiding data-driven decision-making for stakeholders.

Data Scientist

International School of Engineering (INSOFE)
01.2018 - 04.2021

Semiconductor Wafer Defect Signature Analysis:

Developed a computer vision model to analyze scanned images of semiconductor wafers, detecting defect locations and shapes (e.g., circles, lines, blobs).

AI-based approach for MRO Optimization and Similar Parts Detection:

Designed an AI-driven solution to identify and eliminate duplicate spare parts from a client's inventory of over one million items, enhancing inventory management and approval rates.

Data Scientist Intern

International School of Engineering (INSOFE)
05.2017 - 12.2017

Log analysis-based failure monitoring in Hadoop cluster

  • Built a monitoring system to monitor the resource usage of the cluster by monitoring the applications running on YARN.
  • Designed and built statistical analysis models using Apache Spark ML

Senior Software Engineer

Xoriant
09.2016 - 05.2017
  • A network controller used to improve the network traffic monitoring and analysis
  • Got excellent feedback from customers after implementing a rule based classification algorithm to automatically prioritize the ACL rules to redirect the traffic to the corresponding monitoring services.

Senior Software Engineer

L&T Technology Services
07.2014 - 09.2016
  • The Linux Foundation's OPEN-Orchestrator Project & A SDN based service aware Network Controller and management system

Software Engineer

Avaya
07.2012 - 12.2013
  • Web-based network management solution that offers configuration, provisioning and troubleshooting for a wide range of technologies
  • The system managed multiple network devices, and provided management for services across different network elements

Project Engineer

Wipro
08.2009 - 07.2012
  • Web-based software application that helps to automate the daily operations for the service provisioning of access NEs, such as DSLAMs

Education

Master of Technology - MTech - Artificial Intelligence and Machine Learning

BITS Pilani Work Integrated Learning Programme
04.2025

Bachelor of Technology - BTech - Electronics and Communication Engineering

Rajiv Gandhi Institute of Technology, Kottayam
01.2008

Skills

  • Python and PySpark
  • Data Analysis with SQL (BigQuery, Hive)
  • Data Science Platforms (DataIku DSS)
  • Business Intelligence (Google Looker)
  • Machine Learning and Deep Learning AI
  • Agile Project Management

Certification

  • Certificate Program in Big Data Analytics and Optimization, International School of Engineering (INSOFE)
  • Dataiku Core Designer, Dataiku, 04/2021, 04/2023
  • Dataiku ML Practitioner, Dataiku, 05/2021, 05/2023
  • Dataiku Advanced Designer, Dataiku, 05/2021, 05/2023
  • Build LookML Objects in Looker Skill Badge, Google
  • Applying Advanced LookML Concepts in Looker, Google
  • Analyze and Visualize Looker Data Skill Badge, Google
  • Analyzing and Visualizing Data in Looker, Google
  • Amazon Web Services Cloud Practitioner, Amazon Web Services (AWS), 06/2019, 06/2022

Work Availability

monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Work Preference

Work Type

Full Time

Work Location

Remote

Timeline

Senior Data Scientist

Solute GmbH
01.2022 - Current

Manager - Data Science Team

Capgemini
04.2021 - 01.2022

Data Scientist

International School of Engineering (INSOFE)
01.2018 - 04.2021

Data Scientist Intern

International School of Engineering (INSOFE)
05.2017 - 12.2017

Senior Software Engineer

Xoriant
09.2016 - 05.2017

Senior Software Engineer

L&T Technology Services
07.2014 - 09.2016

Software Engineer

Avaya
07.2012 - 12.2013

Project Engineer

Wipro
08.2009 - 07.2012

Master of Technology - MTech - Artificial Intelligence and Machine Learning

BITS Pilani Work Integrated Learning Programme

Bachelor of Technology - BTech - Electronics and Communication Engineering

Rajiv Gandhi Institute of Technology, Kottayam
Thomas John