Summary
Overview
Work History
Education
Skills
Projects
Certification
Languages
Timeline
Generic

GOKUL N

Kozhikode

Summary

Data Scientist with hands-on experience in Python, machine learning, statistical modeling, and NLP. Skilled at transforming complex, raw datasets into actionable insights that drive strategic decision-making and operational improvements. Passionate about developing innovative AI solutions to solve real-world problems, enhance business outcomes, and accelerate digital transformation.

Overview

1
1
Certification

Work History

Data Science Intern

360DigiTMG
Bengaluru
01.2025 - 07.2025
  • Worked on two end-to-end real-world data science projects focused on forecasting, automation, and deep learning. Responsibilities included data preprocessing, model development, evaluation, and deployment using Python and Streamlit.

Education

Certificate Program - Data Science

360DigiTMG
Bengaluru, Karnataka
01.2024

Bachelor - Computer Application

Oriental School Of Hotel Management
Lakkidi, Wayanad
01.2023

Skills

  • Python and Flask
  • Data analysis and visualization
  • Machine learning and NLP
  • Database management with MySQL
  • Cloud services with AWS and S3
  • Data preprocessing

Projects

1. Optimization of bus ticketing demand and forecasting

Client: a leading bus transportation company

  • Developed machine learning models (ARIMA, Random Forest, Gradient Boosting) to forecast future bus ticket demand using historical sales data (~14,000 rows).
  • Performed EDA, missing value imputation, and outlier treatment using Pandas and NumPy.
  • Deployed the best-performing ARIMA model using Streamlit for real-time use by transportation stakeholders.
  • Achieved forecasting performance with RMSE ≈ 112.5 and MAPE ≈ 40%.

 2. Automated transcript analysis to optimize recruitment decisions

 Client: One of the leading HR-tech companies that provides end-to-end recruitment solutions for top enterprises globally. They conduct thousands of interviews weekly across different geographies and roles, leading to high manual effort in evaluating and shortlisting candidates.

  • Built a deep learning pipeline to classify interview transcripts (Fit vs. Not-Fit) using models like TextCNN, BiLSTM with Attention, and ELECTRA.
  • Engineered features like sentiment, red flags, and skill count from unstructured text data.
  • Achieved ~74.8% accuracy with TextCNN; deployed using Streamlit for real-time HR decision support.
  • Addressed challenges like label imbalance, text noise, and metadata sparsity.

Tools & Technologies Used:

Python, Pandas, NumPy, Scikit-learn, PyTorch, Hugging Face Transformers, ARIMA, Streamlit, Matplotlib, MongoDB

Certification

  • Certificate Program on Data Science, 360DigiTMG, 2024
  • Career Essentials in Data Analysis by Microsoft and LinkedIn, Nov 10, 2024

Languages

Malayalam
First Language
English
Intermediate (B1)
B1

Timeline

Data Science Intern

360DigiTMG
01.2025 - 07.2025

Certificate Program - Data Science

360DigiTMG

Bachelor - Computer Application

Oriental School Of Hotel Management
GOKUL N