Summary
Overview
Work History
Education
Skills
Awards
Projects
Disclaimer
Websites, Portfolios and Profiles
Timeline
Generic

Faheem Uddin Tayyab Mohammed

London

Summary

Results-driven AI/ML data scientist and business intelligence consultant with over 3 years of experience recognized for delivering enterprise-grade analytics solutions across healthcare and financial sectors.

Overview

2
2
years of professional experience

Work History

AIML Data Scientist

Astro Data Ltd
02.2025 - Current

Client: Duradiamond, Perth

Project: Duradiamond Healthcare – Patient Portal

Project Description: Duradiamond Patient Access online appointment bookings and cancellations means less time spent on hold. Ensure patients get their regular medication when they need it with online repeat prescription ordering and automated delivery to their preferred pharmacy. Patients can view test results online, saving them from having to travel to the practice for an unnecessary appointment. Availability 24-hours a day means that patients don’t have to wait for practice opening times and can access services whenever they need. Patients can securely share their medical record – an important feature if they need care abroad, out of hours, or in an emergency situation. Shared GP record history allows patients to view an audit of where, when and why someone has accessed their record using a data sharing agreement.

Website: https://www.duradiamondhealth.com

Responsibilities:

  • Led end-to-end design and implementation of predictive models to optimize patient appointment bookings, repeat prescription forecasts, and test result delivery timelines using supervised learning techniques (Random Forest, XGBoost, Logistic Regression), improving service availability by 34%.
  • Performed advanced exploratory data analysis (EDA) and statistical modeling using Python (NumPy, Pandas, Seaborn, Scikit-learn) to identify patient behavior patterns, sentiment indicators, and cancellation risks, enhancing the personalization engine across the portal.
  • Developed and deployed machine learning pipelines on Azure Databricks and Azure Data Lake to support real-time recommendation systems and anomaly detection for secure medical record access, ensuring compliance with GDPR and HIPAA protocols.
  • Built and visualized patient engagement dashboards using Power BI and Excel, integrating insights from SQL Server and cloud storage layers to present actionable KPIs to stakeholders, and deployed deep learning models (CNNs, RNNs using TensorFlow & Keras) for OCR-based medical data classification.

Environment: Python, R, SQL, Pandas, NumPy, Scikit-learn, TensorFlow, Keras, Matplotlib, Seaborn, Azure Data Lake, Azure Databricks, AWS Sagemaker, GCP, Power BI, Tableau, Excel, SQL Server, Oracle, Jupyter Notebook, Anaconda, GitHub, TensorBoard.

AIML Data Scientist

Astro Data Ltd
02.2024 - 01.2025

Client: NHS

Project: NHS AI Assistant – Dr.GPT Bot

Project Description: The Dr.GPT Bot project aimed to develop an AI-powered assistant for the NHS TaskLearn Application, capable of answering quiz-based questions using a hybrid Retrieval-Augmented Generation (RAG) approach. Data was sourced from the TaskLearn App's MongoDB (ConvertedQuizzes collection) and ingested into Azure Data Lake Storage using Azure Data Factory. A pipeline was built to automate data extraction, transformation, and loading. The model architecture utilized FAISS for semantic search and a fallback mechanism invoking a fine-tuned Google FLAN model via Hugging Face Transformers. A Streamlit-based user interface was created to allow natural language interactions with the bot. The final solution provided reliable, context-aware responses, improving accessibility and engagement with quiz data for NHS users.

Website: http://65.0.204.147/ and http://65.0.204.147/api/

Responsibilities:

  • Designed and deployed an AI-powered chatbot (Dr.GPT) using a hybrid RAG (Retrieval-Augmented Generation) architecture, integrating FAISS-based semantic search and a fine-tuned Google FLAN transformer model via Hugging Face to enable accurate quiz question answering from NHS TaskLearn data.
  • Built robust ETL pipelines using Azure Data Factory to ingest quiz content from MongoDB into Azure Data Lake Storage, automating transformation workflows, ensuring scalable ingestion, and enabling real-time data access for downstream ML processes.
  • Implemented data preprocessing, EDA, and feature engineering using Python libraries (Pandas, NumPy, Scikit-learn) to clean, normalize, and encode question-answer pairs for model readiness, including exploratory analytics to assess text distributions, sentiment patterns, and keyword density.
  • Developed, fine-tuned, and evaluated transformer-based models for natural language understanding, applying techniques such as tokenization, cosine similarity, embeddings optimization, and fallback handling with confidence thresholds to ensure context-aware, reliable bot interactions.
  • Integrated the Dr.GPT solution into a Streamlit-based front-end UI, allowing NHS staff to interact with the model via natural language queries; supported secure API deployment, versioning via GitHub, and continuous integration pipelines using Azure DevOps.

Environment: Python, R, SQL, Pandas, NumPy, Scikit-learn, FAISS, Google FLAN, Hugging Face Transformers, Azure Data Factory, Azure Data Lake Storage, MongoDB, Streamlit, GitHub, Power BI, TensorFlow, Keras, Jupyter Notebook, Anaconda, Azure DevOps.

Education

Bachelors of Engineering and technology - Computer Science

Jawaharlal Nehru Technology University Hyderabad
Hyderabad, India
08.2021

Masters of Science - Cyber Security

Northumbria university
London, United Kingdoms
01.2025

Skills

  • Programming Languages: Python, R, SQL, DAX
  • Data Science & Machine Learning: Scikit-learn, TensorFlow, Keras, Hugging Face Transformers, Pandas, NumPy, FAISS
  • Deep Learning: CNN, RNN, LSTM, GANs, FLAN, Transformers
  • Natural Language Processing (NLP): Text Preprocessing, Sentiment Analysis, Tokenization, Embeddings, RAG Architecture
  • Data Visualization: Power BI, Tableau, Excel, Matplotlib, Seaborn
  • Data Engineering: Azure Data Factory, Azure Data Lake Storage, Azure Databricks, MongoDB, SQL Server
  • BI & Reporting: Power BI (Row-Level Security, DAX Measures, Time Intelligence), Excel Pivoting
    Databases: SQL Server, Oracle, MongoDB
  • Cloud Platforms: Microsoft Azure, AWS Sagemaker, Google Cloud Platform (GCP)
  • Big Data & Processing: Apache Spark (via Databricks), Streamlit
  • Version Control & CI/CD: GitHub, Azure DevOps, JIRA
  • Tools & IDEs: Jupyter Notebook, Anaconda, TensorBoard, SSMS
  • Statistical Analysis: Hypothesis Testing, Confidence Intervals, Probability Distributions, Sampling Techniques
  • Soft Skills: Stakeholder Engagement, Team Collaboration, Mentorship, Business Communication

Awards

  • Ethical Hacking Workshop, 2020-01-28, IIIT Hyderabad
  • Cyber disease ethical hacking competition, 2020-06-16, Aakaaar IIT Bombay

Projects

  • FACIAL EMOTION RECOGNITION OF A STUDENT USING CNN, Led a team of 4 and successfully completed 'FACIAL EMOTION RECOGNITION OF A STUDENT USING CNN'. The main objective of the project is, Facial expression is one of the most important forms of non-verbal communication. Facial expressions emit the feelings of a person, and it allows judging that person by others. Some can understand facial expressions of underlying emotions to some extent, whereas many of us cannot. Facial Expression recognition (FER) system is a system to recognize expressions from a person's face. It plays an important part in today's world in fields of Education, Mental disease diagnosis, and Human Social/Physiological interaction detection. Various methods of FER exist., 2021-01-01, 2021-06-01
  • COVID ANALYSIS PREDICTION THROUGH K-MEANS CLUSTERING, Led a team of 2 and successfully completed 'COVID ANALYSIS PREDICTION THROUGH K-MEANS CLUSTERING'. The main objectives of the Project were to quantify hospital-based outcomes and deaths, including in relation to sociodemographic characteristics and comorbidities as ascertained from hospital AND general practice data. To estimate the strength of association between these outcomes and sociodemographic and health characteristics., 2020-01-01, 2020-06-01

Disclaimer

I assure that if, I have given an opportunity I will carry out my duties to the best of my ability with utmost concentration, complete dedication and entire satisfaction of company's management.

Websites, Portfolios and Profiles

linkedin.com/in/mohammed-faheem-uddintayyab-1b7b23211

Timeline

AIML Data Scientist

Astro Data Ltd
02.2025 - Current

AIML Data Scientist

Astro Data Ltd
02.2024 - 01.2025

Bachelors of Engineering and technology - Computer Science

Jawaharlal Nehru Technology University Hyderabad

Masters of Science - Cyber Security

Northumbria university
Faheem Uddin Tayyab Mohammed