Muhammad Danial Azmi

Data Scientist & AI Engineer

Building end-to-end intelligent systems across the full data stack—from infrastructure and pipelines to machine learning models and GenAI applications. I transform complex data challenges into scalable, production-ready solutions that drive business impact.

01 — ABOUT

Experience & Background

With 4+ years of experience spanning fintech, media, and education technology, I specialize in architecting and deploying AI/ML solutions that solve real-world problems. My approach combines full-stack data capabilities with cutting-edge GenAI to build systems that are intelligent, scalable, and business-aligned.

I excel across the entire data lifecycle—designing robust data platforms and pipelines, generating data insights, developing machine learning models (from traditional ML to LLMs), implementing MLOps best practices, and creating GenAI-powered applications. Whether it's building recommendation systems, RAG-based chatbots, or internal AI agents for analytics, I focus on delivering production-grade solutions with proper monitoring, CI/CD, and cloud infrastructure.

Currently at Tupai AI, I own the complete data stack for an education tech platform, where I've built everything from data infrastructure to an internal AI agent that democratizes data access across the entire company.

4+
Years Experience
10+
Projects Completed
4
Industries Served

Education

Bachelor of Computer Science (Hons) with Specialization in Data Science
Multimedia University, Malaysia (2018–2021)

02 — PROJECTS

Featured Work

Tupai AI 2025–Present

Internal Data Analytics AI Agent

Company-wide AI agent enabling non-technical teams to query and analyze data through natural language. Handles complex analytics requests, generates insights, creates dashboards, and democratizes data access across the organization.

Python LLM Docker Analytics
Tupai AI 2025–Present

End-to-End Data Platform & ML Infrastructure

Architected complete data infrastructure for education tech platform including data pipelines, ML model deployment, recommendation systems for personalized mathematics learning, and analytics dashboards.

Python MLOps Data Engineering GCP Analytics
Tupai AI 2025–Present

Adaptive Learning Recommendation System

Personalized mathematics learning system combining Bayesian Knowledge Tracing with traditional ML to model student knowledge states. Identifies weak points in real-time and generates tailored learning plans, optimizing educational pathways based on individual performance patterns.

Python Machine Learning Recommendation Systems Analytics
REV Media Group 2023–2025

Multi-Channel RAG Chatbot System

End-to-end multi-turn question-answering chatbot with Retrieval-Augmented Generation. Deployed on GCP using FastAPI and Haystack, integrated across multiple channels including WhatsApp for enhanced customer engagement.

Python LLM GCP Docker
REV Media Group 2023–2025

Hyper-Personalization Recommendation Engine

Geolocational hyper-personalization system that drove substantial increases in organic user engagement across multiple products using neural networks and location-based targeting.

Python Machine Learning GCP Recommendation Systems
REV Media Group 2023–2025

MLOps Pipeline & Cloud Migration

Migrated on-premise ML models to GCP Vertex AI with enhanced monitoring, observability, and CI/CD integration using Kubeflow and Docker for improved scalability and automation.

MLOps Docker GCP Kubernetes
DataMicron Systems 2023

Client-Facing AI/ML Solutions

Led research and POC development for multiple client projects, identifying ML use cases and translating business problems into AI solutions. Managed end-to-end project lifecycles from consultation through deployment.

Python Machine Learning Analytics
MoneyLion 2022

ETL Pipeline Cloud Migration

Led migration of data team's ETL pipelines and dbt SQL transformations from AWS Redshift to Snowflake. Migrated workflows in Airflow to optimize for Snowflake's architecture, ensuring zero downtime and improved query performance.

Data Engineering AWS Snowflake
MoneyLion 2022

NLU Model Optimization via A/B Testing

Designed and executed multiple A/B testing experiments to enhance Natural Language Understanding models for customer support. Analyzed unstructured text data to identify pain points and optimize model performance, resulting in reduced operational costs.

Python NLP Machine Learning Analytics

Discord Tutor Chatbot

AI chatbot on Discord that provides personalized learning experiences with LaTeX rendering for mathematical calculations using Haystack and the Discord API.

Python NLP

E-commerce CNN Classifier

Fine-tuned ResNet50 model on 1M+ images for e-commerce product classification, automatically categorizing products up to 3 levels deep using transfer learning.

Python TensorFlow Machine Learning

Steam Review Sentiment Analysis

NLP project mining millions of Steam game reviews, performing data validation and preprocessing, then applying neural networks and traditional ML for aspect-based sentiment analysis.

Python NLP Machine Learning
03 — SKILLS

Technical Expertise

Languages

Python SQL JavaScript Bash

AI/ML & GenAI

LLMs GenAI RAG TensorFlow PyTorch Scikit-learn Hugging Face NLP Neural Networks Recommendation Systems

Data Engineering

Airflow dbt Snowflake Data Pipelines ETL/ELT Pandas Vector Databases

MLOps & DevOps

Docker Kubernetes Kubeflow CI/CD MLOps Git Model Deployment

Cloud & Infrastructure

GCP AWS Azure Vertex AI

Frameworks & Tools

FastAPI Haystack LangChain Pinecone Plotly Streamlit
04 — CONTACT

Get In Touch

I'm open to collaborations, opportunities, and conversations about AI, ML, and data engineering. Feel free to reach out via email or connect with me on LinkedIn.