Hi, I'm Ridwan

Data Scientist | Python, SQL, ML, Power BI | Building Data-Driven Solutions

Ridwan Yusuf

Data Analysis

Machine Learning

SQL & Databases

About Me

Passionate about turning data into actionable insights that drive real impact

I'm a data scientist with a background in computer science and more than 5 years of experience creating educational content in Mathematics and Computers & Technology.

These days, I build practical, end-to-end data solutions using Python, SQL, and Machine Learning. My work spans MLOps, predictive modeling, and deployment, with recent projects ranging from a stock market ETL pipeline to health-related risk prediction models.

I'm especially interested in solving problems in finance and healthcare, but I don't box myself in. If data can be used to uncover insights or drive impact, I'm all in.

Feel free to explore my projects, and if you'd like to collaborate or chat, don't hesitate to reach out!

3+ Years Experience
11+ Projects Completed
2 Domains Specialized
3 Clients Satisfied

My Expertise

A comprehensive toolkit for solving complex data challenges across industries

Data Analysis & Visualization

Python MySQL SQLite3 MongoDB Power BI A/B Testing Interactive Dashboards Documentation

Machine Learning & NLP

Predictive Modeling Clustering Statistical Analysis Classification Scikit-learn Text Classification NLTK spaCy

Tools & Technologies

API Development Streamlit Flask Git Docker AWS (EC2) Jupyter Notebook Google BigQuery Apache Airflow MLflow Google Colab CI/CD

Featured Projects

Real-world solutions showcasing expertise in data science, machine learning, and analytics

Macroeconomics

Economic Recovery Dashboard

Built a real-time dashboard that tracks economic recovery using FRED API data, a FastAPI backend, and prompt-engineered responses for an interactive frontend.

Python FastAPI FRED API Prompt Engineering LLM
View Project
Financial Markets

Stock Market ETL Pipeline

Developed a robust ETL pipeline to extract, transform, and load historical stock market data, ensuring data quality and availability for analysis.

Python Apache Airflow MLflow SQLite3 Docker FastAPI
View Project
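
As a rough illustration of the extract-transform-load pattern described above, here is a minimal sketch using only Python's standard library. It is not the project's code: the sample CSV rows, the `prices` table, and the `run_etl` helper are all hypothetical, and the real pipeline is orchestrated with Apache Airflow rather than called directly like this.

```python
import csv
import io
import sqlite3

# Hypothetical sample rows; the real pipeline pulls historical market data.
RAW_CSV = """date,ticker,close
2024-01-02,ABC,101.50
2024-01-03,ABC,
2024-01-04,ABC,103.25
"""


def extract(text):
    """Parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Drop rows with a missing close price and cast the rest to float."""
    return [
        (r["date"], r["ticker"], float(r["close"]))
        for r in rows
        if r["close"].strip()
    ]


def load(conn, records):
    """Insert cleaned records into SQLite, replacing duplicate keys."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS prices ("
        "date TEXT, ticker TEXT, close REAL, PRIMARY KEY (date, ticker))"
    )
    conn.executemany("INSERT OR REPLACE INTO prices VALUES (?, ?, ?)", records)
    conn.commit()


def run_etl(conn, text=RAW_CSV):
    """Run all three stages and return the number of stored rows."""
    load(conn, transform(extract(text)))
    return conn.execute("SELECT COUNT(*) FROM prices").fetchone()[0]
```

Keying the table on `(date, ticker)` and using `INSERT OR REPLACE` means a rerun of the same batch does not create duplicate rows, which is one simple way to keep a scheduled job safe to retry.
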
Finance

FraudDetect System

Built a real-time fraud detection system using highly imbalanced data (0.3% fraud), achieving at least 83% precision, recall, and F1-score on both classes.

Python Scikit-learn Streamlit Feature Selection Data Resampling
View Live Demo
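
To show why precision, recall, and F1 matter more than accuracy at a 0.3% fraud rate, here is a small sketch in plain Python. The transaction counts are hypothetical and chosen only to match the 0.3% class ratio; they are not results from the FraudDetect system, and `classification_metrics` is an illustrative helper.

```python
def classification_metrics(tp, fp, fn):
    """Return (precision, recall, f1) for one class from confusion counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical test set: 10,000 transactions, 30 of them fraud (0.3%).
total, fraud = 10_000, 30

# A baseline that labels every transaction "legitimate" looks accurate...
baseline_accuracy = (total - fraud) / total

# ...but it never catches a single fraud case.
baseline_fraud_metrics = classification_metrics(tp=0, fp=0, fn=fraud)

print(f"accuracy: {baseline_accuracy:.1%}")
print("fraud precision/recall/F1:", baseline_fraud_metrics)
```

The do-nothing baseline scores 99.7% accuracy while its fraud-class precision, recall, and F1 are all zero, which is why per-class metrics are the ones reported above.
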
Education

Knowledge Gap Analysis

Segmented students using clustering analysis and visualized grade predictions and knowledge gaps with Power BI for targeted support.

Python Pandas Scikit-learn Power BI
View Dashboard
Drug Discovery

hERG Blocker Classification

Developed a Dockerized ML pipeline for cardiotoxicity prediction with 89.4% ROC-AUC, using engineered molecular features and SMOTE for improved minority-class detection.

Python Scikit-learn SMOTE Docker
View Code
Cryptocurrency

Bitcoin Volatility Prediction

Predicted Bitcoin price volatility using GARCH, delivering actionable insights on daily (3.63%) and annual (69.31%) fluctuations to optimize trading strategies.

Python GARCH Statsmodels API Time Series
View Analysis
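
The daily and annual figures above appear to be linked by the usual square-root-of-time rule. The sketch below assumes that rule and a 365-day year (Bitcoin trades every calendar day); both are my assumptions about how the numbers relate, not a statement of how the project computed them.

```python
import math

daily_vol = 0.0363          # daily volatility quoted above (3.63%)
periods_per_year = 365      # assumption: one observation per calendar day

# Square-root-of-time scaling: valid when daily returns are roughly
# uncorrelated with a stable variance.
annual_vol = daily_vol * math.sqrt(periods_per_year)

print(f"annualized volatility: {annual_vol:.2%}")
```

This gives about 69.35%, which is close to the 69.31% quoted above. The small gap is what you would expect if the annual figure was derived from an unrounded daily value slightly below 3.63%.
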
Healthcare

MedNet Diagnosis System

Built a medical diagnosis system with 85% accuracy for early disease detection and applied topic modelling to derive insights from patient data.

Python Machine Learning Scikit-learn NLP Topic Modelling
Try Demo
Recommendation

Financial Education Recommender

Developed a content-based recommender using cosine similarity and clustering to personalize financial education and boost engagement.

Python Scikit-learn Clustering Content-Based
Try Recommender
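
For readers unfamiliar with content-based recommendation, here is a dependency-free sketch of the cosine-similarity step. The catalogue entries and the `recommend` helper are hypothetical; the project itself uses Scikit-learn and adds clustering, which this sketch leaves out.

```python
import math
from collections import Counter

# Hypothetical catalogue of financial-education items and their keywords.
CATALOGUE = {
    "budgeting-101": "monthly budget saving expenses income",
    "intro-to-stocks": "stocks investing market risk returns",
    "index-funds": "investing index funds market returns diversification",
}


def vectorize(text):
    """Turn text into a bag-of-words term-count vector."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(item_id, top_n=1):
    """Rank the other catalogue items by similarity to the given item."""
    target = vectorize(CATALOGUE[item_id])
    scores = {
        other: cosine_similarity(target, vectorize(text))
        for other, text in CATALOGUE.items()
        if other != item_id
    }
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


print(recommend("intro-to-stocks"))
```

Here "index-funds" is recommended for someone who viewed "intro-to-stocks" because the two share the terms "investing", "market", and "returns", while "budgeting-101" shares none.
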
Sports Analytics

UCL 2024/25 Prediction Model

Engineered key features and built a regression model with 84% accuracy. Correctly predicted 7 of the top 8 teams and 14 of the next 16 teams in the final standings.

Python Regression Feature Engineering
View Predictions
Public Safety

Atlanta Crime Analytics

Created an interactive crime dashboard using Power BI, providing key insights into crime patterns, COVID-19 effects, and forecast trends.

Power BI Data Cleaning Excel Feature Selection
View Dashboard
Maternal Health

CareWomb Platform

Developed CareWomb, a maternal health platform integrating predictive analytics and personalized recommendations to support safer pregnancies.

Python Scikit-learn AWS (EC2)
Visit Platform

Let's Work Together

Have an interesting project or just want to connect? I'd love to hear from you.