👋 Hello, I'm Kenneth Leung
- Thanks for popping by!
- I'm an avid learner, bold builder, and curious explorer with a bias towards action. I enjoy finding and solving meaningful problems with data and technology, and having fun along the way.
- I welcome you to join me on a learning journey! Follow me on GitHub, Medium, and LinkedIn for a regular dose of practical, educational data science content.
- You can find my data science portfolio below, where every project and article was born out of inspiration, curiosity, and motivation. Feel free to reach out for a chat about topics we share an interest in!
👨‍🔧 Currently working on: (i) applied generative AI use cases, and (ii) Failed-ML, a compilation of high-profile ML failures. If you're keen to contribute, let's connect!
How to reach me
Portfolio Contents
- Computer Vision
- Database Management
- Data Extraction and Web Scraping
- Data Science Certification Guides
- Data Science Toolkit
- Data Science in the Real World
- Generative AI
- Insights from Data Science Seminars
- Machine Learning
- MLOps
- Natural Language Processing
- Networks and Graphs
- Sports Analytics
- Visualization
- Web Development
- Web3 and Metaverse
- Writing for DataCamp
- Writing Tips
Projects with
Computer Vision 👁️
| Title | Article | Repo |
|---|---|---|
| Classifying Images of Alcoholic Beverages with fast.ai v2 | ||
| Russian Car Plate Detection with OpenCV and TesseractOCR | 🔗 | |
| Evaluate OCR Output Quality with Character Error Rate (CER) and Word Error Rate (WER) | ||
| Top Python libraries for Image Augmentation in Computer Vision | ||
| ⭐ PyTorch Ignite Tutorial - Classifying Tiny ImageNet with EfficientNet | ||
| Practical Guide to Transfer Learning in TensorFlow for Multiclass Image Classification | ||
Database Management 🗄️
| Title | Article | Repo |
|---|---|---|
| PyMySQL - Connecting Python and SQL for Data Science | ||
Data Extraction and Web Scraping 🧰
| Title | Article | Repo |
|---|---|---|
| Using OneMap API to extract Singapore postal codes, coordinates and travel distance | - | 🔗 |
| A Detailed Web Scraping Walkthrough Using Python and Selenium | ||
Data Science Certification Guides 👨‍🎓
| Title | Article | Repo |
|---|---|---|
| 3 Steps to Get AWS Cloud Practitioner Certified in 2 Weeks | ||
| 3 Steps to Get Tableau Desktop Certified in 2 Weeks | - | |
Data Science Toolkit 🛠️
| Title | Article | Repo |
|---|---|---|
| Common Python codes for Data Wrangling | - | |
| Enhance your Python code’s readability with pycodestyle | 🔗 | - |
| Free Resources for Generating Realistic Fake Data | - | |
| Most Starred and Forked GitHub Repos for Data Science and Python | - | |
| Most Starred and Forked GitHub Repos for Data Science and R | 🔗 | - |
| Automatically Generate Machine Learning Code with Just a Few Clicks | - | |
| Read and Modify Image Metadata with Python | ||
| Top Tips to Google Search Like a Seasoned Data Scientist | - | |
| How to Swap Day and Month of Incorrectly Formatted Excel Dates | - | |
Data Science in the Real World 🌏
| Title | Article | Repo |
|---|---|---|
| Exploring Illegal Drugs in Singapore — A Data Perspective | 🔗 | |
| Pharmacokinetic Modeling of Drug Concentration Trajectories using Ordinary Differential Equations (ODE) and Global Optimization with Differential Evolution | - | |
| Healthcare’s AI Future — In Conversation with Andrew Ng and Fei-Fei Li | - | |
| Real-World Data Science Use Cases in the Insurance Industry | - | |
| ⭐ Failed-ML: Compilation of high-profile real-world examples of failed machine learning projects | - | |
Generative AI 🤖
| Title | Article | Repo |
|---|---|---|
| Generative AI Pharmacist - Macy | 🔗 | 🔗 |
Insights from Data Science Seminars 👨‍🏫
| Title | Article | Repo |
|---|---|---|
| Bridging AI’s Proof-of-Concept to Production Gap — Insights from Andrew Ng | - | |
Machine Learning 🎰
| Title | Article | Repo |
|---|---|---|
| Exploring Condominium Rental Prices with Web Scraping and Exploratory Data Analysis | 🔗 | |
| Using Ensemble Regressors to Predict Condominium Rental Prices | 🔗 | 🔗 |
| The Dying ReLU Problem, Clearly Explained | - | |
| Why Bootstrapping Actually Works | - | |
| ⭐ Assumptions of Logistic Regression, Clearly Explained | ||
| Data-Centric AI Competition - Tips and Tricks of a Top 5% Finish | ||
| Credit Card Fraud Detection with AutoXGB | ||
| ⭐ Micro, Macro & Weighted Averages of F1 Score, Clearly Explained | - | |
| Principal Component Regression - Clearly Explained and Implemented | 🔗 | |
| Quick Primer on Types of Missing Data and Imputation Techniques | 🔗 | - |
| Imputation of Missing Data in Tables with DataWig | ||
MLOps - Machine Learning Operations 👨‍🔧
| Title | Article | Repo |
|---|---|---|
| Key Learning Points from MLOps Specialization — Course 1/4 | 🔗 | |
| Key Learning Points from MLOps Specialization — Course 2/4 | ||
| Key Learning Points from MLOps Specialization — Course 3/4 | 🔗 | |
| Key Learning Points from MLOps Specialization — Course 4/4 | 🔗 | |
| ⭐ End-to-End AutoML Pipeline with H2O AutoML, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell | ||
Natural Language Processing 📑
| Title | Article | Repo |
|---|---|---|
| COVID-19 Vaccine — What’s the Public Sentiment? | ||
| Keyword Extraction and Analysis Pipeline with KeyBERT and Taipy | ||
Networks and Graphs 🌐
| Title | Article | Repo |
|---|---|---|
| How to Deploy Interactive Pyvis Network Graphs on Streamlit | ||
| A No-Code Approach to Building Knowledge Graphs | ||
Sports Analytics ⚽
| Title | Article | Repo |
|---|---|---|
| Combining Python and R for FIFA Football World Ranking Analysis | 🔗 | |
Visualization 📈
| Title | Article | Repo |
|---|---|---|
| Uniform Singapore Energy Price and Demand Forecast Dashboard (with Plotly Dash) | - | |
| Visualizing Fortune 500 Companies in a Bar Chart Race | ||
| How to Easily Draw Neural Network Architecture Diagrams | 🔗 | |
Web Development 🖥️
| Title | Article | Repo |
|---|---|---|
| From HTTP to HTTPS — Easily Secure Flask Web Apps With Talisman | 🔗 | - |
Web3 and Metaverse 👨‍💻
| Title | Article | Repo |
|---|---|---|
| The Web3 / Metaverse Glossary — A Keyword Guide to the Tech Future | - | |
Writing for DataCamp ✍️
| Title | Article | Repo |
|---|---|---|
| Democratizing Data in Government Agencies | - | |
| A Survey Into Data Governance Tools | 🔗 | - |
| Scaling Data Science With Data Governance | - | |
| 3 Reasons Why All Teams Should Learn SQL | - | |
| 3 Reasons Why All Teams Should Learn R | 🔗 | - |
| How Tableau Helps Your Organization Achieve Greater Data Insights | - | |
| How PowerBI Helps Your Organization Achieve Greater Data Insights | - | |
Writing Tips 📜
| Title | Article | Repo |
|---|---|---|
| Create a Clickable Table of Contents for Your Medium Posts | 🔗 | - |



