Machine Learning Intermediate

Machine Learning Bootcamp

A beginner-friendly, 7-week project-based bootcamp designed to take you from Python basics to deploying your first Machine Learning model. Through hands-on practice, you will master essential data manipulation, build predictive algorithms, and develop an end-to-end, industry-ready application to kickstart your career in data science.

7 Weeks
Project-Based Learning

About this Course

A beginner-friendly, 7-week project-based bootcamp designed to take you from Python basics to deploying your first Machine Learning model. Through hands-on practice, you will master essential data manipulation, build predictive algorithms, and develop an end-to-end, industry-ready application to kickstart your career in data science. This comprehensive course is designed to take you from intermediate concepts to advanced mastery. You will build real-world skills through hands-on practice and expert guidance.

Course Syllabus

Module 1: Python Fundamentals for Data Science

  • Introduction to the Python ecosystem and Jupyter Notebooks.
  • Basic data types, variables, and operators.
  • Control flow (if/else statements, loops) and functions.
  • Mini-Project: Python Logic Builder – Creating a simple text-based calculator and interactive data dictionary.

Module 2: Data Wrangling & Manipulation

  • Introduction to NumPy arrays and mathematical operations.
  • Data manipulation with Pandas (Series and DataFrames).
  • Filtering, sorting, and grouping data.
  • Handling missing values and data cleaning techniques.
  • Mini-Project: Messy Data Cleaner – Transforming a raw, unstructured CSV file into a clean, analytical-ready dataset.

Module 3: Exploratory Data Analysis (EDA) & Visualization

  • Principles of effective data storytelling.
  • Creating basic plots (line, bar, scatter) with Matplotlib.
  • Advanced statistical visualizations with Seaborn.
  • Identifying correlations, distributions, and outliers.
  • Mini-Project: Insight Dashboard – Designing a static visual report that answers three key business questions from a provided dataset.

Module 4: Introduction to Machine Learning & Regression

  • Supervised vs. Unsupervised Learning concepts.
  • Understanding Linear Regression and its assumptions.
  • Feature engineering: Encoding categorical variables and feature scaling.
  • Evaluating regression models (MAE, MSE, RMSE, R-Squared).
  • Mini-Project: Real Estate Predictor – Building a model to estimate house prices based on features like location, size, and age.

Module 5: Foundations of Classification

  • Understanding Classification problems and use cases.
  • Implementing Logistic Regression.
  • Introduction to Decision Trees and how they split data.
  • Evaluating classification models: Accuracy, Confusion Matrix.
  • Mini-Project: Health Diagnosis App – Classifying patient risk levels (e.g., high/low risk of diabetes) using basic medical data.

Module 6: Advanced Classification & Model Tuning

  • Ensemble learning techniques: Random Forest Classifier.
  • Advanced evaluation metrics: Precision, Recall, and F1-Score.
  • Handling imbalanced datasets.Hyperparameter tuning using Grid Search and Cross-Validation.
  • Mini-Project: Spam Detector – Training an optimized Random Forest model to classify emails or SMS messages as spam or legitimate.

Module 7: Unsupervised Learning & Clustering

  • Finding hidden patterns without labeled data.
  • Implementing K-Means Clustering.Determining the optimal number of clusters (Elbow Method).
  • Evaluating clusters using the Silhouette Score.
  • Mini-Project: Customer Segmentation – Grouping mall shoppers into distinct marketing personas based on purchasing behavior.

Module 8: Model Deployment & End-to-End Pipeline

  • Saving and loading trained models using Pickle/Joblib.
  • Introduction to building web interfaces with Streamlit.
  • Best practices for UI/UX in data applications.
  • Connecting the machine learning pipeline to the web app.
  • Final Delivery: Deployment of the Capstone Project.

Capstone Project

Featured Project

End-to-End Student Success Predictor

You will act as a Data Scientist for an e-learning platform. Your objective is to analyze historical student data and build a predictive web application that identifies students who are at risk of dropping out or failing a course. You will handle the entire machine learning lifecycle—from cleaning raw engagement logs and training an optimized classification model, to deploying an interactive dashboard that instructors can use to input student metrics and receive real-time risk assessments.

Core Project Goal

Apply all the skills you've learned to build a production-ready application from scratch. This project serves as your portfolio piece.

Key Features:

  • Automated Data Pipeline: A preprocessing script that automatically cleans missing values, encodes categorical data (like course type or student background), and scales numerical inputs.
  • Predictive Engine: A tuned machine learning classification model (e.g., Random Forest) specifically optimized for high Recall, ensuring that "at-risk" students are not missed.
  • Interactive Web Dashboard: A user-friendly Streamlit interface featuring a sidebar where instructors can manually input a student's study hours, quiz scores, and login frequency to get an instant pass/fail probability.
  • Insights Generator: An automated visual element within the app that displays "Feature Importance," showing instructors exactly which factors (e.g., low quiz scores vs. low attendance) are driving the student's current risk level.
Total Investment
Rp 7,000,000
One-time payment
Duration: 7 Weeks
Mode: Online
Certificate of Completion