Week 2 of 8

Week 2: Data, ML, and How Models Learn

Data handling, feature thinking, evaluation, and classical ML before the LLM layer.

Week Thesis

What the machine expects from you.

This lesson is about data shaping as engineering work, not notebook theater. You are learning how raw tables become trustworthy model inputs.

Bad data silently poisons everything downstream. If your features are inconsistent, mislabeled, or leaky, your model quality and your product decisions become fiction.

Think of the dataset as an interface contract between the real world and the model. Every column carries assumptions about meaning, freshness, allowable values, and transformation history.

This lesson teaches the decision boundary between common supervised learning tasks and the discipline of choosing the simplest model that fits the job.

Lesson Stack

Three dense lessons, one enforced deliverable.

Lesson

What survives the week.

audit

ML Pipeline Audit

A structured audit describing data prep, evaluation design, leakage risks, and privacy boundaries.

Deliverable

A simple ML pipeline with evaluation and a leakage audit.

Each week leaves behind portfolio evidence that compounds into the final SaaS and its operating narrative.

Start learning Back to curriculum

Week 2: Data, ML, and How Models Learn

What the machine expects from you.

Three dense lessons, one enforced deliverable.

Data Shaping With Pandas and NumPy

Classification, Regression, and Model Choice

Evaluation, Leakage, and GDPR Boundaries

What survives the week.

ML Pipeline Audit

A simple ML pipeline with evaluation and a leakage audit.