📥 Download Course Datasets

Save each file in the same folder as your Python script or Jupyter notebook. Then, with pandas imported as pd, load it with pd.read_csv("filename.csv") for CSV files or pd.read_excel("filename.xlsx") for Excel files.

👶 In simple terms: Think of each file as a digital table (like an Excel sheet). Your code opens that table so Python can run calculations and draw graphs from it. Keeping the file in the same folder as your script means Python can find it by name alone, without a long path.
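The loading step described above can be sketched as a small helper. This is a minimal sketch, not part of the course material: the function name load_dataset and its folder parameter are hypothetical, and it assumes pandas is installed (reading .xlsx files also needs the openpyxl package).

```python
from pathlib import Path

import pandas as pd


def load_dataset(filename, folder="."):
    """Load a dataset into a DataFrame, picking the reader by file extension.

    `load_dataset` and `folder` are illustrative names, not a pandas API.
    """
    path = Path(folder) / filename
    if not path.exists():
        # A missing file usually means it was saved in a different folder.
        raise FileNotFoundError(
            f"{path.resolve()} not found; save the file next to your script or notebook."
        )
    suffix = path.suffix.lower()
    if suffix == ".csv":
        return pd.read_csv(path)
    if suffix in {".xlsx", ".xls"}:
        # pandas needs the openpyxl package to read .xlsx files.
        return pd.read_excel(path)
    raise ValueError(f"Unsupported file type: {suffix}")
```

With a downloaded file in place, load_dataset("Housing.csv").head() would show the first five rows, which is a quick way to confirm that the file was found and parsed as expected.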

Housing.csv

House prices and features (area, bedrooms, bathrooms, mainroad, guestroom, etc.). Used in Linear Regression and Statistics.

Download

Hotel Reservations.csv

Hotel booking data. Used in Statistics (Chi-Square, T-Test), Hypothesis Testing, Missing Values & Outliers, Clustering.

Download

AB_testing_data.csv

A/B test results (daily conversion fractions for variants A and B). Used in the A/B Testing lesson.

Download

market_baskets_data.csv

Market basket (shopping cart) transactions. Used in Market Basket Analysis with Apriori.

Download

Apriori_Algorithm.xlsx

Excel file for Apriori / market basket practice. Used with Market Basket Analysis.

Download

auto_mpg.csv

Car data: mpg, cylinders, horsepower, weight, etc. Used in Regularization (Ridge/Lasso) and Bias–Variance.

Download

cars_data.csv

Car evaluation: buying price, maintenance, doors, persons, lug_boot, safety → class (unacc/acc/good/vgood). Used in the Random Forest Code Walkthrough.

Download

diabetes.csv

Health metrics and diabetes outcome. Used in Explainable AI (feature importance, SHAP).

Download

heart_disease_dataset.csv

Heart disease risk factors and outcome. Used in Logistic Regression (classification).

Download

dataset.csv

Alternative classification dataset (same structure as heart_disease_dataset.csv). Use it for Logistic Regression practice.

Download

insurance.csv

Insurance-related data. Used in assignments and regression practice.

Download

titanic3.xlsx

Titanic passenger data (Excel). Used in Statistics and practice.

Download

Practice.xlsx

Practice exercises in Excel. Used with Statistics and Maths lessons.

Download

All datasets are part of the FKTI Data Science curriculum. Use them only for learning.