πŸ” TRUST & TRANSPARENCY

Explainable AI (XAI)

Understand why a model made a prediction: feature importance, SHAP, LIME, and explainer dashboards.

Why Explainability?

πŸ‘Ά In Simple Terms

Black-box models (e.g. complex trees, neural nets) give a prediction but not the reason. Explainable AI (XAI) answers: β€œWhich inputs (features) drove this output?” That builds trust, helps debug, and meets regulatory or fairness needs. Common tools: feature importance, SHAP (Shapley values), LIME (local approximations), and explainer dashboards (e.g. Plotly Dash).

Feature Importance

Tree-based models (Random Forest, XGBoost) expose feature_importances_, which rank how much each feature contributed to the model's splits. For linear models, coefficients show how each feature pushes the prediction up or down (put features on comparable scales first, or the magnitudes are not comparable).

from sklearn.ensemble import RandomForestClassifier
import pandas as pd

# Train a model (e.g. on diabetes.csv – download below)
df = pd.read_csv("datasets/diabetes.csv")
X = df.drop("Outcome", axis=1)   # features
y = df["Outcome"]                # target: 1 = diabetes, 0 = no diabetes
model = RandomForestClassifier(random_state=42)  # fixed seed -> reproducible importances
model.fit(X, y)

# Feature importances, highest first
for name, imp in sorted(zip(X.columns, model.feature_importances_), key=lambda x: -x[1]):
    print(f"{name}: {imp:.3f}")
πŸ“₯ Dataset for this lesson: diabetes.csv β€” save it in a datasets/ folder next to your script so the path datasets/diabetes.csv in the code resolves.
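For the linear-model side, here is a minimal sketch of reading coefficients. It uses synthetic data (via make_classification) rather than the lesson's CSV so it runs standalone, and standardizes features so the coefficient magnitudes are comparable:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the diabetes data: 4 features, binary outcome
X, y = make_classification(n_samples=200, n_features=4, random_state=42)

# Standardize so coefficient magnitudes are comparable across features
X_scaled = StandardScaler().fit_transform(X)

clf = LogisticRegression().fit(X_scaled, y)

# Positive coefficient -> feature pushes the prediction toward class 1,
# negative -> toward class 0
for i, coef in enumerate(clf.coef_[0]):
    print(f"feature_{i}: {coef:+.3f}")
```

Unlike tree importances, coefficients have a sign, so you also see the direction of each feature's effect.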

SHAP & LIME (Short)

SHAP assigns each feature a contribution (Shapley value) to the prediction for a single sample. LIME fits a simple, interpretable model (e.g. linear) around that sample to approximate the black box locally. Both help answer β€œwhy did the model say 1 for this patient?” Install with pip install shap lime, then apply their explainers to your trained model and a single row.
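The shap library handles the bookkeeping for you, but the underlying idea fits in a few lines. A toy sketch (pure Python, not the shap API): for a model with two features, a feature's Shapley value is its average marginal contribution over the orders in which it can β€œjoin” the prediction:

```python
# Toy model with known contributions: f(x1, x2) = 2*x1 + 3*x2
def f(x1, x2):
    return 2 * x1 + 3 * x2

x1, x2 = 1.0, 1.0
v_none = f(0, 0)        # baseline: both features "absent" (set to 0)
v_1 = f(x1, 0)          # only feature 1 present
v_2 = f(0, x2)          # only feature 2 present
v_both = f(x1, x2)      # both present

# Shapley value = average marginal contribution over the two orderings
phi_1 = ((v_1 - v_none) + (v_both - v_2)) / 2
phi_2 = ((v_2 - v_none) + (v_both - v_1)) / 2

print(phi_1, phi_2)   # 2.0 3.0 — matches the model's true weights
print(phi_1 + phi_2 == v_both - v_none)  # True: contributions sum to prediction - baseline
```

That last property (contributions add up exactly to prediction minus baseline) is what makes SHAP plots so readable. For real tree models, shap.TreeExplainer computes these values efficiently instead of enumerating orderings.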

Explainer Dashboard

Tools like ExplainerDashboard (Python) or Plotly Dash can wrap your model and SHAP/LIME to build an interactive dashboard: change inputs, see prediction and feature contributions. Great for presenting to stakeholders.
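As a rough sketch of how little code this takes, here is a minimal ExplainerDashboard launch script. It assumes pip install explainerdashboard and reuses the model, X, and y names from the Random Forest example above; treat the details as illustrative rather than a full recipe:

```python
from explainerdashboard import ClassifierExplainer, ExplainerDashboard

# Assumes `model`, `X`, `y` from the feature-importance example above
explainer = ClassifierExplainer(model, X, y)

# Interactive dashboard: feature importances, SHAP contributions,
# what-if analysis — served locally in the browser
ExplainerDashboard(explainer, title="Diabetes model").run()
```

In practice you would pass a held-out test set (not the training data) to the explainer, so the dashboard reflects how the model behaves on unseen patients.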

🚫 Common Mistakes in Explainable AI

πŸ’­ Short reflection

In one sentence: why would a doctor or regulator prefer a model that is explainable (e.g. feature importance, SHAP) over a pure black box?

βœ… CORE (Must know)

πŸ“š NON-CORE (Good to know)