Train House Prices with Gradient Descent

Business Context

HomeValue, a residential pricing platform serving 200K monthly property searches, wants a transparent baseline model for estimating sale prices from listing attributes. The pricing team needs a model trained from scratch with gradient descent so they can validate optimization behavior before moving to more complex methods.

Dataset

You are given a historical housing dataset built from MLS listings and closed sales.

Feature Group	Count	Examples
Numerical property features	9	square_feet, lot_size, bedrooms, bathrooms, year_built, hoa_fee
Categorical location features	4	neighborhood, zip_code, property_type, condition_rating
Temporal features	2	listing_month, days_on_market
Derived market features	3	price_per_sqft_neighborhood_avg, school_score, distance_to_downtown

Size: 52K home sales, 18 raw features
Target: Continuous — final sale price in USD
Missing data: 6% missing in hoa_fee, 4% in condition_rating, 2% in school_score
Data quality: Sale prices are right-skewed with a small number of luxury outliers above $3M
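
The missing-value rates and the right-skewed target above suggest two preprocessing steps. The following is a minimal sketch, not a prescribed method: it assumes median imputation with a missing-value indicator for numeric columns such as hoa_fee, and a log1p transform of the sale price. The helper names (impute_with_indicator, to_log_price, from_log_price) are hypothetical, and in practice the median should be computed on the training split only.

```python
import numpy as np

def impute_with_indicator(x):
    """Median-impute a numeric column; also return a 0/1 'was missing' flag."""
    x = np.asarray(x, dtype=float)
    missing = np.isnan(x)
    median = np.nanmedian(x)  # assumption: compute this on training data only
    filled = np.where(missing, median, x)
    return filled, missing.astype(float)

def to_log_price(price):
    """Compress the right tail of sale prices before fitting."""
    return np.log1p(np.asarray(price, dtype=float))

def from_log_price(log_price):
    """Invert to_log_price so errors can be reported in USD."""
    return np.expm1(np.asarray(log_price, dtype=float))

# Toy values for illustration only, not taken from the dataset.
hoa_fee = np.array([120.0, np.nan, 80.0, 200.0])
filled, was_missing = impute_with_indicator(hoa_fee)
```

If the model is fit on log prices, predictions should be mapped back with from_log_price before computing RMSE and MAE, since the success criteria are stated in dollars.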

Success Criteria

A good solution should achieve RMSE below $42K and MAE below $28K on a held-out test set, while showing stable convergence of the training loss. The team also expects a clear explanation of learning rate choice, regularization, and stopping criteria.
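
As an illustration of how these thresholds could be checked, here is a small sketch of the three evaluation metrics in NumPy. The function names regression_metrics and meets_targets are hypothetical; the default thresholds are the $42K and $28K targets stated above.

```python
import numpy as np

def regression_metrics(y_true, y_pred):
    """Return RMSE, MAE, and R^2 for predictions expressed in USD."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = y_pred - y_true
    rmse = float(np.sqrt(np.mean(err ** 2)))
    mae = float(np.mean(np.abs(err)))
    ss_res = float(np.sum(err ** 2))
    ss_tot = float(np.sum((y_true - y_true.mean()) ** 2))
    r2 = 1.0 - ss_res / ss_tot
    return {"rmse": rmse, "mae": mae, "r2": r2}

def meets_targets(metrics, rmse_max=42_000, mae_max=28_000):
    """Check the RMSE and MAE thresholds from the success criteria."""
    return metrics["rmse"] < rmse_max and metrics["mae"] < mae_max

# Toy values for illustration only.
m = regression_metrics([300_000, 500_000, 700_000], [310_000, 480_000, 700_000])
```

RMSE penalizes large misses more heavily than MAE, so the luxury outliers above $3M are likely to affect the RMSE target more than the MAE target.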

Constraints

Use gradient descent to train the core model rather than a closed-form solver.
The solution must be interpretable enough to explain feature effects to pricing analysts.
Training should complete in a few minutes on a laptop.
Avoid data leakage from future market aggregates.
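
One way to respect the leakage constraint is to split chronologically, so that validation and test sales all close after the training sales. The sketch below assumes each sale has a sortable closing timestamp; the dataset description does not list such a column, so sale_order and chronological_split are hypothetical names. Under the same assumption, price_per_sqft_neighborhood_avg would need to be recomputed from sales that closed before each listing.

```python
import numpy as np

def chronological_split(sale_order, train_frac=0.7, val_frac=0.15):
    """Return index arrays (train, val, test) ordered by sale time.

    sale_order: one sortable value per sale (for example days since the
    first sale). This column is an assumption, not part of the dataset spec.
    """
    order = np.argsort(np.asarray(sale_order), kind="stable")
    n = len(order)
    n_train = int(n * train_frac)
    n_val = int(n * val_frac)
    train_idx = order[:n_train]
    val_idx = order[n_train:n_train + n_val]
    test_idx = order[n_train + n_val:]
    return train_idx, val_idx, test_idx

# Toy values for illustration only.
sale_order = np.array([5, 1, 4, 2, 3, 0, 9, 8, 7, 6, 12, 11, 10, 15, 14, 13, 19, 18, 17, 16])
train_idx, val_idx, test_idx = chronological_split(sale_order)
```

Imputation medians, scaling statistics, and any neighborhood aggregates would then be fit on train_idx only and applied unchanged to the later splits.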

Deliverables

Build a regression pipeline that preprocesses mixed feature types and trains a linear model with gradient descent.
Explain the loss function, gradients, and how you monitor convergence.
Compare batch vs mini-batch gradient descent and justify your choice.
Evaluate on train/validation/test splits with RMSE, MAE, and R².
Recommend hyperparameters and production retraining cadence.
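
The core of these deliverables can be sketched as follows. This is one possible implementation under stated assumptions, not a reference solution: it minimizes mean squared error with an L2 penalty, (1/2m) * sum((Xw + b - y)^2) + (l2/2) * ||w||^2, using mini-batch gradient descent and early stopping on validation MSE. The function names and all default hyperparameters (lr, l2, batch_size, patience, tol) are illustrative choices, and the inputs are assumed to be already encoded as numeric arrays.

```python
import numpy as np

def standardize(X_train, X_other):
    """Scale features using training statistics only."""
    mu = X_train.mean(axis=0)
    sigma = X_train.std(axis=0)
    sigma[sigma == 0] = 1.0  # avoid division by zero for constant columns
    return (X_train - mu) / sigma, (X_other - mu) / sigma

def fit_minibatch_gd(X, y, X_val, y_val, lr=0.05, l2=1e-3, batch_size=256,
                     max_epochs=200, patience=10, tol=1e-6, seed=0):
    """Fit a linear model with mini-batch gradient descent.

    Returns the weights and bias with the lowest validation MSE, plus the
    per-epoch validation MSE history for convergence plots.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    best_mse, best_w, best_b = np.inf, w.copy(), b
    stale, history = 0, []
    for _ in range(max_epochs):
        perm = rng.permutation(n)  # reshuffle every epoch
        for start in range(0, n, batch_size):
            idx = perm[start:start + batch_size]
            resid = X[idx] @ w + b - y[idx]
            grad_w = X[idx].T @ resid / len(idx) + l2 * w  # bias is not penalized
            grad_b = resid.mean()
            w -= lr * grad_w
            b -= lr * grad_b
        val_mse = float(np.mean((X_val @ w + b - y_val) ** 2))
        history.append(val_mse)
        if val_mse < best_mse - tol:
            best_mse, best_w, best_b, stale = val_mse, w.copy(), b, 0
        else:
            stale += 1
            if stale >= patience:  # stop after `patience` epochs without improvement
                break
    return best_w, best_b, history
```

Setting batch_size equal to the number of training rows turns the same loop into full-batch gradient descent, which gives a direct way to compare the two variants. With standardized inputs, each weight can be read as the change in the target per one standard deviation of the feature, which may help when explaining feature effects to pricing analysts.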
