Titanic - PyTorch MLP (Kaggle)
A reproducible PyTorch MLP pipeline for the Kaggle Titanic dataset.
This project demonstrates:
- Modular feature engineering
- Clean training loop separation
- Ensemble inference
- Reproducible submission generation

Project Structure
- src/
- model.py # MLP architecture definition
- train.py # training loop (optimizer, loss, logging)
- features.py # feature engineering (fit / transform pipeline)
- scripts/
- make_submission.py # production entry point
- data/
- outputs/
- requirements.txt
- README.md
Setup
Install dependencies:
pip install -r requirements.txt
Data
Run Submission Pipeline
Feature Pipeline
Model
- PyTorch MLP
- Configurable depth
- Dropout regularization
- Adam optimizer
- Ensemble across multiple random seeds
Reproducibility
- Randomness is controlled via:
Numpy seed
PyTorch seed
Deterministic train / validation split
Notes
- The submission script is intentionally clean (no plotting or experiment logic).