You're starting a new data platform project and need to define the pipeline foundation before implementation begins. You want an approach that can support growth, operational reliability, and clean handoffs between ingestion, transformation, and downstream consumption.
How would you design a scalable infrastructure for a new project?