You are working on pipeline infrastructure that runs across a large Linux fleet. The job is not only keeping nodes healthy, but also rolling out software changes without breaking scheduled data workloads.
Describe your experience with Linux cluster management and how you handle software deployment at scale.