What is a Software Engineer at Together AI?
As a Software Engineer at Together AI, you will design, build, and scale the foundational layer of the AI Acceleration Cloud. This platform is an end-to-end, global infrastructure that virtualizes cutting-edge machine learning hardware—including NVIDIA GB200/GB300 GPUs and BlueField DPUs—to deliver blazing-fast LLM inference and training capabilities. Your work directly enables AI researchers and enterprise customers to provision on-demand compute, managed Kubernetes clusters, and massive Slurm workloads with zero friction.
This role sits at the intersection of high-performance physical hardware and highly available cloud software. Unlike traditional SaaS engineering, the systems you build at Together AI must handle exabyte-scale data pipelines, ultra-low latency networking topologies like Infiniband, and complex distributed scheduling challenges. The platform serves both external enterprise customers and internal state-of-the-art products, making your engineering contributions a critical driver of the company's growth and technological edge.
To succeed in this position, you must possess a deep appreciation for systems-level programming, concurrent execution, and distributed state management. You will tackle highly ambiguous problems that lack pre-existing industry blueprints, collaborating with a world-class team of AI researchers and infrastructure experts who have pioneered breakthroughs like FlashAttention and RedPajama.