1. What is a Machine Learning Engineer at Advanced Micro Devices?
At Advanced Micro Devices (AMD), the role of a Machine Learning Engineer is fundamentally about bridging the gap between cutting-edge AI software and high-performance hardware. Unlike generalist ML roles that focus mainly on model architecture or data cleaning, this position is rooted in hardware-software co-design. You are not just training models; you are shaping how the next generation of Generative AI, Large Language Models (LLMs), and computer vision systems runs on AMD’s Instinct accelerators (such as the MI300 series) and consumer GPUs.
This role is critical to AMD’s strategic mission to challenge the status quo in the AI accelerator market. You will work within teams like the Models and Applications team, the Llama team, or the Advanced Graphics Program. Your impact is measured by your ability to optimize distributed training pipelines, enhance the ROCm open software ecosystem, and push the performance boundaries of frameworks like PyTorch, JAX, and TensorFlow. You are the engineer ensuring that the world's most complex AI workloads run efficiently and at scale on AMD silicon.
Candidates joining AMD in this capacity enter an environment that values engineering rigor and "underdog" innovation. You will tackle complex problems involving distributed systems, kernel optimization, and massive-scale cluster management. Whether you are optimizing inference for Agentic AI or pushing the limits of real-time neural graphics, your work directly helps developers and researchers choose AMD as their platform for AI workloads.

