In large NVIDIA C++ codebases such as CUDA runtime components or TensorRT execution paths, concurrency changes are risky because correctness depends on ordering, shared state, and synchronization semantics.
Explain how you would structure a code review for a complex C++ concurrency change. Your answer should cover:
The interviewer expects a coding-focused, systems-oriented explanation rather than people-process advice. Discuss concrete techniques such as dependency graphs, lock-order analysis, invariants, state transitions, and use of NVIDIA-relevant tooling where appropriate. You do not need to implement a full static analyzer, but you should explain the algorithmic structure of a strong review approach and the failure modes it is designed to catch.
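As an illustration of the lock-order analysis mentioned above, here is a minimal sketch of one way a reviewer might mechanize the check. It assumes the reviewer has recorded by hand, for each code path touched by the change, the sequence of locks that path acquires. The class name `LockOrderGraph` and the lock names in the usage note are hypothetical and do not come from any NVIDIA codebase. The sketch adds a "held-before" edge for every ordered pair of locks on a path, then runs a depth-first search. Any cycle means two paths disagree on acquisition order, which is a potential deadlock.

```cpp
#include <cstddef>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical review aid: one LockPath is the ordered list of locks that a
// single code path acquires, as noted by the reviewer while reading the diff.
using LockPath = std::vector<std::string>;

class LockOrderGraph {
public:
    // Record an edge A -> B whenever A is already held when B is acquired.
    void add_path(const LockPath& path) {
        for (std::size_t i = 0; i < path.size(); ++i) {
            edges_[path[i]];  // make sure every lock is a node in the graph
            for (std::size_t j = i + 1; j < path.size(); ++j) {
                edges_[path[i]].insert(path[j]);
            }
        }
    }

    // True if the recorded orderings contain a cycle, meaning at least two
    // paths acquire some pair of locks in conflicting orders.
    bool has_cycle() const {
        std::map<std::string, int> color;  // 0 = unvisited, 1 = on stack, 2 = done
        for (const auto& entry : edges_) {
            if (color[entry.first] == 0 && dfs(entry.first, color)) {
                return true;
            }
        }
        return false;
    }

private:
    bool dfs(const std::string& node, std::map<std::string, int>& color) const {
        color[node] = 1;
        for (const auto& next : edges_.at(node)) {
            if (color[next] == 1) return true;  // back edge: ordering conflict
            if (color[next] == 0 && dfs(next, color)) return true;
        }
        color[node] = 2;
        return false;
    }

    std::map<std::string, std::set<std::string>> edges_;
};
```

For example, recording the paths `{"stream_mutex", "cache_mutex"}` and `{"cache_mutex", "stream_mutex"}` makes `has_cycle()` return true, while two paths that both take `stream_mutex` first do not. A sketch like this only covers orderings the reviewer has written down. It complements dynamic tools such as ThreadSanitizer for host code or the racecheck tool in NVIDIA Compute Sanitizer for device shared-memory hazards, and does not replace them.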