
You are discussing modern NLP model architectures in an interview setting. The conversation turns to the model family behind many current text classification, generation, and retrieval systems.
What is a transformer?
Understanding of self-attention and token interactions
How tokenization and embeddings feed transformer layers
Why transformers replaced many older NLP architectures
How transformers are used in practical NLP systems
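
A candidate might ground the first two points with a small sketch like the one below. It is only an illustration under stated assumptions: the whitespace tokenizer, the random embedding table, the random projection matrices, and the helper names (`softmax`, `matvec`, `random_matrix`, `self_attention`) are all hypothetical, and positional encodings, multiple heads, and training are left out. It shows tokens being mapped to embeddings and then mixed by single-head scaled dot-product self-attention.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(vec, mat):
    """Multiply a length-d_in vector by a d_in x d_out matrix."""
    return [sum(v * row[j] for v, row in zip(vec, mat))
            for j in range(len(mat[0]))]


def random_matrix(rows, cols, rng):
    """Hypothetical stand-in for learned weights: uniform random values."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def self_attention(embeddings, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (no masking)."""
    queries = [matvec(x, w_q) for x in embeddings]
    keys = [matvec(x, w_k) for x in embeddings]
    values = [matvec(x, w_v) for x in embeddings]
    d_k = len(keys[0])

    outputs, weights = [], []
    for q in queries:
        # Score this token's query against every token's key.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        attn = softmax(scores)  # one weight per token, summing to 1
        weights.append(attn)
        # Output is the attention-weighted average of the value vectors.
        outputs.append([sum(a * v[j] for a, v in zip(attn, values))
                        for j in range(len(values[0]))])
    return outputs, weights


rng = random.Random(0)

# Assumption: a toy whitespace tokenizer; real systems use subword tokenizers.
tokens = "the cat sat".split()
vocab = {tok: i for i, tok in enumerate(sorted(set(tokens)))}

# Assumption: a random embedding table standing in for learned embeddings.
d_model = 4
embedding_table = random_matrix(len(vocab), d_model, rng)
embeddings = [embedding_table[vocab[t]] for t in tokens]

w_q, w_k, w_v = (random_matrix(d_model, d_model, rng) for _ in range(3))
outputs, weights = self_attention(embeddings, w_q, w_k, w_v)

for tok, row in zip(tokens, weights):
    print(tok, [round(w, 3) for w in row])
```

Each printed row shows how strongly one token attends to every token in the sequence, which is the "token interaction" the question is probing. Because every row is computed independently of the others, the whole sequence can be processed in parallel, one commonly cited reason transformers displaced recurrent architectures.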