

You are working on an NLP system that processes text for classification, extraction, or generation. Your team is deciding whether to use a transformer model or a recurrent model such as an RNN or LSTM.
What are transformers and how do they differ from RNNs/LSTMs?