Convert the tokens into token IDs (indices) As discussed above, in order to train a transformer on a text dataset, you have to turn the text data into a list of token IDs. These IDs are numbers such ...