The smallest units of data that a model can process in Natural Language Processing (NLP). Can refer to words, characters, subwords or even sentences depending on the granularity of the model.
In a language model, tokens could be individual words (‘dog’, ‘cat’, etc.) or subwords (e.g., ‘d-o-g’)