If we use tokenize=True, which is the default setting, that string will also be tokenized for us.