If any sentence has ????? or **** core\nlp tokenizes it and identifies it as Number, which should not happen.