Tokenizer that is aware of Wikipedia syntax.