charset: relevant Unicode blocks in language comparisons