unicode: regularise latin samples