unicode: prefer shorter digraph codes