Sanskrit transcriptions from Wikipedia
-rw-r--r-- 25256 unicode.txt