convert source text files to unicode