X-Git-Url: http://git.shiar.nl/unicode-sampler.git/blobdiff_plain/ea70e66db50bd2be8da8a8b5ee8676e768172545..a227e6ccba568130b4e41f998fb2f994283aa0bb:/unicode.txt diff --git a/unicode.txt b/unicode.txt index ad61432..ef5e850 100644 --- a/unicode.txt +++ b/unicode.txt @@ -1,29 +1,39 @@ Unicode sampler ‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾ -Test support of various text encoded with Unicode up to version 8.0 (2015). +Test support of various text encoded with Unicode up to version 10.0 (2017). Based on file by Markus Kuhn -Updated by Mischa Poslawsky 2015-09-13 +Updated by Mischa Poslawsky 2020-03-10 Compact font overview: ╔══════════════════════════════════════════════════════════════════════╗ - ║ _ABCDEFGHIJKLMNOPQRSTUVWXYZ ÅĀČẾƏØṆⱣÞß ΑΒΓΔΩὮ АБВГДЯѢЌ ԱԲԳ ႠႡႢჇ אבגױ ║ - ║ @abcdefghijklmnopqrstuvwxyz åāčếəøṇᵽþſ αβγδωὦ абвгдяѣќ աբգ აბგჷ ابجݰ ║ - ║ -0123456789 (/)[\]{|} ^`"'~ «“’”» ,;:.…!¿?‽ •&#§¶†©%‰ −±+*×÷ <>=≠∀∧∅ ║ - ║ ·¤¢₥$€£¥₹₽ ฿₫֏₭₺₦₩₪ ✂℻☆♥⚐☺☯☹ ☉♀♁♂♉ ✔✘ ○☓□△ ␣⌫⌥⌘↵␀ ¯₁½²√¬∈∞ ↗┌╁╖░█∎ � ║ + ║ _ABCDEFGHIJKLMNOPQRSTUVWXYZ ÅĀČẾƏØṆⱣÞß АБВГДЯѢЌ ΑΒΓΔΩὮ ႠႡႢჇ ԱԲԳ אבגױ ║ + ║ -abcdefghijklmnopqrstuvwxyz åāčếəøṇᵽþſ абвгдяѣќ αβγδωὦ აბგჷ աբգ ابجݰ ║ + ║ −0123456789 <=>+÷× ¤¢$€¥£元 (/)[\]{|} ,;:.…!¿?‽· ^`'"~ ✔✘☺☹ @#&§¶†©• ║ + ║ ¯½₁²↋ %‰√∞∧¬∈≠≥±∶*∀∅ ฿₺₽₹₩₪ ␣⌫⌥⎇⌘↵␤␀ ☉♀♁♂✂✎☆♥⚐☯∎ «“’”» ○☓□△ ↗┌╁╖░█ � ║ ╚══════════════════════════════════════════════════════════════════════╝ +Unicode blocks: + 0__ 1__ 2__ 3__ 4__ 5__ 6__ 7__ 8__ 9__ A__ B__ C__ D__ E__ F__ + U+00*__ A Á Ă Ȁ ɐ ʶ ◌̌ Ω Я Ԙ Ա א ب ݓ ܐ ߐ ࡁ ࢶ क ক ਕ ક କ க క ಕ ക ක ท ລ ཀ + U+01*__ က დ ㅎ አ ፩ ᎃ Ꮳ ᓀ ᙽ ᚏ ᚠ ᜃ ក ᠦ ᣈ ᤁ ᦂ ᨠ ◌᪱ ᬓ ᯂ᯦ ᰀ Დ ᴂ ᶐ ◌ᷲ Ậ ᾮ + U+02*__ ※ ₿⃕ ™ ⇅ √ ⋲ ⌘ ⏻ ␛ Ⓐ ╩ ▛ ◈ ☺ ✈ ⟇ ⟴ ⡽ ⤱ ⦖ ⨖ ⫻ ⬀ ⯒ Ⰳ Ⲁ ⵣ ⷔ ⹋ ⺾⼬ + U+03-0A ひカㄅ㇂㈭ ㌁ 㐀 ䷃ 中 ꊈ꒸ꓯ ꕉ Ꙗ ꚩ Ꜽ Ꞻ ꡀ ꢒ ꤰ ꦏ ꨀ ꪁ ꬰ ꯀ 가 + U+10*__ 𐀀 𐂛 𐅄 𐇑 𐊀 𐊷 𐌰 𐎠 𐑗 𐒱 𐔀 𐔰 𐘐 𐡀 𐢀 𐤀 𐦠 𐩱 𐪑 𐬁 𐭠 𐰢 𐲘 𐴀 𐹠 𐼁 𐿠 + Code:

 
   Hash[ :nbsp => 0O2_40 ].each {|name, cp| puts "#{name} is '#{cp.chr}'" }
 
-  while ((c = *l++) != '\0') { m->stat[2] = IO | (~OK & X_8); }
+  while ((c = *l++) != '\0') { m->stat[2] = IO | (~OK & X_8); } /* C */
 
   perl -pe's/\w/$^ =~ $& > chop($^ = $& . $^) ? "@-" : $&/ge'
 
+  fix$(<$>)<$>(:)<*>((<$>((:[{- hs -}])<$>))(=<<)<$>(*)<$>(>>=)(+)($))$1
+
   ↑1 ⍵∨.∧3 4=+/,¯1 0 1∘.⊖¯1 0 1⌽¨⊂⍵ ⍝ game of life
 
 Mathematics and sciences:
@@ -72,7 +82,7 @@ Precomposed and combining diacritics:
   Muļķa hipiji mēģina nogaršot žņaudzējčūsku. Trâu chậm uống nước đục.
   Mul̦k̦a hipiji mēg̓ina nogaršot žn̦audzējčūsku. Trâu chậm uống nước đục.
 
-  STARGɅ̊TE, a = v̇ = r̈, a⃑ ⊥ b⃑
+  STARGɅ̊TE • a = v̇ = r̈, a⃑ ⊥ b⃑ • 1̴·2⃯⃗·3̶̮̑·4̣̤̇̈·5⃘̜̹͑͗
 
 Pangrams:
 
@@ -100,6 +110,10 @@ German with presentational ligatures:
   Im finſteren Jagdſchloß am offenen Felsquellwaſſer patzte der affig‐flatterhafte
   kauzig‐höf‌liche Bäcker über ſeinem verſifften kniffligen C‐Xylophon.
 
+Common homographs:
+
+  AΑАᎪꓮ𝖠𖽀 OΟОՕⲞס߀Ჿꓳ𐐄𐊒𐊫𐌏ዐ𐓂᠐ꢝ𐰗𖫩ⵔ𖩠0𝙾○ㆁ꒨
+
 Modern Greek Ύμνος εις την Ελευθερίαν:
 
   Σε γνωρίζω από την κόψη του σπαθιού την τρομερή,
@@ -114,6 +128,11 @@ Ancient Greek Iliad:
   ϙύνεσσιν οἰωνοῖσί τε πᾶσι· Διὸς δ᾽ ἐτελείετο βουλή· ἐξ οὗ δὴ τὰ πρῶτα
   διαστήτην ἐρίσαντε Ἀτρεΐδης τε ϝάναξ ἀνδρῶν καὶ δῑος Ἀχιλλεύς.
 
+Coptic:
+
+  ⲕⲧ̅ⲕⲁ ⲅⲉⲗⲅⲟ̅ⲥⲛ ⲓ̈ⲏ̅ⲥⲟⲩⲥⲓ ⲛⲁⳡⲁⲛ ⲧⲣⲓⲕⲁ• ⲇⲟⲗⲗⲉ ⲡⲟⲗⲅⲁⲣⲁ ⲡⲉⲥⲥⲛⲁ• ⲡⲁⲡⲟ ⲥ̅ⲕⲟⲉⲗⲙ̅ⲙⲉ ⲉⲕ̅ⲕⲁ
+  κτ̄κα γελγελο̄ϲουανον ῑη̄ϲουϲι ναϫαν τρικα• δολλε πολγαρα πεϲϲνα• παπο ϲ̄κοελμ̄με εκ̄κα
+
 Georgian:
 
   ვეფხისტყაოსანი (Veṗxis Ṭq̇aosani) შოთა რუსთაველი (დაახ. 1165)
@@ -164,6 +183,10 @@ Zarka Table (Torah cantillation):
   זַרְקָא֮ סְגוֹלְתָּא֒ מוּנַח־לְגַרְמֵ֣הּ׀ מוּנַ֣ח רְבִ֗יעַ פָּזֵר־קָטָ֡ן תְּלִישָׁא־גְ֠דוֹלָה תְּלִישָׁא־קְטַנָה֩ אַזְלָ֨א גֶּ֜רֶשׁ
   מְהֻפָּ֤ךְ פַּשְׁטָא֙ זָקֵף־קָטָ֔ן טִפְחָ֖א אַתְנָ֑ח דַּרְגָּ֧א תְּבִ֛יר טִפְחָ֖א מֵרְכָ֥א סִלּֽוּק׃
 
+Zalgo text:
+
+  T̫̺̳o̬̜ ì̬͎̲̟nv̖̗̻̣̹̕o͖̗̠̜̤k͍͚̹͖̼e̦̗̪͍̪͍ ̬ͅt̕h̠͙̮͕͓e̱̜̗͙̭ ̥͔̫͙̪͍̣͝ḥi̼̦͈̼v҉̩̟͚̞͎e͈̟̻͙̦̤-m̷̘̝̱í͚̞̦̳n̝̲̯̙̮͞d̴̺̦͕̫ ̗̭̘͎͖r̞͎̜̜͖͎̫͢ep͇r̝̯̝͖͉͎̺e̴s̥e̵̖̳͉͍̩̗n̢͓̪͕̜̰̠̦t̺̞̰i͟n҉̮̦̖̟g̮͍̱̻͍̜̳ ̳c̖̮̙̣̰̠̩h̷̗͍̖͙̭͇͈a̧͎̯̹̲̺̫ó̭̞̜̣̯͕s̶̤̮̩̘.̨̻̪̖͔ ̳̭̦̭̭̦̞́I̠͍̮n͇̹̪̬v̴͖̭̗̖o̸k҉̬̤͓͚̠͍i͜n̛̩̹͉̘̹g͙ ̠̥ͅt̰͖͞h̫̼̪e̟̩̝ ̭̠̲̫͔fe̤͇̝̱e͖̮̠̹̭͖͕l͖̲̘͖̠̪i̢̖͎̮̗̯͓̩n̸̰g̙̱̘̗͚̬ͅ ͍o͍͍̩̮͢f̖͓̦̥ ̘͘c̵̫̱̗͚͓̦h͝a̝͍͍̳̣͖͉o͙̟s̤̞.̙̝̭̣̳̼͟
+
 Ethiopic (Amharic, Blin, Sebatbeit):
   ዩኒኮድ ለእያንዳንዱ ፊደል፣       ዩኒኮድ ላፍደልድክ፡       ዩኒኮድ እንም ኤነት ፊደል፤
   ማንኛውም ዓይነት ኮምፒውተር ቢሆን፣  ኣኻ ኮምፕዩተርልክ ኣኽን፡   ሟኒም ኤነት ኮምፒተር ቢኸር፤
@@ -277,6 +300,8 @@ Japanese Iroha:
   浅き夢見じ   あさきゆめみし   アサキユメミジ   アサキユメミジ   阿佐伎喩女美之
   酔ひもせず   ゑひもせす     ヱヒモセズン    ウェヒモセズン   恵比毛勢須
 
+  hentaigana 変体仮名: 𛀆𛄆𛂦𛂌𛃀𛂶𛁻 𛁦𛃶𛂏𛃸𛄚 𛄋𛀙𛃫𛁟𛄀𛁚 𛁩𛂒𛂄𛃭𛃑
+
 Chinese:
 
   ‣ Most common characters: