X-Git-Url: http://git.shiar.nl/unicode-sampler.git/blobdiff_plain/ea70e66db50bd2be8da8a8b5ee8676e768172545..8e883931059c70153be95b4f3f5edc57c60179bc:/unicode.txt diff --git a/unicode.txt b/unicode.txt index ad61432..4524fde 100644 --- a/unicode.txt +++ b/unicode.txt @@ -1,29 +1,38 @@ Unicode sampler ‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾ -Test support of various text encoded with Unicode up to version 8.0 (2015). +Test support of various text encoded with Unicode up to version 10.0 (2017). Based on file by Markus Kuhn -Updated by Mischa Poslawsky 2015-09-13 +Updated by Mischa Poslawsky 2020-03-10 Compact font overview: ╔══════════════════════════════════════════════════════════════════════╗ - ║ _ABCDEFGHIJKLMNOPQRSTUVWXYZ ÅĀČẾƏØṆⱣÞß ΑΒΓΔΩὮ АБВГДЯѢЌ ԱԲԳ ႠႡႢჇ אבגױ ║ - ║ @abcdefghijklmnopqrstuvwxyz åāčếəøṇᵽþſ αβγδωὦ абвгдяѣќ աբգ აბგჷ ابجݰ ║ - ║ -0123456789 (/)[\]{|} ^`"'~ «“’”» ,;:.…!¿?‽ •&#§¶†©%‰ −±+*×÷ <>=≠∀∧∅ ║ - ║ ·¤¢₥$€£¥₹₽ ฿₫֏₭₺₦₩₪ ✂℻☆♥⚐☺☯☹ ☉♀♁♂♉ ✔✘ ○☓□△ ␣⌫⌥⌘↵␀ ¯₁½²√¬∈∞ ↗┌╁╖░█∎ � ║ + ║ _ABCDEFGHIJKLMNOPQRSTUVWXYZ ÅĀČẾƏØṆⱣÞß АБВГДЯѢЌ ΑΒΓΔΩὮ ႠႡႢჇ ԱԲԳ אבגױ ║ + ║ -abcdefghijklmnopqrstuvwxyz åāčếəøṇᵽþſ абвгдяѣќ αβγδωὦ აბგჷ աբգ ابجݰ ║ + ║ −0123456789 <=>+÷× ¤¢$€¥£元 (/)[\]{|} ,;:.…!¿?‽· ^`'"~ ✔✘☺☹ @#&§¶†©• ║ + ║ ¯½₁²↋ %‰√∞∧¬∈≠≥±∶*∀∅ ฿₺₽₹₩₪ ␣⌫⌥⎇⌘↵␤␀ ☉♀♁♂✂✎☆♥⚐☯∎ «“’”» ○☓□△ ↗┌╁╖░█ � ║ ╚══════════════════════════════════════════════════════════════════════╝ +Unicode blocks: + 0xx 1xx 2xx 3xx 4xx 5xx 6xx 7xx 8xx 9xx Axx Bxx Cxx Dxx Exx Fxx + U+00*xx A Æ Ŧ Ə ɚ ʶ ◌̌ Ψ Я Ԅ Զ א ﻕ ܐݘޘߐࠄࡊࢨ ऊ উ ਇ ઈ ଊ ஒ ఈ ಏ ഋ ඉ ฆ ຄ ༀ + U+01*xx က ფ ㄹ ቒ ᎉ Ꭳ ᗾ . . . ᚘᚠᜈᜣᝁᝰទ ᠦ ᣆ ᤊᥜᦁ᧬ᨁᨮ ◌᪱ᬓ ᮕᯄ᯦ᰀᱚᲐᳯᴓ ᶒ ◌᷌ Ḛ Ὦ + U+02*xx ※ ⁵€⃕℘⅞↬ ∉ ⌘ ␦⑇Ⓑ ╩ ▛ ◈ ☤ ✍ ⟀ ⟰ ⡽ ⤱ ⦖ ⨒ ⬀ ⰂⱠⲀ ⴂⵣⶎ ⸎ ⺋⼈ + U+03-0A のオㄌ㇂㊤ ㍚ 㐁 ䷃ 中 ꀃ꒚ꓘ ꖃ Ꙗ ꚬ Ꜽ ꠁꡁꢇꣷꤊꤰꦑꧣꨀ꩷ꪂꫠꬫꬴꮠꯂ가 + Code:

 
   Hash[ :nbsp => 0O2_40 ].each {|name, cp| puts "#{name} is '#{cp.chr}'" }
 
-  while ((c = *l++) != '\0') { m->stat[2] = IO | (~OK & X_8); }
+  while ((c = *l++) != '\0') { m->stat[2] = IO | (~OK & X_8); } /* C */
 
   perl -pe's/\w/$^ =~ $& > chop($^ = $& . $^) ? "@-" : $&/ge'
 
+  fix$(<$>)<$>(:)<*>((<$>((:[{- hs -}])<$>))(=<<)<$>(*)<$>(>>=)(+)($))$1
+
   ↑1 ⍵∨.∧3 4=+/,¯1 0 1∘.⊖¯1 0 1⌽¨⊂⍵ ⍝ game of life
 
 Mathematics and sciences:
@@ -72,7 +81,7 @@ Precomposed and combining diacritics:
   Muļķa hipiji mēģina nogaršot žņaudzējčūsku. Trâu chậm uống nước đục.
   Mul̦k̦a hipiji mēg̓ina nogaršot žn̦audzējčūsku. Trâu chậm uống nước đục.
 
-  STARGɅ̊TE, a = v̇ = r̈, a⃑ ⊥ b⃑
+  STARGɅ̊TE • a = v̇ = r̈, a⃑ ⊥ b⃑ • 1̴·2⃯⃗·3̶̮̑·4̣̤̇̈·5⃘̜̹͑͗
 
 Pangrams:
 
@@ -100,6 +109,10 @@ German with presentational ligatures:
   Im finſteren Jagdſchloß am offenen Felsquellwaſſer patzte der affig‐flatterhafte
   kauzig‐höf‌liche Bäcker über ſeinem verſifften kniffligen C‐Xylophon.
 
+Common homographs:
+
+  AΑАᎪꓮ𝖠𖽀 OΟОՕⲞס߀Ჿꓳ𐐄𐊒𐊫𐌏ዐ𐓂᠐ꢝ𐰗𖫩ⵔ𖩠0𝙾○ㆁ꒨
+
 Modern Greek Ύμνος εις την Ελευθερίαν:
 
   Σε γνωρίζω από την κόψη του σπαθιού την τρομερή,
@@ -114,6 +127,11 @@ Ancient Greek Iliad:
   ϙύνεσσιν οἰωνοῖσί τε πᾶσι· Διὸς δ᾽ ἐτελείετο βουλή· ἐξ οὗ δὴ τὰ πρῶτα
   διαστήτην ἐρίσαντε Ἀτρεΐδης τε ϝάναξ ἀνδρῶν καὶ δῑος Ἀχιλλεύς.
 
+Coptic:
+
+  ⲕⲧ̅ⲕⲁ ⲅⲉⲗⲅⲟ̅ⲥⲛ ⲓ̈ⲏ̅ⲥⲟⲩⲥⲓ ⲛⲁⳡⲁⲛ ⲧⲣⲓⲕⲁ• ⲇⲟⲗⲗⲉ ⲡⲟⲗⲅⲁⲣⲁ ⲡⲉⲥⲥⲛⲁ• ⲡⲁⲡⲟ ⲥ̅ⲕⲟⲉⲗⲙ̅ⲙⲉ ⲉⲕ̅ⲕⲁ
+  κτ̄κα γελγελο̄ϲουανον ῑη̄ϲουϲι ναϫαν τρικα• δολλε πολγαρα πεϲϲνα• παπο ϲ̄κοελμ̄με εκ̄κα
+
 Georgian:
 
   ვეფხისტყაოსანი (Veṗxis Ṭq̇aosani) შოთა რუსთაველი (დაახ. 1165)
@@ -164,6 +182,10 @@ Zarka Table (Torah cantillation):
   זַרְקָא֮ סְגוֹלְתָּא֒ מוּנַח־לְגַרְמֵ֣הּ׀ מוּנַ֣ח רְבִ֗יעַ פָּזֵר־קָטָ֡ן תְּלִישָׁא־גְ֠דוֹלָה תְּלִישָׁא־קְטַנָה֩ אַזְלָ֨א גֶּ֜רֶשׁ
   מְהֻפָּ֤ךְ פַּשְׁטָא֙ זָקֵף־קָטָ֔ן טִפְחָ֖א אַתְנָ֑ח דַּרְגָּ֧א תְּבִ֛יר טִפְחָ֖א מֵרְכָ֥א סִלּֽוּק׃
 
+Zalgo text:
+
+  T̫̺̳o̬̜ ì̬͎̲̟nv̖̗̻̣̹̕o͖̗̠̜̤k͍͚̹͖̼e̦̗̪͍̪͍ ̬ͅt̕h̠͙̮͕͓e̱̜̗͙̭ ̥͔̫͙̪͍̣͝ḥi̼̦͈̼v҉̩̟͚̞͎e͈̟̻͙̦̤-m̷̘̝̱í͚̞̦̳n̝̲̯̙̮͞d̴̺̦͕̫ ̗̭̘͎͖r̞͎̜̜͖͎̫͢ep͇r̝̯̝͖͉͎̺e̴s̥e̵̖̳͉͍̩̗n̢͓̪͕̜̰̠̦t̺̞̰i͟n҉̮̦̖̟g̮͍̱̻͍̜̳ ̳c̖̮̙̣̰̠̩h̷̗͍̖͙̭͇͈a̧͎̯̹̲̺̫ó̭̞̜̣̯͕s̶̤̮̩̘.̨̻̪̖͔ ̳̭̦̭̭̦̞́I̠͍̮n͇̹̪̬v̴͖̭̗̖o̸k҉̬̤͓͚̠͍i͜n̛̩̹͉̘̹g͙ ̠̥ͅt̰͖͞h̫̼̪e̟̩̝ ̭̠̲̫͔fe̤͇̝̱e͖̮̠̹̭͖͕l͖̲̘͖̠̪i̢̖͎̮̗̯͓̩n̸̰g̙̱̘̗͚̬ͅ ͍o͍͍̩̮͢f̖͓̦̥ ̘͘c̵̫̱̗͚͓̦h͝a̝͍͍̳̣͖͉o͙̟s̤̞.̙̝̭̣̳̼͟
+
 Ethiopic (Amharic, Blin, Sebatbeit):
   ዩኒኮድ ለእያንዳንዱ ፊደል፣       ዩኒኮድ ላፍደልድክ፡       ዩኒኮድ እንም ኤነት ፊደል፤
   ማንኛውም ዓይነት ኮምፒውተር ቢሆን፣  ኣኻ ኮምፕዩተርልክ ኣኽን፡   ሟኒም ኤነት ኮምፒተር ቢኸር፤
@@ -277,6 +299,8 @@ Japanese Iroha:
   浅き夢見じ   あさきゆめみし   アサキユメミジ   アサキユメミジ   阿佐伎喩女美之
   酔ひもせず   ゑひもせす     ヱヒモセズン    ウェヒモセズン   恵比毛勢須
 
+  hentaigana 変体仮名: 𛀆𛄆𛂦𛂌𛃀𛂶𛁻 𛁦𛃶𛂏𛃸𛄚 𛄋𛀙𛃫𛁟𛄀𛁚 𛁩𛂒𛂄𛃭𛃑
+
 Chinese:
 
   ‣ Most common characters: