| 1 |
|
| 2 | UTF-8 encoded sample plain-text file
|
| 3 | ‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾
|
| 4 |
|
| 5 | Markus Kuhn [ˈmaʳkʊs kuːn] <http://www.cl.cam.ac.uk/~mgk25/> — 2002-07-25 CC BY
|
| 6 |
|
| 7 |
|
| 8 | The ASCII compatible UTF-8 encoding used in this plain-text file
|
| 9 | is defined in Unicode, ISO 10646-1, and RFC 2279.
|
| 10 |
|
| 11 |
|
| 12 | Using Unicode/UTF-8, you can write in emails and source code things such as
|
| 13 |
|
| 14 | Mathematics and sciences:
|
| 15 |
|
| 16 | ∮ E⋅da = Q, n → ∞, ∑ f(i) = ∏ g(i), ⎧⎡⎛┌─────┐⎞⎤⎫
|
| 17 | ⎪⎢⎜│a²+b³ ⎟⎥⎪
|
| 18 | ∀x∈ℝ: ⌈x⌉ = −⌊−x⌋, α ∧ ¬β = ¬(¬α ∨ β), ⎪⎢⎜│───── ⎟⎥⎪
|
| 19 | ⎪⎢⎜⎷ c₈ ⎟⎥⎪
|
| 20 | ℕ ⊆ ℕ₀ ⊂ ℤ ⊂ ℚ ⊂ ℝ ⊂ ℂ, ⎨⎢⎜ ⎟⎥⎬
|
| 21 | ⎪⎢⎜ ∞ ⎟⎥⎪
|
| 22 | ⊥ < a ≠ b ≡ c ≤ d ≪ ⊤ ⇒ (⟦A⟧ ⇔ ⟪B⟫), ⎪⎢⎜ ⎲ ⎟⎥⎪
|
| 23 | ⎪⎢⎜ ⎳aⁱ-bⁱ⎟⎥⎪
|
| 24 | 2H₂ + O₂ ⇌ 2H₂O, R = 4.7 kΩ, ⌀ 200 mm ⎩⎣⎝i=1 ⎠⎦⎭
|
| 25 |
|
| 26 | Linguistics and dictionaries:
|
| 27 |
|
| 28 | ði ıntəˈnæʃənəl fəˈnɛtık əsoʊsiˈeıʃn
|
| 29 | Y [ˈʏpsilɔn], Yen [jɛn], Yoga [ˈjoːgɑ]
|
| 30 |
|
| 31 | APL:
|
| 32 |
|
| 33 | ((V⍳V)=⍳⍴V)/V←,V ⌷←⍳→⍴∆∇⊃‾⍎⍕⌈
|
| 34 |
|
| 35 | Nicer typography in plain text files:
|
| 36 |
|
| 37 | ╔══════════════════════════════════════════╗
|
| 38 | ║ ║
|
| 39 | ║ • ‘single’ and “double” quotes ║
|
| 40 | ║ ║
|
| 41 | ║ • Curly apostrophes: “We’ve been here” ║
|
| 42 | ║ ║
|
| 43 | ║ • Latin-1 apostrophe and accents: '´` ║
|
| 44 | ║ ║
|
| 45 | ║ • ‚deutsche‘ „Anführungszeichen“ ║
|
| 46 | ║ ║
|
| 47 | ║ • †, ‡, ‰, •, 3–4, —, −5/+5, ™, … ║
|
| 48 | ║ ║
|
| 49 | ║ • ASCII safety test: 1lI|, 0OD, 8B ║
|
| 50 | ║ ╭─────────╮ ║
|
| 51 | ║ • the euro symbol: │ 14.95 € │ ║
|
| 52 | ║ ╰─────────╯ ║
|
| 53 | ╚══════════════════════════════════════════╝
|
| 54 |
|
| 55 | Combining characters:
|
| 56 |
|
| 57 | STARGΛ̊TE SG-1, a = v̇ = r̈, a⃑ ⊥ b⃑
|
| 58 |
|
| 59 | Greek (in Polytonic):
|
| 60 |
|
| 61 | The Greek anthem:
|
| 62 |
|
| 63 | Σὲ γνωρίζω ἀπὸ τὴν κόψη
|
| 64 | τοῦ σπαθιοῦ τὴν τρομερή,
|
| 65 | σὲ γνωρίζω ἀπὸ τὴν ὄψη
|
| 66 | ποὺ μὲ βία μετράει τὴ γῆ.
|
| 67 |
|
| 68 | ᾿Απ᾿ τὰ κόκκαλα βγαλμένη
|
| 69 | τῶν ῾Ελλήνων τὰ ἱερά
|
| 70 | καὶ σὰν πρῶτα ἀνδρειωμένη
|
| 71 | χαῖρε, ὦ χαῖρε, ᾿Ελευθεριά!
|
| 72 |
|
| 73 | From a speech of Demosthenes in the 4th century BC:
|
| 74 |
|
| 75 | Οὐχὶ ταὐτὰ παρίσταταί μοι γιγνώσκειν, ὦ ἄνδρες ᾿Αθηναῖοι,
|
| 76 | ὅταν τ᾿ εἰς τὰ πράγματα ἀποβλέψω καὶ ὅταν πρὸς τοὺς
|
| 77 | λόγους οὓς ἀκούω· τοὺς μὲν γὰρ λόγους περὶ τοῦ
|
| 78 | τιμωρήσασθαι Φίλιππον ὁρῶ γιγνομένους, τὰ δὲ πράγματ᾿
|
| 79 | εἰς τοῦτο προήκοντα, ὥσθ᾿ ὅπως μὴ πεισόμεθ᾿ αὐτοὶ
|
| 80 | πρότερον κακῶς σκέψασθαι δέον. οὐδέν οὖν ἄλλο μοι δοκοῦσιν
|
| 81 | οἱ τὰ τοιαῦτα λέγοντες ἢ τὴν ὑπόθεσιν, περὶ ἧς βουλεύεσθαι,
|
| 82 | οὐχὶ τὴν οὖσαν παριστάντες ὑμῖν ἁμαρτάνειν. ἐγὼ δέ, ὅτι μέν
|
| 83 | ποτ᾿ ἐξῆν τῇ πόλει καὶ τὰ αὑτῆς ἔχειν ἀσφαλῶς καὶ Φίλιππον
|
| 84 | τιμωρήσασθαι, καὶ μάλ᾿ ἀκριβῶς οἶδα· ἐπ᾿ ἐμοῦ γάρ, οὐ πάλαι
|
| 85 | γέγονεν ταῦτ᾿ ἀμφότερα· νῦν μέντοι πέπεισμαι τοῦθ᾿ ἱκανὸν
|
| 86 | προλαβεῖν ἡμῖν εἶναι τὴν πρώτην, ὅπως τοὺς συμμάχους
|
| 87 | σώσομεν. ἐὰν γὰρ τοῦτο βεβαίως ὑπάρξῃ, τότε καὶ περὶ τοῦ
|
| 88 | τίνα τιμωρήσεταί τις καὶ ὃν τρόπον ἐξέσται σκοπεῖν· πρὶν δὲ
|
| 89 | τὴν ἀρχὴν ὀρθῶς ὑποθέσθαι, μάταιον ἡγοῦμαι περὶ τῆς
|
| 90 | τελευτῆς ὁντινοῦν ποιεῖσθαι λόγον.
|
| 91 |
|
| 92 | Δημοσθένους, Γ´ ᾿Ολυνθιακὸς
|
| 93 |
|
| 94 | Georgian:
|
| 95 |
|
| 96 | From a Unicode conference invitation:
|
| 97 |
|
| 98 | გთხოვთ ახლავე გაიაროთ რეგისტრაცია Unicode-ის მეათე საერთაშორისო
|
| 99 | კონფერენციაზე დასასწრებად, რომელიც გაიმართება 10-12 მარტს,
|
| 100 | ქ. მაინცში, გერმანიაში. კონფერენცია შეჰკრებს ერთად მსოფლიოს
|
| 101 | ექსპერტებს ისეთ დარგებში როგორიცაა ინტერნეტი და Unicode-ი,
|
| 102 | ინტერნაციონალიზაცია და ლოკალიზაცია, Unicode-ის გამოყენება
|
| 103 | ოპერაციულ სისტემებსა, და გამოყენებით პროგრამებში, შრიფტებში,
|
| 104 | ტექსტების დამუშავებასა და მრავალენოვან კომპიუტერულ სისტემებში.
|
| 105 |
|
| 106 | Russian:
|
| 107 |
|
| 108 | From a Unicode conference invitation:
|
| 109 |
|
| 110 | Зарегистрируйтесь сейчас на Десятую Международную Конференцию по
|
| 111 | Unicode, которая состоится 10-12 марта 1997 года в Майнце в Германии.
|
| 112 | Конференция соберет широкий круг экспертов по вопросам глобального
|
| 113 | Интернета и Unicode, локализации и интернационализации, воплощению и
|
| 114 | применению Unicode в различных операционных системах и программных
|
| 115 | приложениях, шрифтах, верстке и многоязычных компьютерных системах.
|
| 116 |
|
| 117 | Thai (UCS Level 2):
|
| 118 |
|
| 119 | Excerpt from a poetry on The Romance of The Three Kingdoms (a Chinese
|
| 120 | classic 'San Gua'):
|
| 121 |
|
| 122 | [----------------------------|------------------------]
|
| 123 | ๏ แผ่นดินฮั่นเสื่อมโทรมแสนสังเวช พระปกเกศกองบู๊กู้ขึ้นใหม่
|
| 124 | สิบสองกษัตริย์ก่อนหน้าแลถัดไป สององค์ไซร้โง่เขลาเบาปัญญา
|
| 125 | ทรงนับถือขันทีเป็นที่พึ่ง บ้านเมืองจึงวิปริตเป็นนักหนา
|
| 126 | โฮจิ๋นเรียกทัพทั่วหัวเมืองมา หมายจะฆ่ามดชั่วตัวสำคัญ
|
| 127 | เหมือนขับไสไล่เสือจากเคหา รับหมาป่าเข้ามาเลยอาสัญ
|
| 128 | ฝ่ายอ้องอุ้นยุแยกให้แตกกัน ใช้สาวนั้นเป็นชนวนชื่นชวนใจ
|
| 129 | พลันลิฉุยกุยกีกลับก่อเหตุ ช่างอาเพศจริงหนาฟ้าร้องไห้
|
| 130 | ต้องรบราฆ่าฟันจนบรรลัย ฤๅหาใครค้ำชูกู้บรรลังก์ ฯ
|
| 131 |
|
| 132 | (The above is a two-column text. If combining characters are handled
|
| 133 | correctly, the lines of the second column should be aligned with the
|
| 134 | | character above.)
|
| 135 |
|
| 136 | Ethiopian:
|
| 137 |
|
| 138 | Proverbs in the Amharic language:
|
| 139 |
|
| 140 | ሰማይ አይታረስ ንጉሥ አይከሰስ።
|
| 141 | ብላ ካለኝ እንደአባቴ በቆመጠኝ።
|
| 142 | ጌጥ ያለቤቱ ቁምጥና ነው።
|
| 143 | ደሀ በሕልሙ ቅቤ ባይጠጣ ንጣት በገደለው።
|
| 144 | የአፍ ወለምታ በቅቤ አይታሽም።
|
| 145 | አይጥ በበላ ዳዋ ተመታ።
|
| 146 | ሲተረጉሙ ይደረግሙ።
|
| 147 | ቀስ በቀስ፥ ዕንቁላል በእግሩ ይሄዳል።
|
| 148 | ድር ቢያብር አንበሳ ያስር።
|
| 149 | ሰው እንደቤቱ እንጅ እንደ ጉረቤቱ አይተዳደርም።
|
| 150 | እግዜር የከፈተውን ጉሮሮ ሳይዘጋው አይድርም።
|
| 151 | የጎረቤት ሌባ፥ ቢያዩት ይስቅ ባያዩት ያጠልቅ።
|
| 152 | ሥራ ከመፍታት ልጄን ላፋታት።
|
| 153 | ዓባይ ማደሪያ የለው፥ ግንድ ይዞ ይዞራል።
|
| 154 | የእስላም አገሩ መካ የአሞራ አገሩ ዋርካ።
|
| 155 | ተንጋሎ ቢተፉ ተመልሶ ባፉ።
|
| 156 | ወዳጅህ ማር ቢሆን ጨርስህ አትላሰው።
|
| 157 | እግርህን በፍራሽህ ልክ ዘርጋ።
|
| 158 |
|
| 159 | Runes:
|
| 160 |
|
| 161 | ᚻᛖ ᚳᚹᚫᚦ ᚦᚫᛏ ᚻᛖ ᛒᚢᛞᛖ ᚩᚾ ᚦᚫᛗ ᛚᚪᚾᛞᛖ ᚾᚩᚱᚦᚹᛖᚪᚱᛞᚢᛗ ᚹᛁᚦ ᚦᚪ ᚹᛖᛥᚫ
|
| 162 |
|
| 163 | (Old English, which transcribed into Latin reads 'He cwaeth that he
|
| 164 | bude thaem lande northweardum with tha Westsae.' and means 'He said
|
| 165 | that he lived in the northern land near the Western Sea.')
|
| 166 |
|
| 167 | Braille:
|
| 168 |
|
| 169 | ⡌⠁⠧⠑ ⠼⠁⠒ ⡍⠜⠇⠑⠹⠰⠎ ⡣⠕⠌
|
| 170 |
|
| 171 | ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠙⠑⠁⠙⠒ ⠞⠕ ⠃⠑⠛⠔ ⠺⠊⠹⠲ ⡹⠻⠑ ⠊⠎ ⠝⠕ ⠙⠳⠃⠞
|
| 172 | ⠱⠁⠞⠑⠧⠻ ⠁⠃⠳⠞ ⠹⠁⠞⠲ ⡹⠑ ⠗⠑⠛⠊⠌⠻ ⠕⠋ ⠙⠊⠎ ⠃⠥⠗⠊⠁⠇ ⠺⠁⠎
|
| 173 | ⠎⠊⠛⠝⠫ ⠃⠹ ⠹⠑ ⠊⠇⠻⠛⠹⠍⠁⠝⠂ ⠹⠑ ⠊⠇⠻⠅⠂ ⠹⠑ ⠥⠝⠙⠻⠞⠁⠅⠻⠂
|
| 174 | ⠁⠝⠙ ⠹⠑ ⠡⠊⠑⠋ ⠍⠳⠗⠝⠻⠲ ⡎⠊⠗⠕⠕⠛⠑ ⠎⠊⠛⠝⠫ ⠊⠞⠲ ⡁⠝⠙
|
| 175 | ⡎⠊⠗⠕⠕⠛⠑⠰⠎ ⠝⠁⠍⠑ ⠺⠁⠎ ⠛⠕⠕⠙ ⠥⠏⠕⠝ ⠰⡡⠁⠝⠛⠑⠂ ⠋⠕⠗ ⠁⠝⠹⠹⠔⠛ ⠙⠑
|
| 176 | ⠡⠕⠎⠑ ⠞⠕ ⠏⠥⠞ ⠙⠊⠎ ⠙⠁⠝⠙ ⠞⠕⠲
|
| 177 |
|
| 178 | ⡕⠇⠙ ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲
|
| 179 |
|
| 180 | ⡍⠔⠙⠖ ⡊ ⠙⠕⠝⠰⠞ ⠍⠑⠁⠝ ⠞⠕ ⠎⠁⠹ ⠹⠁⠞ ⡊ ⠅⠝⠪⠂ ⠕⠋ ⠍⠹
|
| 181 | ⠪⠝ ⠅⠝⠪⠇⠫⠛⠑⠂ ⠱⠁⠞ ⠹⠻⠑ ⠊⠎ ⠏⠜⠞⠊⠊⠥⠇⠜⠇⠹ ⠙⠑⠁⠙ ⠁⠃⠳⠞
|
| 182 | ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲ ⡊ ⠍⠊⠣⠞ ⠙⠁⠧⠑ ⠃⠑⠲ ⠔⠊⠇⠔⠫⠂ ⠍⠹⠎⠑⠇⠋⠂ ⠞⠕
|
| 183 | ⠗⠑⠛⠜⠙ ⠁ ⠊⠕⠋⠋⠔⠤⠝⠁⠊⠇ ⠁⠎ ⠹⠑ ⠙⠑⠁⠙⠑⠌ ⠏⠊⠑⠊⠑ ⠕⠋ ⠊⠗⠕⠝⠍⠕⠝⠛⠻⠹
|
| 184 | ⠔ ⠹⠑ ⠞⠗⠁⠙⠑⠲ ⡃⠥⠞ ⠹⠑ ⠺⠊⠎⠙⠕⠍ ⠕⠋ ⠳⠗ ⠁⠝⠊⠑⠌⠕⠗⠎
|
| 185 | ⠊⠎ ⠔ ⠹⠑ ⠎⠊⠍⠊⠇⠑⠆ ⠁⠝⠙ ⠍⠹ ⠥⠝⠙⠁⠇⠇⠪⠫ ⠙⠁⠝⠙⠎
|
| 186 | ⠩⠁⠇⠇ ⠝⠕⠞ ⠙⠊⠌⠥⠗⠃ ⠊⠞⠂ ⠕⠗ ⠹⠑ ⡊⠳⠝⠞⠗⠹⠰⠎ ⠙⠕⠝⠑ ⠋⠕⠗⠲ ⡹⠳
|
| 187 | ⠺⠊⠇⠇ ⠹⠻⠑⠋⠕⠗⠑ ⠏⠻⠍⠊⠞ ⠍⠑ ⠞⠕ ⠗⠑⠏⠑⠁⠞⠂ ⠑⠍⠏⠙⠁⠞⠊⠊⠁⠇⠇⠹⠂ ⠹⠁⠞
|
| 188 | ⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠁⠎ ⠙⠑⠁⠙ ⠁⠎ ⠁ ⠙⠕⠕⠗⠤⠝⠁⠊⠇⠲
|
| 189 |
|
| 190 | (The first couple of paragraphs of "A Christmas Carol" by Dickens)
|
| 191 |
|
| 192 | Compact font selection example text:
|
| 193 |
|
| 194 | ABCDEFGHIJKLMNOPQRSTUVWXYZ /0123456789
|
| 195 | abcdefghijklmnopqrstuvwxyz £©µÀÆÖÞßéöÿ
|
| 196 | –—‘“”„†•…‰™œŠŸž€ ΑΒΓΔΩαβγδω АБВГДабвгд
|
| 197 | ∀∂∈ℝ∧∪≡∞ ↑↗↨↻⇣ ┐┼╔╘░►☺♀ fi�⑀₂ἠḂӥẄɐː⍎אԱა
|
| 198 |
|
| 199 | Greetings in various languages:
|
| 200 |
|
| 201 | Hello world, Καλημέρα κόσμε, コンニチハ
|
| 202 |
|
| 203 | Box drawing alignment tests: █
|
| 204 | ▉
|
| 205 | ╔══╦══╗ ┌──┬──┐ ╭──┬──╮ ╭──┬──╮ ┏━━┳━━┓ ┎┒┏┑ ╷ ╻ ┏┯┓ ┌┰┐ ▊ ╱╲╱╲╳╳╳
|
| 206 | ║┌─╨─┐║ │╔═╧═╗│ │╒═╪═╕│ │╓─╁─╖│ ┃┌─╂─┐┃ ┗╃╄┙ ╶┼╴╺╋╸┠┼┨ ┝╋┥ ▋ ╲╱╲╱╳╳╳
|
| 207 | ║│╲ ╱│║ │║ ║│ ││ │ ││ │║ ┃ ║│ ┃│ ╿ │┃ ┍╅╆┓ ╵ ╹ ┗┷┛ └┸┘ ▌ ╱╲╱╲╳╳╳
|
| 208 | ╠╡ ╳ ╞╣ ├╢ ╟┤ ├┼─┼─┼┤ ├╫─╂─╫┤ ┣┿╾┼╼┿┫ ┕┛┖┚ ┌┄┄┐ ╎ ┏┅┅┓ ┋ ▍ ╲╱╲╱╳╳╳
|
| 209 | ║│╱ ╲│║ │║ ║│ ││ │ ││ │║ ┃ ║│ ┃│ ╽ │┃ ░░▒▒▓▓██ ┊ ┆ ╎ ╏ ┇ ┋ ▎
|
| 210 | ║└─╥─┘║ │╚═╤═╝│ │╘═╪═╛│ │╙─╀─╜│ ┃└─╂─┘┃ ░░▒▒▓▓██ ┊ ┆ ╎ ╏ ┇ ┋ ▏
|
| 211 | ╚══╩══╝ └──┴──┘ ╰──┴──╯ ╰──┴──╯ ┗━━┻━━┛ ▗▄▖▛▀▜ └╌╌┘ ╎ ┗╍╍┛ ┋ ▁▂▃▄▅▆▇█
|
| 212 | ▝▀▘▙▄▟
|