<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"
http://www.w3.org/TR/html4/loose.dtd">
<html><head>
<title>UTF-8 Sampler</title>
<META http-equiv="Content-Type" content="text/html">
</head><body bgcolor="#ffffff" text="#000000">
<h3>UTF-8 Sampler</h3>
UTF-8 is an ASCII-preserving encoding method for
<a href="unicode.html">Unicode</a> (ISO 10646), the Universal Character Set
(UCS). The UCS encodes most of the world's writing systems in a single
character set, allowing you to mix languages and scripts within a document
without needing any tricks for switching character sets. This web page is
encoded directly in UTF-8.
<p>
<a href="k95.html">Kermit 95</a>
can display UTF-8 plain text in Windows NT, XP, or 2000 when using a
monospace Unicode font like Lucida Console or Courier New.
<a href="k95next.html">The forthcoming GUI version of Kermit 95</a>
will be able to display it too, even in Windows 95, 98, and ME.
<a href="ckermit.html">C-Kermit 7.0</a> and later can
handle it too, <a href="
http://www.cl.cam.ac.uk/~mgk25/unicode.html">if you
have a Unicode display</a>. As many languages as
are representable in your font can be seen on the screen at the same time.
<p>
This, however, is a Web page. Some Web browsers can handle UTF-8, some can't.
And those that can might not have a sufficiently populated font to work with.
<a href="
http://www.hclrss.demon.co.uk/unicode/fonts.html">CLICK HERE</a>
for a survey of Unicode fonts.
<p>
First, the Euro symbol: €.
<p>
From the Anglo-Saxon <a href="
http://www.ragweedforge.com/poems.html"><cite>Rune Poem</cite></a> (Rune version):
<p><blockquote>
ᚠᛇᚻ᛫ᛒᛦᚦ᛫ᚠᚱᚩᚠᚢᚱ᛫ᚠᛁᚱᚪ᛫ᚷᛖᚻᚹᛦᛚᚳᚢᛗ<br>
ᛋᚳᛖᚪᛚ᛫ᚦᛖᚪᚻ᛫ᛗᚪᚾᚾᚪ᛫ᚷᛖᚻᚹᛦᛚᚳ᛫ᛗᛁᚳᛚᚢᚾ᛫ᚻᛦᛏ᛫ᛞᚫᛚᚪᚾ<br>
ᚷᛁᚠ᛫ᚻᛖ᛫ᚹᛁᛚᛖ᛫ᚠᚩᚱ᛫ᛞᚱᛁᚻᛏᚾᛖ᛫ᛞᚩᛗᛖᛋ᛫ᚻᛚᛇᛏᚪᚾ᛬<br>
</blockquote>
<p>
From Laȝamon's<i> <a href="
http://mesl.itd.umich.edu/b/brut/">Brut</a></i>
(<i>The Chronicles of England</i>, Middle English, West Midlands):
<p>
<blockquote>
An preost wes on leoden, Laȝamon was ihoten<br>
He wes Leovenaðes sone -- liðe him be Drihten.<br>
He wonede at Ernleȝe at æðelen are chirechen,<br>
Uppen Sevarne staþe, sel þar him þuhte,<br>
Onfest Radestone, þer he bock radde.
</blockquote>
<p>
From the <cite>Tagelied</cite> of
<a href="
http://www.gunnet.de/Wolframs-Eschenbach/wolfram.htm">
<b>Wolfram von Eschenbach</b></a> (Middle High German):
<p><blockquote>
Sîne klâwen durh die wolken sint geslagen,<br>
er stîget ûf mit grôzer kraft,<br>
ich sih in grâwen tägelîch als er wil tagen,<br>
den tac, der im geselleschaft<br>
erwenden wil, dem werden man,<br>
den ich mit sorgen în verliez.<br>
ich bringe in hinnen, ob ich kan.<br>
sîn vil manegiu tugent michz leisten hiez.<br>
</blockquote><p>
Some lines of
<a href="
http://users.hol.gr/~artemis/odysseas_elytis.htm">
<b>Odysseus Elytis</b></a> (Greek):
<blockquote>
Τη γλώσσα μου έδωσαν ελληνική<br>
το σπίτι φτωχικό στις αμμουδιές του Ομήρου.<br>
Μονάχη έγνοια η γλώσσα μου στις αμμουδιές του Ομήρου.<br>
<p>
από το Άξιον Εστί<br>
του Οδυσσέα Ελύτη
</blockquote>
<p>
A stanza of
<a href="
http://www.ocf.berkeley.edu/%7Eleong/Russkaya%20Literatura/Aleksandr%20Sergeevich%20Pushkin.htm"><b>Pushkin</b></a>'s <cite>Bronze Horseman</cite> (Russian):<br>
<p><blockquote>
На берегу пустынных волн<br>
Стоял он, дум великих полн,<br>
И вдаль глядел. Пред ним широко<br>
Река неслася; бедный чёлн<br>
По ней стремился одиноко.<br>
По мшистым, топким берегам<br>
Чернели избы здесь и там,<br>
Приют убогого чухонца;<br>
И лес, неведомый лучам<br>
В тумане спрятанного солнца,<br>
Кругом шумел.<br>
</blockquote><p>
<a href="
http://www.compling.hu-berlin.de/~johannes/mxedruli/"><b>Šota Rustaveli</b></a>'s Veṗxis Ṭq̇aosani,
̣︡Th, <cite>The Knight in the Tiger's Skin</cite> (Georgian):<p>
<blockquote>
ვეპხის ტყაოსანი
შოთა რუსთაველი
<p>
ღმერთსი შემვედრე, ნუთუ კვლა დამხსნას სოფლისა შრომასა,
ცეცხლს, წყალსა და მიწასა, ჰაერთა თანა მრომასა;
მომცნეს ფრთენი და აღვფრინდე, მივჰხვდე მას ჩემსა ნდომასა,
დღისით და ღამით ვჰხედვიდე მზისა ელვათა კრთომაასა.
</blockquote>
<p>
And from the sublime to the ridiculous, here is a
<a href="
http://hcs.harvard.edu/~igp/glass.html">
certain phrase</a> in an assortment of languages:
<p><blockquote>
<b>Greek</b>: Μπορώ να φάω σπασμένα γυαλιά χωρίς να πάθω τίποτα.<br>
<b>Sanskrit</b>: (NEEDED)<br>
<b>Etruscan</b>: (NEEDED)<br>
<b>Latin</b>: Vitrum edere possum; mihi non nocet.<br>
<b>Esperanto</b>: Mi povas manĝi vitron, ĝi ne damaĝas min.<br>
<b>French</b>: Je peux manger du verre, cela ne me fait pas mal.<br>
<b>Provençal</b>: Pòdi manjar de veire, me nafrariá pas.<br>
<b>Québécois</b>: J'peux bouffer d'la vitre, ça m'fa pas mal.<br>
<b>Walloon</b>: Dji pou magnî do vêre, çoula m' freut nén må.<br>
<b>Champenois</b>: (NEEDED)<br>
<b>Lorrain</b>: (NEEDED)<br>
<b>Picard</b>: (NEEDED)<br>
<b>Corsican</b>: (NEEDED)<br>
<b>Occitan</b>: (NEEDED)<br>
<b>Catalan</b>: Puc menjar vidre que no em fa mal.<br>
<b>Spanish</b>: Puedo comer vidrio, no me hace daño.<br>
<b>Basque</b>: Kristala jan dezaket, ez dit minik ematen.<br>
<b>Aragones</b>: Puedo minchar beire, no me'n fa mal .<br>
<b>Galician</b>: Eu podo xantar cristais e non cortarme.<br>
<b>Portuguese</b>: Posso comer vidro, não me faz mal.<br>
<b>Brazilian Portuguese</b>: Consigo comer vidro. Não me machuca.<br>
<b>Cabo Verde Creole</b>: M' podê cumê vidru, ca ta maguâ-m'.<br>
<b>Papiamentu</b>: (NEEDED)<br>
<b>Italian</b>: Posso mangiare il vetro e non mi fa male.<br>
<b>Roman</b>: Me posso magna' er vetro, e nun me fa male.<br>
<b>Sicilian</b>: Puotsu mangiari u vitru, nun mi fa mali.<br>
<b>Milanese</b>: Sôn bôn de magnà el véder, el me fa minga mal.<br>
<b>Venetian</b>: Mi posso magnare el vetro, no'l me fa mae.<br>
<b>Rheto-Romance</b>: (NEEDED)<br>
<b>Romanian</b>: Pot să mănânc sticlă și ea nu mă rănește.<br>
<b>Pictish</b>: (NEEDED)<br>
<b>Breton</b>: (NEEDED)<br>
<b>Cornish</b>: Mý a yl dybry gwéder hag éf ny wra ow ankenya.<br>
<b>Welsh</b>: Dw i'n gallu bwyta gwydr, dwy e ddim yn gwneud dolur i mi.<br>
<b>Irish</b>: Tá mé in ann gloine a ithe; Ní chuireann sé isteach nó amach orm.<br>
<b>Scottish Gaelic</b>: S urrainn dhomh gloinne ithe; cha ghoirtich i mi.<br>
<b>Anglo-Saxon</b>: Ic mæg glæs eotan ond hit hearmiað me ne.<br>
<b>Middle English</b>: Ich canne glas eten and hit hirtiþ me nouȝt.<br>
<b>English</b>: I can eat glass and it doesn't hurt me.<br>
<b>Norwegian (Nynorsk):</b> Eg kan eta glas utan å skada meg.<br>
<b>Norwegian (Bokmål):</b> Jeg kan spise glass uten å skade meg.<br>
<b>Icelandic</b>: Èg get borðað gler, það meiðir mig ekki.<br>
<b>Danish</b>: Jeg kan spise glas, det gør ikke ondt på mig.<br>
<b>Soenderjysk</b>: Æ ka æe glass uhen at det go mæ naue.<br>
<b>Frisian</b>: Ik kin glês ite, it docht me net sear.<br>
<b>Dutch</b>: Ik kan glas eten. Het doet me geen pijn.<br>
<b>Afrikaans</b>: Ek kan glas eet, maar dit maak my nie seer nie.<br>
<b>German</b>: Ich kann Glas essen, ohne mir weh zu tun.<br>
<b>Lëtzebuergescht</b>: Ech kan Glas iessen, daat deet mir nët wei.<br>
<b>Schwäbisch</b>: I kå Glas frässa, ond des macht mr nix!<br>
<b>Bayrisch</b>: I koh Glos esa, und es duard ma ned wei.<br>
<b>Allemannisch</b>: I kaun Gloos essen, es tuat ma ned weh.<br>
<b>Schwyzerdütsch</b>: Ich chan Glaas ässe, das tuet mir nöd weeh.<br>
<b>Swedish</b>: Jag kan äta glas, det skadar mig inte.<br>
<b>Finnish</b>: Pystyn syömään lasia. Se ei koske yhtään.<br>
<b>Hungarian</b>: Meg tudom enni az üveget, nem lesz tőle bajom.<br>
<b>Estonian</b>: Ma vōin klaasi süüa, see ei tee mulle midagi.<br>
<b>Latvian</b>: Es varu ēst stiklu, tas man nekaitē.<br>
<b>Lithuanian</b>: Aš galiu valgyti stiklą ir jis manęs nežeidžia<br>
<b>Croatian</b>: Ja mogu jesti staklo i ne boli me.<br>
<b>Czech</b>: Mohu jíst sklo, neublíží mi.<br>
<b>Slovak</b>: Môžem jesť sklo. Nezraní ma.<br>
<b>Polish</b>: Mogę jeść szkło i mi nie szkodzi.<br>
<b>Albanian</b>: Unë mund të ha qelq dhe nuk më gjen gjë.<br>
<b>Slovenian:</b> Lahko jem steklo, ne da bi mi škodovalo.<br>
<b>Serbian</b>: Mogu jesti staklo bez da mi škodi.<br>
<b>Serbian</b>: Могу јести стакло без да ми шкоди.<br>
<b>Macedonian:</b> Можам да јадам стакло, а не ме штета.<br>
<b>Russian</b>: Я могу есть стекло, это мне не вредит.<br>
<b>Ukrainian</b>: Я можу їсти шкло, й воно мені не пошкодить.<br>
<b>Bulgarian</b>: Мога да ям стъкло и не ме боли. <br>
<b>Georgian</b>: მინას ვჭამ და არა მტკივა.<br>
<b>Armenian</b>: Կրնամ ապակի ուտել և ինծի անհանգիստ չըներ։<br>
<b>Turkish</b>: Cam yiyebilirim, bana zararı dokunmaz.<br>
<b>Marath</b>i: मी काच खाऊ शकतो, मला ते दुखत नाही.<br>
<b>Hindi</b>: मैं काँच खा सकता हूँ, मुझे उस से कोई पीडा नहीं होती.<br>
<b>Farsi / Persian</b>: .من می توانم بدونِ احساس درد شيشه بخورم<br>
<b>Pashto</b><a href="#note1">(1)</a>: زه شيشه خوړلې شم، هغه ما نه خوږوي<br>
<b>Arabic</b><a href="#note1">(1)</a>: <span dir="RTL" lang=AR>أنا قادر على أكل الزجاج و هذا لا يؤلمني.</span><br>
<B>Hebrew</B><a href="#note1">(1)</a>: <SPAN dir=rtl lang=HE>אני יכול לאכול זכוכית וזה לא מזיק לי.</SPAN><br>
<B>Yiddish</B><a href="#note1">(1)</a>: <SPAN dir=rtl lang=JI>איך קען עסן גלאָז און עס טוט מיר נישט װײ.</SPAN><br>
<b>Ladino</b>: (NEEDED)<br>
<b>Twi</b>: Metumi awe tumpan, ɜnyɜ me hwee.<br>
<b>Yoruba</b><a href="#note2">(2)</a>: Mo lè je̩ dígí, kò ní pa mí lára.<br>
<b>Malay</b>: Saya boleh makan kaca dan ia tidak mencederakan saya.<br>
<b>Tagalog</b>: Kaya kong kumain nang bubog at hindi ako masaktan.<br>
<b>Chamorro</b>: Siña yo' chumocho krestat, ti ha na'lalamen yo'.<br>
<b>Javanese</b>: Aku isa mangan beling tanpa lara.<br>
<B>Vietnamese</B>: Tôi có thể ăn thủy tinh mà không hại gì.<BR>
<b>Chinese</b>: 我能吞下玻璃而不伤身体。<br>
<b>Japanese</b>: 私はガラスを食べられます。それは私を傷つけません。<br>
<b>Korean</b>: 나는 유리를 먹을 수 있어요. 그래도 아프지 않아요<br>
<b>Thai</b>: ฉันกินกระจกได้ แต่มันไม่ทำให้ฉันเจ็บ<br>
<b>Lojban</b>: mi kakne le nu citka le blaci .iku'i le se go'i na xrani mi<br>
</blockquote>
<p>
For testing purposes, some of these are repeated in a <b>monospace font</b> . . .
<p>
<blockquote>
<pre>
<b>Euro Symbol</b>: €.
<b>Greek</b>: Μπορώ να φάω σπασμένα να γυαλιά χωρίς να πάθω τίποτα.
<b>Icelandic</b>: Èg get borðað gler, það meiðir mig ekki.
<b>Polish</b>: Mogę jeść szkło, i mi nie szkodzi.
<b>Romanian</b>: Pot să mănânc sticlă și ea nu mă rănește.
<b>Ukrainian</b>: Я можу їсти шкло, й воно мені не пошкодить.
<b>Armenian</b>: Կրնամ ապակի ուտել և ինծի անհանգիստ չըներ։
<b>Georgian</b>: მინას ვჭამ და არა მტკივა.
<b>Hindi</b>: मैं काँच खा सकता हूँ, मुझे उस से कोई पीडा नहीं होती.
<B>Hebrew</B><a href="#note1">(1)</a>: <SPAN dir=rtl lang=HE>אני יכול לאכול זכוכית וזה לא מזיק לי.</SPAN>
<b>Arabic</b><a href="#note1">(1)</a>: <span dir="RTL" lang=AR>أنا قادر على أكل الزجاج و هذا لا يؤلمني.</span>
<b>Japanese</b>: 私はガラスを食べられます。それは私を傷つけません。
<b>Thai</b>: ฉันกินกระจกได้ แต่มันไม่ทำให้ฉันเจ็บ
</pre>
</blockquote>
<b>Notes:</b>
<ol>
<li><a name="note1">Correct right-to-left display of these languages
depends on the capabilities of your browser.</a> The period should
appear on the left.
<li><a name="note2">The third word</a> is Latin letter small 'j' followed by
small 'e' with U+0329, Combining Vertical Line Below. This displays
correctly only if your Unicode font includes the U+0329 glyph and your
browser supports combining diacritical marks.
</ol>
<p>
<i>(Additions, corrections, completions,</i>
<a href="mailto:
[email protected]"><i>gratefully accepted</i></a><i>.)</i>
<p>
Other Unicode samplers:<ul><li><a href="
http://www.unicode.org/unicode/standard/WhatIsUnicode.html">What Is Unicode?</a>
<li><a href="
http://www.trigeminal.com/samples/provincial.html">Anyone
can be provincial!</a>
<li><a href="
http://www.macchiato.com/unicode/Unicode_transcriptions.html">Transcriptions of "Unicode"</a>
<li><a href="
http://www.geocities.com/i18nguy/unicode-example.html">Example
Unicode Usage for Business Applications</a>
</ul>
<p>
[ <a href="k95.html">Kermit 95</a> ]
[ <a href="glass.html">K95 Screen Shots</a> ]
[ <a href="ckermit.html">C-Kermit</a> ]
[ <a href="index.html">Kermit Home</a> ]
[ <a href="
http://www.hclrss.demon.co.uk/unicode/fonts.html">Unicode Fonts</a> ]
<hr>
<ADDRESS>
UTF-8 Sampler / <a href="index.html">The Kermit Project</a> /
<a href="
http://www.columbia.edu">Columbia University</a> /
<a href="mailto:
[email protected]">
[email protected]</a> / 17 Jan 2002
</ADDRESS>
</body>
</html>