Edit History Actions Discussion

Diff for "FAQ/Unicode"

Differences between revisions 11 and 12
Revision 11 as of 2008-05-09 15:24:11
Size: 1380
Comment:
Revision 12 as of 2008-12-16 19:16:17
Size: 931
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
== Question ==
Does the shell provide unicode support?
= Unicode =
Line 4: Line 3:
== Answer ==
Yes. Both for commandline programs and for webpages.
'''Question/Problem''': Are you having problems with character encodings?
Line 7: Line 5:
=== Problem #1 - Commandline programs === '''Answer/Fix''': Try using Unicode/UTF-8.

== Commandline programs ==
Line 18: Line 18:
To use unicode in irssi, type this in the command prompt in irssi: For irssi, see [[FAQ/IrssiCharset]].
Line 20: Line 20:
{{{
/set term_charset UTF-8}}}
Use this command if you want to send text in another charset to other users that are still using ISO for example (note however that you will not be able to send UTF text with this setting):

{{{
/set recode_out_default_charset iso-8859-1}}}


Remember that you have to change the charset in your SSH-client as well! Use the translation option in [[PuTTY]] for example.

=== Problem #2 - Webpages ===
== Problem #2 - Webpages ==

Unicode

Question/Problem: Are you having problems with character encodings?

Answer/Fix: Try using Unicode/UTF-8.

Commandline programs

Localized characters doesn't display correctly in irssi, nano etc (they are replaced with question marks, squares or something similar.)

Solution:

Write this command at the shell prompt (you don't need to do this if you already choose UTF-8 in the shell setup):

echo 'export LC_ALL="en_US.UTF-8"' >> ~/.profile

Next time you log in your shell will be in unicode.

For irssi, see FAQ/IrssiCharset.

Problem #2 - Webpages

Characters in webpages doesn't display correctly.

Solution:

Save the webpages in UTF-8 from your text-editor. Then create a new file named .htaccess in the same directory as your webpages (public_html) with the following content:

AddDefaultCharset UTF-8