When using Microsoft's Word or PowerPoint products to write HTML documents, when you save as HTML, the program translates certain characters such that they cause problems with non-Microsoft browsers (e.g. Netscape). For example, Hex Decimal Character === ======= ========================= 85 133 Ellipsis (...) 91 145 Open Single-Quote 92 146 Close Single-Quote 93 147 Open Double-Quote 94 148 Close Double-Quote 95 149 Bullet 96 150 En Dash 97 151 Em Dash 99 153 Trademark Sign A1 161 Inverted Exclamation mark A2 162 Cent Sign A3 163 Pound Sign A4 164 Currency Symbol A5 165 Yen Symbol A9 169 Copyright Symbol AC 172 Not Symbol and many more. See http://med.stanford.edu/web/resources/demoroniser.html for a quick and dirty Perl program, which I downloaded from http://www.fourmilab.ch/webtools/demoroniser and put in my ~jasper/bin directory. --------------------------------------------------------------------------------- There are many, many pages on internationalization, but it's hard to find any that just tell you how to do it. I started with - http://www.hclrss.demon.co.uk/unicode - http://www.hclrss.demon.co.uk/unicode/htmlunicode.html for "Creating multilingual web pages" - http://ppewww.ph.gla.ac.uk/~flavell/charset/ for some i18n issues, like the origin of the term "i18n" (pronounced "i-eighteen-n") which is an abbreviation of the word "internationalization". It has an "i", followed by 18 letters, then an "n". And you don't have to get hung up on whether the "z" is an "s" or not. - http://ppewww.ph.gla.ac.uk/~flavell/charset/checklist for different ways one might code HTML to handle different languages. - http://www.cs.tut.fi/~jkorpela/chars.html for a very long tutorial on Character Code Issues. - http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT for the mapping of cp1252 characters to Unicode - http://rf.net/~james/perli18n.html for Perl, Unicode and i18N FAQ - http://??? for how some positions are reservered for local national characters. - http://??? for the warning that \ (backslash, aka reverse solidus) gets displayed as the yen sign when using charset=shift_jis, but this only seems to happen when . - There's a problem where backslashes (also known as a "reverse solidus") appear as the yen symbol (a capital Y with 2 horizontal lines in its lower half), and the tilde (~) appear as an overline. According to http://www.cs.mcgill.ca/~aelias4/encodings.html, This "problem" has become so pervasive that it is no longer a problem at all: Japanese people all know and expect to find yen symbols instead of backslashes. You should think of the yen and backslash as just variant ways of writing the same character. Get used to seeing such things as DOS/Windows paths separators and escape characters look like yen symbols.