euc2html
--------
By, "Jordan Husney" , (c) 2001 and
distributed under the GNU Public License.
This is a really simple application that processes files on a
pipe only (reading from stdin, and writing to stdout). It converts
any double-byte Japanese (and maybe Chinese/Korean) EUC encoded
characters and replaces them with HTML 4.0 Unicode entities.
Example usage:
cat some_euc_encoded_file.txt | euc2html > output.html
This application is basically a command line hack of a rather
well done Win32 hack of an application by a master of language
encodings, "William A. McKee" , who took pity
on me and authored the original application when I struggling to
include Kanji on the Everything2 (http://www.everything2.com) website.
If you look at the source code, the hooks exist to do
bi-directional processing (that is, convert HTML back to EUC). Also,
JIS and S/JIS processing would be rather trivial to add. I do not
need this functionality, so I did not implement it. If you do, send
me that patches and I will roll it into a release.
Good luck.
Jordan.
---
jordanh@remotepoint.com