|
楼主 |
发表于 2005-8-26 17:02:05
|
显示全部楼层
sfatsdu:
可以用用 autogb , 她可以自动识别编码,在字数 > 100 时候,几乎没有识别错误的
在包 zh-autoconvert 里
zh-autoconvert - Chinese HZ/GB/BIG5/UNI/UTF7/UTF8 encodings auto-converter
- hua@hua:~$ autogb -h
- autogb: invalid option -- h
- Usage: autogb [-OPTION] < input > output
- Now autoconvert can only judge gb/big5/hz encoding, so if you want to
- do other conversion, you must set the option --input. Without --input,
- the program will guess the encoding of the input.
- OPTION is one of (case insensitive):
- -i encoding, --input encoding: Set the input encoding
- -o encoding, --output encoding: Set the output encoding
- The encoding should be gb, big5, hz, uni, utf7 or utf8.
复制代码
如 autogb -o utf8 < old.html > new.html |
|