perlcn − XXXXXXXXX Perl XXXXXX
XXXXXXXXX Perl XXXXXXX !
XX 5.8.0 XXXXXXX , Perl XXXXXXXXXXXXXXX Unicode ( XXXXXXX ) XXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX ; CJK ( XXXXXXX ) XXXXXXXXXXXXXXXXX . Unicode XXXXXXXXXXXXXXXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXX: XXXXXXXXX , XXXXXXXXXX , XXXXXXXXXXXXXXXXXX ( XXXXXXXX , XXXXXXXXXXX , XXXXXXXXXX , XXXXXXXXX , XXXXXXX , XXXXXXXXXX , XXXX ). XXXXXXXXXXXXXXXXXXXXXXXXXXX ( XX PC XXXXXXXXXX ).
Perl XXXXXXX Unicode XXXXXXXXX . XXXXX Perl XXXXXXXXXXXXXXXXXXXXXXXX Unicode XXX ; Perl XXXXXXXXXXXXXX ( XXXXXXXXXXXXXXXXX ) XXXXXXX Unicode XXXXXXXXX . XXXXXXXXXXXXXXXXX , XXXXXXXXXXX Unicode XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX , Perl XXXXXXX Encode XXXXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX .
Encode XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX (’gb2312’ XXX ’euc−cn’):
euc−cn Unix XXXXXXXXXX, XXXXXXXXXXXXXXXXXXXX gb2312−raw XXXXXXXXXXXXX (XXXXXXX) GB2312 XXXXX gb12345 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX iso−ir−165 GB2312 + GB6345 + GB8565 + XXXXXXXX cp936 XXXXXXX 936, XXXXXXXXX 'GBK' (XXXXXXXXXXXXX) XXXXXX hz 7 XXXXXXXXXXX GB2312 XXXXX
XXXXXXXXX , XX EUC-CN XXXXXXXXXXXXXXXXX Unicode, XXXXXXXXXXXXXXXXXXXXX:
perl −Mencoding=euc−cn,STDOUT,utf8 −pe1 < file.euc−cn > file.utf8
Perl XXXXXXXXXX "piconv", XXXXXXXXXXX Perl XXXXXXXXXXXXXXXXXXXXXXXXXX , XXXXXXXXX:
piconv −f euc−cn −t utf8 < file.euc−cn > file.utf8 piconv −f utf8 −t euc−cn < file.utf8 > file.euc−cn
XXXXX , XXXXX encoding XXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX , XXXXXXXXX:
#!/usr/bin/env perl # XXXXXX euc−cn XXXXXXXX; XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX euc−cn XXXXX use encoding 'euc−cn', STDIN => 'euc−cn', STDOUT => 'euc−cn'; print length("XXXX"); # 2 (XXXXXXXXXXXXXXX) print length('XXXX'); # 4 (XXXXXXXXXXXXXXXX) print index("XXXXXXXXX", "XXXXXX"); # −1 (XXXXXXXXXXXXXXXXXX) print index('XXXXXXXXX', 'XXXXXX'); # 1 (XXXXXXXXXXXXXXXXX)
XXXXXXXXXXXXXXXXXXXX , " XX " XXXXXXXXXXXXXXXX " XX " XXXXXXXXXXXXXXXXXXXXXX EUC-CN XXXXXX " XXX "; " XX " XXXXXXXXXXXXXXXXXXX " XXX " XXXXXXXXXXXXXXXXXXXXXX " XXX ". XXXXXXXXXXXX EUC-CN XXXXXXXXXXXXXXXXXXXXXXXXXX .
XXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXX , XXXXXXX
CPAN (<http://www.cpan.org/>)
XXX Encode::HanExtra XXXX .
XXXXXXXXXXXXXXXXXXXXXXXXXXXX:
gb18030 XXXXXXXXXXXXXXXXXX, XXXXXXXXXXXXXXX
XXXXX , Encode::HanConvert XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX:
big5−simp Big5 XXXXXXXXXXX Unicode XXXXXXXXXXXXX gbk−trad GBK XXXXXXXXXXX Unicode XXXXXXXXXXXXX
XXXXXXX GBK XX Big5 XXXXXXXXX , XXXXXXXXXXXXXXXXXXXXXX b2g.pl XX g2b.pl XXXXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXX:
use Encode::HanConvert; $euc_cn = big5_to_gb($big5); # XX Big5 XXXX GBK $big5 = gb_to_big5($euc_cn); # XX GBK XXXX Big5
XXXXXXXXXXXX
XXXXXXX Perl XXXXXXXXXXXXXXXXXXXXXX (
XXXXXXXXXXXXXXXXXXXXX ),
XXXXXXXXXXXXXXXXX Perl
XXXXXXXX , XXXX Unicode
XXXXXXXXXXX . XXXX ,
XXXXXXXXXXXXXXXXXXXX:
XXXXX
Perl XXXXXXXXXXXX
<http://www.perl.com/>
Perl XXXXXXX ( XXXXXXXXXXXXXXXXXX )
<http://www.cpan.org/>
Perl XXXXXXXXXXX (Comprehensive Perl Archive Network)
<http://lists.perl.org/>
Perl XXXXXXXXXXXXXX
XXXX
Perl XXXXXXXX
<http://www.oreilly.com.cn/index.php?func=booklist&cat=68>
XXXXXXXXXXXXXXXXXXXX Perl XXXXX
Perl
XXXXXXXXXXX
<http://www.pm.org/groups/asia.html>
XXXXX Perl XXXXXXXXXX
Unicode
XXXXXXXXXX
<http://www.unicode.org/>
Unicode XXXXXXXXX (Unicode XXXXXXXXXXXXXXXXX )
<http://www.cl.cam.ac.uk/%7Emgk25/unicode.html>
Unix/Linux XXXXX UTF−8 XXX Unicode XXXXXX
Encode, Encode::CN, encoding, perluniintro, perlunicode
Jarkko Hietaniemi <jhi@iki.fi>
Audrey Tang ( XXXXXX ) <audreyt@audreyt.org>