• <ins id="pjuwb"></ins>
    <blockquote id="pjuwb"><pre id="pjuwb"></pre></blockquote>
    <noscript id="pjuwb"></noscript>
          <sup id="pjuwb"><pre id="pjuwb"></pre></sup>
            <dd id="pjuwb"></dd>
            <abbr id="pjuwb"></abbr>

            看到有前輩寫(xiě)了一個(gè)UTF-8與UNICODE相互轉(zhuǎn)換的代碼,順便提一下,希望可以給大家提供一點(diǎn)幫助.
            下面是一些編碼格式的bit長(zhǎng)

            Examples of fixed-width encoding forms:

            Type Each character
            encoded as
            Notes
              7-bit a single 7-bit quantity example: ISO 646
              8-bit G0/G1 a single 8-bit quantity with constraints on use of C0 and C1 spaces
              8-bit a single 8-bit quantity with no constraints on use of C1 space
              8-bit EBCDIC a single 8-bit quantity with the EBCDIC conventions rather than ASCII conventions
            16-bit (UCS-2) a single 16-bit quantity within a code space of 0..FFFF
            32-bit (UCS-4) a single 32-bit quantity within a code space 0..7FFFFFFF
            32-bit (UTF-32) a single 32-bit quantity within a code space of 0..10FFFF
            16-bit DBCS process code a single 16-bit quantity example: UNIX widechar implementations of Asian CCS's
            32-bit DBCS process code a single 32-bit quantity example: UNIX widechar implementations of Asian CCS's
            DBCS Host two 8-bit quantities following IBM host conventions

            Examples of variable-width encoding forms:

            Name Characters are encoded as Notes
            UTF-8 a mix of one to four 8-bit code units in Unicode
            and one to six code units in 10646
            used only with Unicode/10646
            UTF-16 a mix of one to two 16 bit code units used only with Unicode/10646

            Boost中提供了一個(gè)UTF-8 Codecvt Facet,可以在utf8和UCS-4(Unicode-32)之間轉(zhuǎn)換.
            使用方式如下

              //...
              // My encoding type
              typedef wchar_t ucs4_t;

              std::locale old_locale;
              std::locale utf8_locale(old_locale,new utf8_codecvt_facet<ucs4_t>);

              // Set a New global locale
              std::locale::global(utf8_locale);

              //  UCS-4 轉(zhuǎn)換為 UTF-8
              {
                std::wofstream ofs("data.ucd");
                ofs.imbue(utf8_locale);
                std::copy(ucs4_data.begin(),ucs4_data.end(),
                      std::ostream_iterator<ucs4_t,ucs4_t>(ofs));
              }

              // 讀入 UTF-8 ,轉(zhuǎn)換為 UCS-4 
              std::vector<ucs4_t> from_file;
              {
                std::wifstream ifs("data.ucd");
                ifs.imbue(utf8_locale);
                ucs4_t item = 0;
                while (ifs >> item) from_file.push_back(item);
              }
              //...
            UTF-8 Codecvt Facet詳見(jiàn)
            http://www.boost.org/libs/serialization/doc/codecvt.html

            posted on 2006-02-15 17:19 張沈鵬 閱讀(2667) 評(píng)論(2)  編輯 收藏 引用
            Comments

            只有注冊(cè)用戶登錄后才能發(fā)表評(píng)論。
            網(wǎng)站導(dǎo)航: 博客園   IT新聞   BlogJava   博問(wèn)   Chat2DB   管理


             
            国产精品久久久久久久久| 国产精品成人精品久久久| 伊人久久一区二区三区无码| 久久久久一本毛久久久| 亚洲成av人片不卡无码久久| 久久综合亚洲色一区二区三区| 亚洲精品乱码久久久久久| 亚洲精品无码久久久久去q| 亚洲精品无码久久千人斩| 中文字幕亚洲综合久久| 亚洲日韩欧美一区久久久久我| 亚洲AV无码久久精品色欲| aaa级精品久久久国产片| 国产无套内射久久久国产| 久久久精品久久久久影院| 精品久久久久久国产| 久久人人爽人人爽人人片AV麻烦| 激情伊人五月天久久综合| 久久久久国产亚洲AV麻豆| 97久久精品无码一区二区天美| 久久久久亚洲精品中文字幕| 国产亚洲精久久久久久无码| 欧美成人免费观看久久| 久久久久久久综合日本亚洲 | 久久国产热精品波多野结衣AV| 99久久精品九九亚洲精品| 一本色综合网久久| 亚洲第一永久AV网站久久精品男人的天堂AV | 国产亚洲欧美成人久久片| 精品国产日韩久久亚洲| 久久国产影院| 88久久精品无码一区二区毛片 | 国产精品久久久久久福利69堂| 久久国产免费直播| 久久久久亚洲av毛片大| 亚洲狠狠久久综合一区77777| 久久综合亚洲欧美成人| 无码国产69精品久久久久网站 | 久久超碰97人人做人人爱| 久久精品国产亚洲av麻豆图片| 久久国产精品视频|