PTT數位生活區 / Python

Re: [問題] 請問一下unicode的問題

看板Python作者pky.時間18年前 (2007/01/06 15:32)推噓0(0推 0噓 0→)

留言0則, 0人參與討論串6/18 (看更多)

※ 引述《pkyosx.bbs@ptt.cc (Insomnia)》之銘言： : ※ 引述《pkyosx (Insomnia)》之銘言： : : 直接用 Ultra Editor Hex進位模式驗證: : : 存檔前: : : => FF FE 11 62 : : 存檔後: 終於發現問題就在於 notepad 存 UTF-8 的時候多存東西上去了!! : : => FF FE FF FE 11 62 : : 但是 notepad 存 unicode(UTF-16), Ultra-Editor 存 UTF-8, UTF-16 都不會有問題 : : => FF FE 11 62 : : 結論: : : 習慣用 notepad 開文件的人小心阿= =" ...TMD 總結一下: BOM on wiki: http://en.wikipedia.org/wiki/Byte_Order_Mark UTF-8 沒有 BE LE 的問題, 所以拿 BOM 只是用來跟其他編碼識別我用 hexdump 出來是 EF BB BF 用 ultra-editor 看到的卻是 FF FE FF FE 但是用裡面的一個 unicode/ascii/utf8 轉 utf8 (ascii 編輯) 後才變成 EF BB BF 不知道聰明的 Ultra editor 到底做了什麼事情我猜可能跟編輯的編碼有關這是另一位板友提供的連結: http://evanjones.ca/python-utf8.html 裡面一小段 code 說出了 python 在處理 utf-8 奇怪的地方 >>> codecs.BOM_UTF16.decode( "utf16" ) u'' >>> codecs.BOM_UTF8.decode( "utf8" ) u'\ufeff' 他的建議是自己在偵測到 utf-8 的時候手動把 u'\ufeff' 拿掉 import codecs if s.beginswith( codecs.BOM_UTF8 ): # The byte string s begins with the BOM: Do something. # For example, decode the string as UTF-8 if u[0] == unicode( codecs.BOM_UTF8, "utf8" ): # The unicode string begins with the BOM: Do something. # For example, remove the character. # Strip the BOM from the beginning of the Unicode string, if it exists u.lstrip( unicode( codecs.BOM_UTF8, "utf8" ) ) 這是 python 的 bug 嗎? 其實我不確定, 如果有人系統本身是 UTF-8 的可以試試看搞不好 decode 出來不會多個 FEFF -- ※Post by pky from pkyosx.Dorm-GD2.NCTU.edu 老鼠的香香乳酪洞˙電子佈告欄系統˙alexbbs.twbbs.org˙140.113.166.7

‣ 返回看板[ Python ] 程設

‣ 更多 pky. 的文章

文章代碼(AID): #15dr1q00 (Python)

討論串 (同標題文章)

本文引述了以下文章的的內容：

1

1

Re: [問題] 請問一下unicode的問題

18年前, 01/06

完整討論串 (本文為第 6 之 18 篇)：

排序：最新先 | 最舊先 | 留言數

1

3

Re: [問題] 請問一下unicode的問題

16年前, 10/02

Re: [問題] 請問一下unicode的問題

18年前, 05/06

1

2

Re: [問題] 請問一下unicode的問題

18年前, 05/05

Re: [問題] 請問一下unicode的問題

18年前, 01/12

Re: [問題] 請問一下unicode的問題

18年前, 01/11

Re: [問題] 請問一下unicode的問題

18年前, 01/11

Re: [問題] 請問一下unicode的問題

18年前, 01/09

1

1

Re: [問題] 請問一下unicode的問題

18年前, 01/08

Re: [問題] 請問一下unicode的問題

18年前, 01/08

Re: [問題] 請問一下unicode的問題

18年前, 01/07

在新視窗開啟完整討論串 (共18篇)

Python 近期熱門文章

3

3

[閒聊] 各位現在用os.path 還是用pathlib.Path

1天前, 07/17

2

6

[閒聊] 2024年的自我python學習

1天前, 07/17

1

2

[問題] 用Whisper AI幫我下載字幕（有酬）

3月前, 04/01

1

3

[問題] selenium 有辦法做檔案上傳嗎?

5月前, 02/03

3

13

Fw: [討論] 哈囉請問有給python新手的課程嗎

5月前, 01/24

4

19

Re: [問題] @property 真正的運用是啥

6月前, 01/15

3

8

[問題] class type 跟 class object

6月前, 01/10

9

17

[閒聊] python平行處理效能是否很差?

6月前, 01/07

更多近期熱門文章 >>

PTT數位生活區即時熱門文章

13

20

[心得] 整機全球啟動 9800X3D+5090

[ PC_Shopping ]

6小時前, 07/19

5

9

Re: [賣/台中全國]零件機 HP 14-dq1033cl

[ nb-shopping ]

6小時前, 07/19

18

70

[請益] 尋找代替Adobe 的軟體

[ PC_Shopping ]

7小時前, 07/18

8

9

[心得] 家訪只是過程-Linn Selekt Dsm Organik

8小時前, 07/18

9

33

[請益] 現在B550主機板推薦?

[ PC_Shopping ]

9小時前, 07/18

46

150

Re: [情報] 視博通結束全漢全產品代理合作

[ PC_Shopping ]

10小時前, 07/18

6

26

[菜單] 6k內升級顯卡

[ PC_Shopping ]

11小時前, 07/18

16

36

Re: [新聞] 黃仁勳最愛手機竟然是Google Pixel 他親

11小時前, 07/18

更多即時熱門文章 >>

‣ 返回看板[ Python ] 程設

‣ 更多 pky. 的文章

文章代碼(AID): #15dr1q00 (Python)