Python 文字列内の HTML エンティティをデコードしますか? 質問する

Question

Python 3.4以上

使用html.unescape():

import html
print(html.unescape('&pound;682m'))

FYIhtml.parser.HTMLParser.unescapeは非推奨であり、3.5で削除される予定だったただし、これは誤って残されたものです。すぐに言語から削除される予定です。

Python 2.6-3.3

HTMLParser.unescape()標準ライブラリから使用できます:

Python 2.6-2.7の場合はHTMLParser
Python 3の場合はhtml.parser

>>> try:
...     # Python 2.6-2.7 
...     from HTMLParser import HTMLParser
... except ImportError:
...     # Python 3
...     from html.parser import HTMLParser
... 
>>> h = HTMLParser()
>>> print(h.unescape('&pound;682m'))
£682m

また、sixインポートを簡素化する互換性ライブラリ:

>>> from six.moves.html_parser import HTMLParser
>>> h = HTMLParser()
>>> print(h.unescape('&pound;682m'))
£682m

Answer 1

Python 3.4以上

使用html.unescape():

import html
print(html.unescape('&pound;682m'))

FYIhtml.parser.HTMLParser.unescapeは非推奨であり、3.5で削除される予定だったただし、これは誤って残されたものです。すぐに言語から削除される予定です。

Python 2.6-3.3

HTMLParser.unescape()標準ライブラリから使用できます:

Python 2.6-2.7の場合はHTMLParser
Python 3の場合はhtml.parser

>>> try:
...     # Python 2.6-2.7 
...     from HTMLParser import HTMLParser
... except ImportError:
...     # Python 3
...     from html.parser import HTMLParser
... 
>>> h = HTMLParser()
>>> print(h.unescape('&pound;682m'))
£682m

また、sixインポートを簡素化する互換性ライブラリ:

>>> from six.moves.html_parser import HTMLParser
>>> h = HTMLParser()
>>> print(h.unescape('&pound;682m'))
£682m

Python 文字列内の HTML エンティティをデコードしますか? 質問する

ベストアンサー1

Python 3.4以上

Python 2.6-3.3

おすすめ記事