1. After setting up a Python 3.7 environment, install lxml 4.4.1 with pip:
pip install lxml==4.4.1
2. Usage in a Python file, with the following markup saved as hello.html:
<!-- hello.html -->
<div>
<ul>
<li class="item-0"><a href="link1.html">first item</a></li>
<li class="item-1"><a href="link2.html">second item</a></li>
<li class="item-inactive"><a href="link3.html"><span class="bold">third item</span></a></li>
<li class="item-1"><a href="link4.html">fourth item</a></li>
<li class="item-0"><a href="link5.html">fifth item</a></li>
</ul>
</div>
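from lxml import html

# Reach etree through lxml.html
etree = html.etree
# Parse the file into an ElementTree; the name "tree" avoids shadowing the imported html module
tree = etree.parse('./hello.html')
# Serialize the tree back to indented bytes, then decode them for printing
result = etree.tostring(tree, pretty_print=True)
print(result.decode('utf-8'))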
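Once the document is parsed, the usual next step is to query it. The following is a minimal sketch, not part of the original steps: it inlines the same markup as hello.html so it runs without the file, and it passes etree.HTMLParser() explicitly, which is an assumption made here so that real-world HTML that is not well-formed XML still parses. The helper names parse_html, link_targets and item_texts are hypothetical, chosen only for illustration.

```python
from lxml import etree

# Same markup as hello.html, inlined so the sketch runs on its own.
HELLO_HTML = """
<div>
<ul>
<li class="item-0"><a href="link1.html">first item</a></li>
<li class="item-1"><a href="link2.html">second item</a></li>
<li class="item-inactive"><a href="link3.html"><span class="bold">third item</span></a></li>
<li class="item-1"><a href="link4.html">fourth item</a></li>
<li class="item-0"><a href="link5.html">fifth item</a></li>
</ul>
</div>
"""


def parse_html(text):
    """Parse an HTML string and return the root element (hypothetical helper)."""
    # HTMLParser is lenient; it wraps the fragment in <html><body> as needed.
    return etree.fromstring(text, etree.HTMLParser())


def link_targets(root):
    """Return the href of every <a> that sits directly inside an <li>."""
    return root.xpath('//li/a/@href')


def item_texts(root, css_class):
    """Return the direct link text of <li> elements whose class equals css_class."""
    # $cls is an XPath variable, filled in from the keyword argument.
    return root.xpath('//li[@class=$cls]/a/text()', cls=css_class)


root = parse_html(HELLO_HTML)
print(link_targets(root))
print(item_texts(root, 'item-0'))
```

Note that a/text() only returns text that is a direct child of the link, so the third item, whose text sits inside a span, would need //li/a//text() instead.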