lxml解析网页速度比BeautifulSoup快

lxml解析网页速度比BeautifulSoup快

作者: bbjoe | 来源:发表于2016-08-17 12:50 被阅读0次

我的代码：

# -*- coding: utf-8 -*-
import requests
from time import ctime
from lxml import etree
from bs4 import BeautifulSoup

url = 'http://www.cnblogs.com/descusr/archive/2012/06/20/2557075.html'
tries = 300
web_data = requests.get(url).text

# step 1
print('lxml start at:', ctime())
while tries > 0:
    lxml_page = etree.HTML(web_data)
    tries = tries - 1
print('lxml done at:', ctime())

# step 2
print('soup start at:', ctime())
while tries > 0:
    soup_page = BeautifulSoup(web_data, 'lxml')
    tries = tries - 1
print('soup done at:', ctime())

我是分步运行的：先注释掉step2，运行step1；之后注释掉1，运行2。新手轻拍

运行结果：

解析一个博客页面300次，Beautiful用了约8秒，lxml用了约1秒

BeautifulSoup.png

lxml.png

相关文章

网友评论

本文标题：lxml解析网页速度比BeautifulSoup快

本文链接：https://www.haomeiwen.com/subject/nkymsttx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

栏目导航

热点阅读

关于我们|服务条款|联系我们|lxml解析网页速度比BeautifulSoup快|投稿指南|网站地图|RSS订阅|排版工具|手机版

提供经典美文摘抄,优美散文欣赏,现代诗歌精选,短篇小说,心情随笔,表白情书范文,故事会在线阅读欣赏

Copyright © 2014-2023 Haomeiwen.com All Rights Reserved. 好美文阅读网版权所有

备案信息：桂公网安备 45052102000051号 · 桂ICP备13007215号-3

本站所收录作品、热点评论等信息部分来源互联网，目的只是为了系统归纳学习和传递资讯

所有作品版权归原创作者所有，与本站立场无关，如不慎侵犯了你的权益，请联系我们告知，我们将做删除处理！