美文网首页
Beautiful Soup的简单实例

Beautiful Soup的简单实例

作者: 十八度的帝都 | 来源:发表于2017-10-15 22:36 被阅读50次

简单记录一下,bs的使用

# -*- coding: utf-8 -*-
import requests
import bs4

goal_url = "https://baike.baidu.com/item/%E8%88%8C%E5%B0%96%E4%B8%8A%E7%9A%84%E4%B8%AD%E5%9B%BD/9081375"

def start_parse(url):
    print "开始获取(%s)内容" % url
    response = requests.get(url)
    print "获取网页内容完毕"
    soup = bs4.BeautifulSoup(response.content.decode("utf-8"),'lxml')
    # soup = bs4.BeautifulSoup(response.text);
    # 为了防止漏掉调用close方法,这里使用了with语句
    # 写入到文件中的编码为utf-8
    with open('archives.txt', 'w') as f:
        for archive in soup.select("dd"):
            f.write(archive.get_text().encode('utf-8'))
            print archive.get_text().encode('utf-8')

if __name__ == '__main__':
    start_parse(goal_url)

相关文章

网友评论

      本文标题:Beautiful Soup的简单实例

      本文链接:https://www.haomeiwen.com/subject/shdhuxtx.html