糗事百科爬虫

作者: 年画儿 | 来源:发表于2019-08-03 12:16 被阅读0次

python 3 爬糗事百科
使用Beautifulsoup和re爬取糗事百科笑话
糗事百科爬虫源码
Python爬虫基础教程（三）
Python爬虫小实例
爬虫学习之糗事百科
Scrapy爬虫项目
糗事百科爬虫
Python爬虫实战
糗事百科爬虫

#encoding: utf-8

import re
import requests

def parse_page(url):
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/62.0.3202.94 Safari/537.36'
    }
    response = requests.get(url,headers)
    text = response.text
    # re.S = re.DOTALL
    contents = re.findall(r'<div\sclass="content">.*?<span>(.*?)</span>',text,re.DOTALL)
    duanzi = []
    for content in contents:
#        x = content
        x = re.sub(r'<.*?>','',content)
        duanzi.append(x.strip())
        print(x.strip())
        print('='*50)


def main():
    url = 'https://www.qiushibaike.com/text/page/1/'
    for x in range(1,10):
        url = 'https://www.qiushibaike.com/text/page/%s/' % x
        parse_page(url)

if __name__ == '__main__':
    main()

网友评论

本文标题：糗事百科爬虫

本文链接：https://www.haomeiwen.com/subject/xwkjdctx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

糗事百科爬虫

相关文章

python 3 爬糗事百科

使用Beautifulsoup和re爬取糗事百科笑话

糗事百科爬虫源码

Python爬虫基础教程（三）

Python爬虫小实例

爬虫学习之糗事百科

Scrapy爬虫项目

糗事百科爬虫

Python爬虫实战

糗事百科爬虫

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读