Scraping Qiushibaike with Python

Author: 奋斗live | Source: published 2018-05-09 23:36, read 0 times

The code below is the procedural version.

import urllib2
import re

page = 1
url = 'http://www.qiushibaike.com/hot/page/' + str(page)
# Send a browser-like User-Agent header with the request.
user_agent = 'Mozilla/4.0 (compatible; MSIE 5.5; Windows NT)'
headers = {'User-Agent': user_agent}
try:
    request = urllib2.Request(url, headers=headers)
    response = urllib2.urlopen(request)
    content = response.read().decode('utf-8')
    # Capture the text of the first <span> inside each <div class="content">.
    pattern = re.compile(r'<div class="content"[\s\S]+?<span>([\s\S]+?)</span>')
    items = re.findall(pattern, content)
    for item in items:
        print item
except urllib2.URLError, e:
    # HTTP errors carry a status code; lower-level failures carry a reason.
    if hasattr(e, "code"):
        print e.code
    if hasattr(e, "reason"):
        print e.reason
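
To see what the regular expression above actually captures without touching the network, here is a minimal sketch in Python 3 (the code above is Python 2). The sample markup, the `extract_items` helper, and the `<br/>` cleanup step are assumptions made for illustration; the real page layout may differ.

```python
import re

# Hypothetical sample markup, shaped the way the pattern expects.
SAMPLE_HTML = '''
<div class="content">
<span>
First joke text
</span>
</div>
<div class="content">
<span>Second joke<br/>with a line break</span>
</div>
'''

# Same pattern as in the article, written as a raw string.
PATTERN = re.compile(r'<div class="content"[\s\S]+?<span>([\s\S]+?)</span>')


def extract_items(html):
    """Return the cleaned text of each captured <span> (illustrative helper)."""
    items = []
    for raw in PATTERN.findall(html):
        text = re.sub(r'<br\s*/?>', '\n', raw)  # turn <br/> tags into newlines
        items.append(text.strip())              # drop surrounding whitespace
    return items


if __name__ == '__main__':
    for item in extract_items(SAMPLE_HTML):
        print(item)
```

The lazy quantifiers (`+?`) keep each match inside a single content block; a greedy `+` would run from the first `<div class="content"` to the last `</span>` on the page.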

Object-oriented version

import urllib2
import re


class QSBK:
    def __init__(self, url, headers):
        self.url = url
        self.headers = headers

    def request(self):
        # Build the request from the instance's own URL and headers.
        request = urllib2.Request(self.url, headers=self.headers)
        response = urllib2.urlopen(request)
        return response

    def decode(self):
        return self.request().read().decode('utf-8')

    def solve_data(self):
        # Capture the text of the first <span> inside each <div class="content">.
        pattern = re.compile(r'<div class="content"[\s\S]+?<span>([\s\S]+?)</span>')
        content = self.decode()
        items = re.findall(pattern, content)
        return items

    def print_data(self):
        data = self.solve_data()
        for item in data:
            print item


page = 1
url = 'http://www.qiushibaike.com/hot/page/' + str(page)

user_agent = 'Mozilla/4.0 (compatible; MSIE 5.5; Windows NT)'
headers = {'User-Agent': user_agent}

test = QSBK(url, headers)
test.print_data()
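
Because `urllib2` exists only in Python 2, the class above will not run on Python 3 as written. The following is one possible Python 3 port, offered as a sketch and not as the author's code: the class name `QSBK3`, the method names, and the injectable `opener` parameter are assumptions introduced here so the class can be exercised without network access. It also folds in the `URLError` handling from the procedural version.

```python
import re
import urllib.error
import urllib.request


class QSBK3:
    """Hypothetical Python 3 port of the QSBK class."""

    PATTERN = re.compile(r'<div class="content"[\s\S]+?<span>([\s\S]+?)</span>')

    def __init__(self, url, headers, opener=urllib.request.urlopen):
        self.url = url
        self.headers = headers
        self.opener = opener  # assumption: injectable so tests can avoid the network

    def fetch(self):
        """Download the page and decode it as UTF-8."""
        request = urllib.request.Request(self.url, headers=self.headers)
        with self.opener(request) as response:
            return response.read().decode('utf-8')

    def parse(self, html):
        """Return the stripped text of each captured <span>."""
        return [match.strip() for match in self.PATTERN.findall(html)]

    def items(self):
        """Fetch and parse; on a URL error, report it and return an empty list."""
        try:
            return self.parse(self.fetch())
        except urllib.error.URLError as e:
            # HTTPError adds a status code; plain URLError only has a reason.
            print(getattr(e, 'code', ''), getattr(e, 'reason', ''))
            return []
```

With a live connection, usage would mirror the original: `QSBK3(url, headers).items()` returns the list that `print_data` used to print. Whether the site still serves the markup this pattern expects is not something the article confirms.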

Article link: https://www.haomeiwen.com/subject/uusarftx.html