scrapy模拟登陆

作者: 薛落花随泪绽放 | 来源:发表于2017-11-05 14:12 被阅读8次

Scrapy基础——Cookies和Session
模拟登陆存在问题
scrapy 模拟登陆
scrapy模拟登陆
Scrapy框架--cookie的获取/传递/本地保存
scrapy模拟登陆(黑马教育)
30. scrapy模拟登陆
scrapy 模拟登陆2
Scrapy爬虫模拟登陆豆瓣
Scrapy模拟登陆豆瓣案例

scrapy startproject taoyun
cd taoyun
scrapy genspider -t basic login iqianyue.com

login.py

# -*- coding: utf-8 -*-
import scrapy
from scrapy.http import Request,FormRequest


class LoginSpider(scrapy.Spider):
    name = 'login'
    allowed_domains = ['iqianyue.com']
    #start_urls = ['http://iqianyue.com/']
    header = {"User-Agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.221 Safari/537.36 SE 2.X MetaSr 1.0"}
    # 编写start_requests()方法，第一次会默认调取该方法中的请求
    def start_requests(self):
        # 首次爬一次登录页，然后进入回调函数parse()
        return [Request("http://edu.iqianyue.com/index_user_login",meta={"cookiejar":1},callback=self.parse)]

    def parse(self, response):
        #设置要传递的post信息，此时没有验证码字段
        data= {
            "number":"1627697510",
            "passwd":"xh177151...",
            }

        print("登陆中...")
        #通过FormRequest.from_response()进行登陆
        return [FormRequest.from_response(response,
                                          #设置cookie信息
                                          meta={"cookiejar":response.meta["cookiejar"]},
                                          #设置headers信息模拟成浏览器
                                          headers=self.header,
                                          #设置post表单中的数据
                                          formdata=data,
                                          #设置回调函数，此时回调函数为next()
                                          callback=self.next,
                                          )]
    def next(self,response):
        data=response.body
        fh=open("D:/python/xue/a.html","wb")
        fh.write(data)
        fh.close()
        print(response.xpath("/html/head/title/text()").extract())
        yield Request("http://edu.iqianyue.com/index_user_index",callback=self.next2,meta={"cookiejar":True})
    def next2(self,response):
        data=response.body
        fh=open("D:/python/xue/b.html","wb")
        fh.write(data)
        fh.close()
        print(response.xpath("/html/head/title/text()").extract())

网友评论

python爬虫学习

本文标题：scrapy模拟登陆

本文链接：https://www.haomeiwen.com/subject/wplcmxtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

scrapy模拟登陆

login.py

相关文章

Scrapy基础——Cookies和Session

模拟登陆存在问题

scrapy 模拟登陆

scrapy模拟登陆

Scrapy框架--cookie的获取/传递/本地保存

scrapy模拟登陆(黑马教育)

30. scrapy模拟登陆

scrapy 模拟登陆2

Scrapy爬虫模拟登陆豆瓣

Scrapy模拟登陆豆瓣案例

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读

python爬虫学习