爬虫再学习_urllibrequest

作者: 黄yy家的jby | 来源:发表于2021-01-19 22:06 被阅读0次

爬虫再学习_urllibrequest
爬虫入门
资料
Python爬虫学习（十六）初窥Scrapy
Python代理IP爬虫的简单使用
Python爬虫学习系列教程
Python爬虫学习之小结（一）
python爬虫学习-day7-实战
Python 基础爬虫目录
python爬虫学习-day5-selenium


import urllib.request

# 获取一个get请求
response = urllib.request.urlopen('https://www.baidu.com') # response 是个object
print(response.read().decode('utf-8')) # 对获取到的网页源码进行utf-8解码

# 获取一个post请求
import urllib.parse
data = bytes(urllib.parse.urlencode({'user':'test','password':'112233'}),encoding='utf-8')  #转换二进制的包
response = urllib.request.urlopen('https://httpbin.org/post', date=date) # 这个常用post测试网站
print(response.read().decode('utf-8')) # 对获取到的网页源码进行utf-8解码


# 超时处理
response = urllib.request.urlopen('https://www.baidu.com',timeout=0.1) # response 是个object

# 其他属性
response.status
response.getheaders()
response.getheader('Server')

# 模拟浏览器
url = '''https://www.douban.com'''
headers = {
    'User-Agent':'...'
}
req = urllib.request.Request(url=url,data=data,headers=headers)
response = urllib.request.urlopen(req)

# user-agent 的找法： 审阅元素 -- net_work -- 刷新 -- 最前面点击 -- header -- name地方点击 -- 下拉最后寻找user-agent -- 复制即可

网友评论

本文标题：爬虫再学习_urllibrequest

本文链接：https://www.haomeiwen.com/subject/tegczktx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

爬虫再学习_urllibrequest

相关文章

爬虫再学习_urllibrequest

爬虫入门

资料

Python爬虫学习（十六）初窥Scrapy

Python代理IP爬虫的简单使用

Python爬虫学习系列教程

Python爬虫学习之小结（一）

python爬虫学习-day7-实战

Python 基础爬虫目录

python爬虫学习-day5-selenium

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读