你是无意穿堂风 偏偏孤倨引山洪
data:image/s3,"s3://crabby-images/2b20e/2b20e32a05699ad13bb28737130b7afaeaa58767" alt=""
安装 requests
conda install requests
http://www.python-requests.org/en/master/
# -*- coding: utf-8 -*-
import requests
r = requests.get("http://www.mi.com")
print(r.status_code)
r.encoding = 'utf-8'
print(r.text)
requests 库7个主要方法:
data:image/s3,"s3://crabby-images/4c64e/4c64ef178fd2e5276f5b1e019c6acef33592b061" alt=""
requests - get()
r = requests.get(url)
r = requests.get("http://www.mi.com")
data:image/s3,"s3://crabby-images/ca770/ca770971fc04ae57e8b7e48f4252c0cc492ca20f" alt=""
data:image/s3,"s3://crabby-images/5f9e2/5f9e2ea04d2f9cdf49dc4d97ce136b8dafc6efc3" alt=""
data:image/s3,"s3://crabby-images/d6100/d6100bfd4c07717db8f1af796096a37c526e2f87" alt=""
data:image/s3,"s3://crabby-images/f0416/f04164b2bc9a0f9ea81437a35c73d2363ca192f9" alt=""
response 对象的属性:
data:image/s3,"s3://crabby-images/87d07/87d07b66266e106074554fbc81bb33f47d4760f8" alt=""
data:image/s3,"s3://crabby-images/cc0da/cc0da95f855198f49a6853c35086b45fceb71706" alt=""
理解response的编码:
data:image/s3,"s3://crabby-images/30476/304765c336486b966ba374eb03de5d046c8cef55" alt=""
爬取网页的通用代码框架:
- 网络连接有风险
- 异常处理很重要
理解Requests库的异常:
data:image/s3,"s3://crabby-images/9b02d/9b02dd79342b3fe229be223ccf6a60888c984ba4" alt=""
data:image/s3,"s3://crabby-images/89e5d/89e5d4f7726b93e863e6d709f2062436d3cf30b0" alt=""
# -*- coding: utf-8 -*-
import requests
def getHTMLText(url):
try:
r = requests.get(url, timeout=30)
r.raise_for_status()
r.encoding = r.apparent_encoding
return r.text
except:
return "产生异常"
if __name__ == "__main__":
url = "http://www.mi.com"
print(getHTMLText(url))
HTTP协议及Requests库方法:
Http协议:
data:image/s3,"s3://crabby-images/bbd62/bbd628abee0accc1e9163e004433e8220282b002" alt=""
data:image/s3,"s3://crabby-images/8cc6f/8cc6f8dd50c659736a87be22dbc501b54cd22014" alt=""
data:image/s3,"s3://crabby-images/971e8/971e8d05d3e9a145aee1048a5178da8bb169d86d" alt=""
Http协议对资源的操作:
data:image/s3,"s3://crabby-images/359b1/359b13035fd136878db504e8e3a85b15bee9273a" alt=""
data:image/s3,"s3://crabby-images/18e5c/18e5c8ec4ef470417a39d4a96a92b365f365ed7f" alt=""
http协议通过url对资源定位 通过以上6个操作方法对资源进行管理 每次操作都是独立无状态的(两次操作间没有关系)
data:image/s3,"s3://crabby-images/40570/405701dda82c3d905e1fd13d38a390e1ff4961a6" alt=""
Requests库的head():
# -*- coding: utf-8 -*-
import requests
r = requests.get("http://www.mi.com")
print(r.status_code)
r.encoding = 'utf-8'
print(r.headers)
Requests库的post():
data:image/s3,"s3://crabby-images/37c1b/37c1b861871f615b516fe22937e1e458769a7fdd" alt=""
# -*- coding: utf-8 -*-
import requests
payload = {'key1':'value1', 'key2':'value2'}
r = requests.post('http://httpbin.org/post', data=payload)
print(r.text)
print:
{
"args": {},
"data": "",
"files": {},
"form": {
"key1": "value1",
"key2": "value2"
},
"headers": {
"Accept": "*/*",
"Accept-Encoding": "gzip, deflate",
"Connection": "close",
"Content-Length": "23",
"Content-Type": "application/x-www-form-urlencoded",
"Host": "httpbin.org",
"User-Agent": "python-requests/2.14.2"
},
"json": null,
"origin": "43.247.4.53",
"url": "http://httpbin.org/post"
}
data:image/s3,"s3://crabby-images/43d97/43d975fdd530461a1e5f0bd75cb56f2b3cae7447" alt=""
Requests库的put():
data:image/s3,"s3://crabby-images/47a8b/47a8b2aa82e7066688d5a0ec064b1fd02cdfe744" alt=""
Requests库主要方法解析:
data:image/s3,"s3://crabby-images/0cac2/0cac23226df2c9a89ead1c7407e952d6b8f3faed" alt=""
data:image/s3,"s3://crabby-images/36bed/36bed793b0023bceb9f63d75fa0f46a37f83c816" alt=""
data:image/s3,"s3://crabby-images/78bb7/78bb7ec729dae3d366980a03780558ff0ae35b63" alt=""
data:image/s3,"s3://crabby-images/ee3a7/ee3a7a3aad617561258a580f8b12a94417614e1f" alt=""
data:image/s3,"s3://crabby-images/4391a/4391a44b2d2319acc1c4ee4120cf939fc02e21ef" alt=""
data:image/s3,"s3://crabby-images/03d40/03d40331ff8051397abff15fe87f9ec6963aa7f7" alt=""
data:image/s3,"s3://crabby-images/15041/1504150af6b3739131392e22687b98a4a3815886" alt=""
data:image/s3,"s3://crabby-images/2965d/2965d1f1a1e57687f16fc35315eda1de4852184b" alt=""
data:image/s3,"s3://crabby-images/93fc5/93fc5541855d2a40f1118f237bb747269f41cebe" alt=""
data:image/s3,"s3://crabby-images/f9e73/f9e7383032d97a8c39cd80a8d3d1fb4b639574cf" alt=""
data:image/s3,"s3://crabby-images/6d0bb/6d0bb8b3ae06e967df18945ec4a5473a15dc3bf1" alt=""
data:image/s3,"s3://crabby-images/7bbd6/7bbd6ce2bd8a1b336d5540ffbed30076fb93e749" alt=""
get最常用!
data:image/s3,"s3://crabby-images/2342f/2342f2859fdfdb647d09d1d7723fbb79bfe23838" alt=""
data:image/s3,"s3://crabby-images/f8004/f8004ceb2f81167e0aad90836a079fa173df051a" alt=""
data:image/s3,"s3://crabby-images/9396a/9396a1487ac126dc9fedd7d145660ac5259468cc" alt=""
data:image/s3,"s3://crabby-images/c239c/c239cbba9190093c5e342d4d5a3143fa63755a8e" alt=""
data:image/s3,"s3://crabby-images/f6fbe/f6fbe4752c13e474609958d6304bb0e95811d13c" alt=""
data:image/s3,"s3://crabby-images/d281b/d281b70bbd1f97bce9ac728fe28ddf1ed031f27d" alt=""
小结:
data:image/s3,"s3://crabby-images/cf1af/cf1af6435d569b784d5b6628fd43800cde7a3611" alt=""
data:image/s3,"s3://crabby-images/22555/225558fa6f771e41c024f80d74f05285c26482bc" alt=""
世界上所有的追求都是因为热爱
一枚爱编码 爱生活 爱分享的IT信徒
— hongXkeX
网友评论