学习资料收集自知乎专栏:
具体的学习路线是什么?
总体分为三个大方面:
-
简单的定向脚本爬虫(request —- bs4 —- re)
-
大型框架式爬虫(Scrapy框架为主)
-
浏览器模拟爬虫 (Selenium 模拟)
具体的步骤:
一:Beautiful Soup 爬虫
Ehco:从零开始写Python爬虫 --- 1.1 requests库的安装与使用zhuanlan.zhihu.com
![](https://img.haomeiwen.com/i19037128/e7478cdc18e2200b.jpg)
![](https://img.haomeiwen.com/i19037128/7ebfc96a5bddef5e.jpg)
![](https://img.haomeiwen.com/i19037128/b4d357b24e070dbc.jpg)
![](https://img.haomeiwen.com/i19037128/7a60891eb0e46a5a.jpg)
![](https://img.haomeiwen.com/i19037128/e96786fac4f66e5f.jpg)
![](https://img.haomeiwen.com/i19037128/71fe5cab011626c4.jpg)
![](https://img.haomeiwen.com/i19037128/d8a693d79cc6420d.jpg)
![](https://img.haomeiwen.com/i19037128/771a9caed1e38c88.jpg)
二: Scrapy 爬虫框架
Ehco:从零开始写Python爬虫 --- 2.1 Scrapy 爬虫框架的安装与基本介绍zhuanlan.zhihu.com
![](https://img.haomeiwen.com/i19037128/8b6f619655888f78.jpg)
![](https://img.haomeiwen.com/i19037128/c23f9a37d3831b14.jpg)
![](https://img.haomeiwen.com/i19037128/602b065c19a16c5d.jpg)
![](https://img.haomeiwen.com/i19037128/9cded6f3207a0039.jpg)
![](https://img.haomeiwen.com/i19037128/45c2a51c3b5f98fd.jpg)
![](https://img.haomeiwen.com/i19037128/fd3c57a17ec6df12.jpg)
三: 浏览器模拟爬虫
Ehco:从零开始写Python爬虫 --- 3.1 Selenium模拟浏览器zhuanlan.zhihu.com
![](https://img.haomeiwen.com/i19037128/f04c4c8d9b728af1.jpg)
![](https://img.haomeiwen.com/i19037128/8c648a6d19a938f2.jpg)
![](https://img.haomeiwen.com/i19037128/638128d120afa817.jpg)
四: 练手项目:
Ehco:从零开始写Python爬虫 --- 爬虫实践:螺纹钢数据&Cookieszhuanlan.zhihu.com
![](https://img.haomeiwen.com/i19037128/a58a4f220c8c392b.jpg)
![](https://img.haomeiwen.com/i19037128/b6528332f8f13960.jpg)
![](https://img.haomeiwen.com/i19037128/48dc5ea116e2839f.jpg)
![](https://img.haomeiwen.com/i19037128/3a076910eb53e1da.jpg)
![](https://img.haomeiwen.com/i19037128/4df7e38a91decfb8.jpg)
![](https://img.haomeiwen.com/i19037128/094aec1c0556e4b3.jpg)
![](https://img.haomeiwen.com/i19037128/733620267580dcb3.jpg)
![](https://img.haomeiwen.com/i19037128/6f42588531da3731.jpg)
![](https://img.haomeiwen.com/i19037128/0f2eedfabf4f84d4.jpg)
![](https://img.haomeiwen.com/i19037128/feec4d206692cd7a.jpg)
![](https://img.haomeiwen.com/i19037128/fd26b247a8cb8b21.jpg)
![](https://img.haomeiwen.com/i19037128/5e6665f75cc91bcf.jpg)
![](https://img.haomeiwen.com/i19037128/1a4c7c873892823b.jpg)
五: 自己写点小工具:
Ehco:爬虫存储海量数据太麻烦? 换个姿势试一试!zhuanlan.zhihu.com
![](https://img.haomeiwen.com/i19037128/97b53de124ac2be3.jpg)
![](https://img.haomeiwen.com/i19037128/5bc9227ed995e3e2.jpg)
网友评论