BeautifulSoup 爬虫实例-51job

BeautifulSoup 爬虫实例-51job

作者: haokeed | 来源:发表于2019-05-18 11:02 被阅读0次

BeautifulSoup 爬虫实例-51job
beautifulsoup教程
基于python3的selenium线性化编程到二次封装：登录系
Python+PhantomJS+selenium+Beauti
BeautifulSoup requests 爬虫初体验
Python 爬虫
爬虫2
爬虫
python八爬虫框架
Python爬虫入门（urllib+Beautifulsoup）

import requests
from bs4 import BeautifulSoup

url="https://search.51job.com/list/010000,000000,0000,00,9,99,Java%2520%25E5%25BC%2580%25E5%258F%2591,2,1.html"
res=requests.get(url)
res.encoding="gbk"
print(res)

# create对象
soup=BeautifulSoup(res.text)

# 获取职位名
position_tag=soup.find_all("p",class_="t1") # 这里class是关键字 这里需要的是属性 所以系统中加了一个下划线来区分属性 这里t1不考虑空格
position=[]
for i in range(len(position_tag)):
    position.append(position_tag[i].a["title"])
print(position)

# 获取公司名
company_tag=soup.find_all("span",{"class":"t2"}) # 
company=[]
for i in range(len(company_tag)-1):
    company.append(company_tag[i+1].a["title"])
print(company)

# 获取工作地点
addr_tag=soup.find_all("span",{"class":"t3"}) # 
addr=[]
for i in range(len(addr_tag)-1):
    addr.append(addr_tag[i+1].string)
print(addr)

# 获取工资
salary_tag=soup.find_all("span",{"class":"t4"}) # 
salary=[]
for i in range(len(salary_tag)-1):
    salary.append(salary_tag[i+1].string)
print(salary)

import pandas as pd
from pandas import DataFrame
jobinfo=DataFrame([position,company,addr,salary]).T
jobinfo.columns=["职位","公司","地点","工资"]
jobinfo.head()

jobinfo.describe()

image.png

image.png

image.png

image.png

相关文章

BeautifulSoup 爬虫实例-51job
beautifulsoup教程
beautifulsoup教程 BeautifulSoup4是爬虫必学的技能。BeautifulSoup最主要的功...
基于python3的selenium线性化编程到二次封装：登录系
实例：登录51job part1:线性化编程 import... #登陆51job dr = webdriver....
Python+PhantomJS+selenium+Beauti
Python+PhantomJS+selenium+BeautifulSoup实现简易网络爬虫简易网络小爬虫，目...
BeautifulSoup requests 爬虫初体验
BeautifulSoup requests 爬虫初体验说爬虫不得不提python 常用的Python爬虫库(摘...
Python 爬虫
Python 爬虫 urllib BeautifulSoup re datetime random json
爬虫2
爬虫之 beautifulsoup BeautifulSoup3目前已经停止开发，推荐现在的项目使用Beautif...
爬虫
爬虫之 beautifulsoup BeautifulSoup3目前已经停止开发，推荐现在的项目使用Beautif...
python八爬虫框架
爬虫框架 BeautifulSoup 功能BeautifulSoup是用来从HTML或XML中提取数据的Pytho...
Python爬虫入门（urllib+Beautifulsoup）
Python爬虫入门（urllib+Beautifulsoup）本文包括：1、爬虫简单介绍2、爬虫架构三大模块3...

网友评论

本文标题：BeautifulSoup 爬虫实例-51job

本文链接：https://www.haomeiwen.com/subject/plgzaqtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

栏目导航

热点阅读

关于我们|服务条款|联系我们|BeautifulSoup 爬虫实例-51job|投稿指南|网站地图|RSS订阅|排版工具|手机版

提供经典美文摘抄,优美散文欣赏,现代诗歌精选,短篇小说,心情随笔,表白情书范文,故事会在线阅读欣赏

Copyright © 2014-2023 Haomeiwen.com All Rights Reserved. 好美文阅读网版权所有

备案信息：桂公网安备 45052102000051号 · 桂ICP备13007215号-3

本站所收录作品、热点评论等信息部分来源互联网，目的只是为了系统归纳学习和传递资讯

所有作品版权归原创作者所有，与本站立场无关，如不慎侵犯了你的权益，请联系我们告知，我们将做删除处理！