美文网首页Python三期爬虫作业
【Python爬虫】爬取城市传奇参赛球队相关信息

【Python爬虫】爬取城市传奇参赛球队相关信息

作者: DoctorLDQ | 来源:发表于2017-08-08 18:05 被阅读24次
import requests
from bs4 import BeautifulSoup
import json


team_base_url='http://www.ufball.com/match/join_teams_json.htm?divisionId=874902863023837184&page='     #在此基础上对参赛球队信息进行获取
team_player_base_url='http://www.ufball.com/match/team_players.htm?divisionId=874902863023837184&teamId='   #获取球员的相关信息

def get_team_info(url):
    for i in range(1,11):
        r=requests.get(url+str(i))
        team_data=json.loads(r.text)
        for i in range(len(team_data['data']['list'])):
            teamName = team_data['data']['list'][i]['teamName']
            teamID = team_data['data']['list'][i]['teamId']
            get_team_player_info(teamID,teamName)

def get_team_player_info(ID,name):
    r=requests.get(team_player_base_url+ID)
    soup=BeautifulSoup(r.text,'lxml')
    # playerName=soup.select('span[class~=player-name-value]')
    # print(playerName)
    for player in soup.find_all('div',class_='player-detail'):
        playerName=player.find('span',class_='player-name-value').text
        # print(playerName)
        playerNum=player.find('span',class_='player-num-value').text
        playerAddress=player.find('span',class_='player-address-value').text
        playerAge=player.find('span',class_='player-age-value').text
        playerHeight=player.find('span',class_='player-height-value').text
        playerWeight=player.find('span',class_='player-weight-value').text
        print(name + '  '+playerName + '  ' + playerNum + '  ' + playerAddress +  '  ' + playerAge + ' ' + playerHeight + '  ' + playerWeight + ' ')

get_team_info(team_base_url)
3.png

相关文章

网友评论

    本文标题:【Python爬虫】爬取城市传奇参赛球队相关信息

    本文链接:https://www.haomeiwen.com/subject/ukcxrxtx.html