课时19

作者: ooocoo | 来源:发表于2016-05-14 17:54 被阅读0次

    我的 代码

    import requestsfrom bs4 
    import BeautifulSoup
    import pymongo
    
    client = pymongo.MongoClient('localhost',27017)
    xiaozhu = client['xiaozhu']
    bnb_info = xiaozhu['bnb_info']
    
    def get_page_within(pages):    for page_num in range(1,pages+1):        wb_data = requests.get('http://bj.xiaozhu.com/search-duanzufang-p{}-0/'.format(page_num))        soup = BeautifulSoup(wb_data.text,'lxml')        titles = soup.select('span.result_title')        prices = soup.select('span.result_price > i')        for title, price in zip(titles,prices):            data = {                'title':title.get_text(),                'price':int(price.get_text())            }            bnb_info.insert_one(data)    print('Done')
    if i['price'] >= 500:    print(i)
    
    

    相关文章

      网友评论

          本文标题:课时19

          本文链接:https://www.haomeiwen.com/subject/fbymrttx.html