美文网首页
2018-06-22

2018-06-22

作者: lune819 | 来源:发表于2018-06-22 00:27 被阅读0次
    1. Install 2 python packages:

    $ sudo pip install requests
    $ sudo easy_install beautifulsoup4

    1. Creat test.py

    <pre>

    coding=utf-8

    import requests
    from bs4 import BeautifulSoup

    get url

    def get_html(url):
    response = requests.get(url)
    response.encoding = 'utf-8'
    return response.text

    get title

    def get_title(html):
    soup = BeautifulSoup(html, 'html.parser')
    soup.select('p')[0].get_text()
    title_content = soup.select('title')[0].get_text()
    return title_content

    get text

    def print_p(html):
    soup = BeautifulSoup(html, 'html.parser')
    for p in soup.select('p'):
    print p.get_text()

    url = "http://www.cityu.edu.hk/"
    html = get_html(url)
    title_content = get_title(html)
    print title_content
    print_p(html)
    </pre>

    3.Go to folder of test.py then execute
    $ python test.py

    1. Output


      Untitled.png

    相关文章

      网友评论

          本文标题:2018-06-22

          本文链接:https://www.haomeiwen.com/subject/cbshyftx.html