python 抓取网页

作者: 谢贤byte | 来源:发表于2020-09-03 16:22 被阅读0次

Python实用练手小案例
python抓取网页
python 抓取网页
Python 的简单爬虫
Python爬虫urllib2库的基本使用系列(三)
为什么说python适合写爬虫
Python爬虫基础
【Python】抓取网页信息
python爬虫(四)_urllib2库的基本使用
Python网页decode解码报错UnicodeDecodeE

import json

import urllib.request

import urllib.parse

#获取豆瓣json数据

url = "https://movie.douban.com/j/search_subjects"

params = {

    'type': 'movie',

    'tag': '豆瓣高分',

    'sort': 'recommend',

    'page_limit': 20,

    'page_start': 0

}

headers = {

    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.106 Safari/537.36'

}

params_encode = urllib.parse.urlencode(params).encode('utf-8')

# print(params_encode)

# url = url + '?' + params_encode

req = urllib.request.Request(url, params_encode, headers)

with urllib.request.urlopen(req) as response:

    data = response.read().decode('utf-8')

print(data)

# with open('f:/douban.json', 'w') as file:

#    json.dump(data, file, ensure_ascii=False

网友评论

本文标题：python 抓取网页

本文链接：https://www.haomeiwen.com/subject/ldkosktx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

python 抓取网页

相关文章

Python实用练手小案例

python抓取网页

python 抓取网页

Python 的简单爬虫

Python爬虫urllib2库的基本使用系列(三)

为什么说python适合写爬虫

Python爬虫基础

【Python】抓取网页信息

python爬虫(四)_urllib2库的基本使用

Python网页decode解码报错UnicodeDecodeE

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读