Notes, 2018-07-03

Author: kdyq007 | Published 2018-07-04 21:20

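I. Web scraping

Pay attention to any custom header parameters the site expects.
When the request body contains an unknown parameter, first search the page source for its value, then work out whether it comes from an earlier network response or is generated by JavaScript.
A successful login may be followed by several further authentication requests, so inspect the captured traffic carefully.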
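The search for an unknown request-body parameter can be sketched as follows. This is a minimal sketch, assuming the page embeds the value in a hidden form field; the HTML snippet, the field name `csrf_token`, and the helper `find_hidden_field` are hypothetical and only illustrate the idea.

```python
import re
from typing import Optional

# Hypothetical page source; a real page would be fetched over the network.
SAMPLE_HTML = '''
<form action="/login" method="post">
  <input type="hidden" name="csrf_token" value="a1b2c3">
  <input type="text" name="phone">
</form>
'''


def find_hidden_field(html: str, name: str) -> Optional[str]:
    """Return the value attribute of the <input> called `name`, or None.

    Assumes the name attribute appears before the value attribute.
    """
    pattern = r'<input[^>]*name="{}"[^>]*value="([^"]*)"'.format(re.escape(name))
    match = re.search(pattern, html)
    return match.group(1) if match else None


token = find_hidden_field(SAMPLE_HTML, "csrf_token")
print(token)
```

If the helper returns `None`, the value is probably not in the page itself, which points to an earlier network response or to JavaScript as the source.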

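II. MySQL storage engines

MyISAM: supports full-text indexes and is often fast for read-heavy workloads; it supports table-level locks only, with no row-level locks and no transactions.
Table lock usage:
LOCK TABLES table_name WRITE;
UNLOCK TABLES;
(MyISAM has no transactions, so a lock taken by SELECT ... FOR UPDATE does not outlive the statement. On InnoDB, select * from table_name for update; with no WHERE clause locks every row it scans, which behaves like a table lock.)

InnoDB: supports transactions, row-level locks, and table-level locks; full-text indexes are available only from MySQL 5.6 onward; it has historically been considered slower than MyISAM for simple read-only queries.
Row lock usage (inside a transaction, with an index on id so that only the matching row is locked):
select col_name from table_name where id = 99 for update;

While the lock is held, other connections that need the same rows wait; they proceed once the locking transaction commits or rolls back.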
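The row-lock pattern can be sketched from Python as follows. This is a sketch under assumptions, not a definitive implementation: `conn` is taken to be a DB-API 2.0 connection to an InnoDB table with autocommit turned off and the `%s` parameter style (as PyMySQL and mysqlclient use), and `update_with_row_lock` is a hypothetical helper name. `table_name` and `col_name` are the placeholders from the notes.

```python
def update_with_row_lock(conn, row_id, new_value):
    """Lock one row with SELECT ... FOR UPDATE, modify it, then commit.

    `conn` is assumed to be a DB-API 2.0 connection with autocommit off.
    """
    cursor = conn.cursor()
    try:
        # The row lock is taken here and held until commit() or rollback().
        cursor.execute(
            "SELECT col_name FROM table_name WHERE id = %s FOR UPDATE",
            (row_id,),
        )
        cursor.execute(
            "UPDATE table_name SET col_name = %s WHERE id = %s",
            (new_value, row_id),
        )
        conn.commit()      # releases the lock; waiting connections proceed
    except Exception:
        conn.rollback()    # also releases the lock
        raise
    finally:
        cursor.close()
```

Keeping the locked section short matters, because every other connection that touches the same row waits until the commit or rollback.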

III. requests

1. requests request methods

requests.get(url, params=None, **kwargs)
requests.post(url, data=None, json=None, **kwargs)
requests.put(url, data=None, **kwargs)
requests.head(url, **kwargs)
requests.delete(url, **kwargs)
requests.patch(url, data=None, **kwargs)
requests.options(url, **kwargs)

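All of the functions above are built on the function below (passing the method as a string makes it easy to choose the HTTP method dynamically):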
requests.request(method, url, **kwargs)
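Choosing the method by string can be sketched like this. To stay runnable without a server, the sketch builds the requests with `requests.Request(...).prepare()` instead of sending them; `build_request` is a hypothetical helper name, and the URL is the local test address used elsewhere in these notes.

```python
import requests


def build_request(method, url, **kwargs):
    """Prepare (but do not send) a request for an arbitrary HTTP method name."""
    return requests.Request(method=method, url=url, **kwargs).prepare()


# The method is plain data, so it can come from configuration or a loop.
for method in ("get", "post", "delete"):
    prepared = build_request(method, "http://127.0.0.1:8000/test/")
    print(prepared.method, prepared.url)
```

To actually send a request, the same arguments can be passed to `requests.request(method, url, **kwargs)`.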

2. requests parameter reference

:param method: method for the new :class:`Request` object. (HTTP method)
:param url: URL for the new :class:`Request` object. (target URL)
:param params: (optional) Dictionary or bytes to be sent in the query string for the :class:`Request`. (query-string parameters; encoded and appended to the URL)
:param data: (optional) Dictionary, bytes, or file-like object to send in the body of the :class:`Request`. (request body; a dict is form-encoded)
:param json: (optional) json data to send in the body of the :class:`Request`. (request body serialized as JSON; unrelated to params)
:param headers: (optional) Dictionary of HTTP Headers to send with the :class:`Request`. (HTTP request headers)
:param cookies: (optional) Dict or CookieJar object to send with the :class:`Request`. (cookies)
:param files: (optional) Dictionary of ``'name': file-like-objects`` (or ``{'name': file-tuple}``) for multipart encoding upload. (file upload; each value is a file-like object)
    ``file-tuple`` can be a 2-tuple ``('filename', fileobj)``, 3-tuple ``('filename', fileobj, 'content_type')``
    or a 4-tuple ``('filename', fileobj, 'content_type', custom_headers)``, where ``'content-type'`` is a string
    defining the content type of the given file and ``custom_headers`` a dict-like object containing additional headers
    to add for the file.
:param auth: (optional) Auth tuple to enable Basic/Digest/Custom HTTP Auth. (HTTP authentication)
:param timeout: (optional) How long to wait for the server to send data
    before giving up, as a float, or a :ref:`(connect timeout, read
    timeout) <timeouts>` tuple. (how long to wait, in seconds, before giving up)
:type timeout: float or tuple
:param allow_redirects: (optional) Boolean. Set to True if POST/PUT/DELETE redirect following is allowed. (whether to follow redirects; defaults to True, except for requests.head, where it defaults to False)
:type allow_redirects: bool
:param proxies: (optional) Dictionary mapping protocol to the URL of the proxy. (send the request through a proxy)
:param verify: (optional) whether the SSL cert will be verified. A CA_BUNDLE path can also be provided. Defaults to ``True``. (whether to verify the server's TLS certificate, or a path to a CA bundle)
:param stream: (optional) if ``False``, the response content will be immediately downloaded. (if True, the body is downloaded only when it is accessed)
:param cert: (optional) if String, path to ssl client cert file (.pem). If Tuple, ('cert', 'key') pair.
:return: :class:`Response <Response>` object
:rtype: requests.Response
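The difference between `params`, `data`, and `json` is easiest to see by inspecting prepared requests. The sketch below sends nothing over the network; it assumes the local test URL used elsewhere in these notes and simply prints where each argument ends up.

```python
import requests

URL = "http://127.0.0.1:8000/test/"

# params goes into the query string; data and json go into the body.
with_params = requests.Request("GET", URL, params={"k1": "v1"}).prepare()
with_data = requests.Request("POST", URL, data={"k1": "v1"}).prepare()
with_json = requests.Request("POST", URL, json={"k1": "v1"}).prepare()

print(with_params.url)    # the URL now carries ?k1=v1
print(with_params.body)   # no body for the params-only request
print(with_data.body, with_data.headers["Content-Type"])
print(with_json.body, with_json.headers["Content-Type"])
```

A dict passed as `data` is form-encoded and labelled `application/x-www-form-urlencoded`, while `json` is serialized and labelled `application/json`.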

3. Parameter examples

import requests


def param_method_url():
    # requests.request(method='get', url='http://127.0.0.1:8000/test/')
    # requests.request(method='post', url='http://127.0.0.1:8000/test/')
    pass

def param_param():
    # - may be a dict
    # - may be a string
    # - may be bytes (ASCII characters only)

    # requests.request(method='get',
    # url='http://127.0.0.1:8000/test/',
    # params={'k1': 'v1', 'k2': '水电费'})

    # requests.request(method='get',
    # url='http://127.0.0.1:8000/test/',
    # params="k1=v1&k2=水电费&k3=v3&k3=vv3")

    # requests.request(method='get',
    # url='http://127.0.0.1:8000/test/',
    # params=bytes("k1=v1&k2=k2&k3=v3&k3=vv3", encoding='utf8'))

    # Wrong: the bytes contain non-ASCII characters
    # requests.request(method='get',
    # url='http://127.0.0.1:8000/test/',
    # params=bytes("k1=v1&k2=水电费&k3=v3&k3=vv3", encoding='utf8'))
    pass


def param_data():
    # may be a dict
    # may be a string
    # may be bytes
    # may be a file object

    # requests.request(method='POST',
    # url='http://127.0.0.1:8000/test/',
    # data={'k1': 'v1', 'k2': '水电费'})

    # A string body is sent as-is; form fields are separated by '&'.
    # requests.request(method='POST',
    # url='http://127.0.0.1:8000/test/',
    # data="k1=v1&k2=v2&k3=v3&k3=v4"
    # )

    # requests.request(method='POST',
    # url='http://127.0.0.1:8000/test/',
    # data="k1=v1&k2=v2&k3=v3&k3=v4",
    # headers={'Content-Type': 'application/x-www-form-urlencoded'}
    # )

    # requests.request(method='POST',
    # url='http://127.0.0.1:8000/test/',
    # data=open('data_file.py', mode='rb'), # file contents: k1=v1&k2=v2&k3=v3&k3=v4
    # headers={'Content-Type': 'application/x-www-form-urlencoded'}
    # )
    pass


def param_json():
    # The value passed as json is serialized to a string with json.dumps(...),
    # sent as the request body, and Content-Type is set to 'application/json'.
    requests.request(method='POST',
                     url='http://127.0.0.1:8000/test/',
                     json={'k1': 'v1', 'k2': '水电费'})


def param_headers():
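    # Send custom request headers to the server.
    # Note: an explicit Content-Type header takes precedence over the one json= would set,
    # so this body is JSON labelled as form data.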
    requests.request(method='POST',
                     url='http://127.0.0.1:8000/test/',
                     json={'k1': 'v1', 'k2': '水电费'},
                     headers={'Content-Type': 'application/x-www-form-urlencoded'}
                     )


def param_cookies():
    # Send cookies to the server
    requests.request(method='POST',
                     url='http://127.0.0.1:8000/test/',
                     data={'k1': 'v1', 'k2': 'v2'},
                     cookies={'cook1': 'value1'},
                     )
    # A CookieJar also works (the dict form is a wrapper around it)
    from http.cookiejar import CookieJar
    from http.cookiejar import Cookie

    obj = CookieJar()
    obj.set_cookie(Cookie(version=0, name='c1', value='v1', port=None, domain='', path='/', secure=False, expires=None,
                          discard=True, comment=None, comment_url=None, rest={'HttpOnly': None}, rfc2109=False,
                          port_specified=False, domain_specified=False, domain_initial_dot=False, path_specified=False)
                   )
    requests.request(method='POST',
                     url='http://127.0.0.1:8000/test/',
                     data={'k1': 'v1', 'k2': 'v2'},
                     cookies=obj)


def param_files():
    # Send a file
    # file_dict = {
    # 'f1': open('readme', 'rb')
    # }
    # requests.request(method='POST',
    # url='http://127.0.0.1:8000/test/',
    # files=file_dict)

    # Send a file with a custom file name
    # file_dict = {
    # 'f1': ('test.txt', open('readme', 'rb'))
    # }
    # requests.request(method='POST',
    # url='http://127.0.0.1:8000/test/',
    # files=file_dict)

    # Send in-memory content with a custom file name
    # file_dict = {
    # 'f1': ('test.txt', "hahsfaksfa9kasdjflaksdjf")
    # }
    # requests.request(method='POST',
    # url='http://127.0.0.1:8000/test/',
    # files=file_dict)

    # Send in-memory content with a custom file name, content type, and extra headers
    # file_dict = {
    #     'f1': ('test.txt', "hahsfaksfa9kasdjflaksdjf", 'application/text', {'k1': '0'})
    # }
    # requests.request(method='POST',
    #                  url='http://127.0.0.1:8000/test/',
    #                  files=file_dict)

    pass


def param_auth():
    from requests.auth import HTTPBasicAuth, HTTPDigestAuth

    ret = requests.get('https://api.github.com/user', auth=HTTPBasicAuth('wupeiqi', 'sdfasdfasdf'))
    print(ret.text)

    # ret = requests.get('http://192.168.1.1',
    # auth=HTTPBasicAuth('admin', 'admin'))
    # ret.encoding = 'gbk'
    # print(ret.text)

    # ret = requests.get('http://httpbin.org/digest-auth/auth/user/pass', auth=HTTPDigestAuth('user', 'pass'))
    # print(ret)
    #


def param_timeout():
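    # A single number applies to both the connect and the read timeout (seconds).
    # ret = requests.get('http://google.com/', timeout=1)
    # print(ret)

    # A tuple is (connect timeout, read timeout).
    # ret = requests.get('http://google.com/', timeout=(5, 1))
    # print(ret)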
    pass


def param_allow_redirects():
    ret = requests.get('http://127.0.0.1:8000/test/', allow_redirects=False)
    print(ret.text)


def param_proxies():
    # proxies = {
    # "http": "61.172.249.96:80",
    # "https": "http://61.185.219.126:3128",
    # }

    # proxies = {'http://10.20.1.128': 'http://10.10.1.10:5323'}

    # ret = requests.get("http://www.proxy360.cn/Proxy", proxies=proxies)
    # print(ret.headers)


    # from requests.auth import HTTPProxyAuth
    #
    # proxyDict = {
    # 'http': '77.75.105.165',
    # 'https': '77.75.105.165'
    # }
    # auth = HTTPProxyAuth('username', 'mypassword')
    #
    # r = requests.get("http://www.google.com", proxies=proxyDict, auth=auth)
    # print(r.text)

    pass


def param_stream():
    ret = requests.get('http://127.0.0.1:8000/test/', stream=True)
    print(ret.content)
    ret.close()

    # from contextlib import closing
    # with closing(requests.get('http://httpbin.org/get', stream=True)) as r:
    #     # Process the response here.
    #     for i in r.iter_content():
    #         print(i)


def requests_session():
    import requests

    session = requests.Session()

    ### 1. Visit any page first to obtain a cookie

    i1 = session.get(url="http://dig.chouti.com/help/service")

    ### 2. Log in, sending the cookie from step 1; the backend authorizes the gpsd value in that cookie
    i2 = session.post(
        url="http://dig.chouti.com/login",
        data={
            'phone': "8615131255089",
            'password': "xxxxxx",
            'oneMonth': ""
        }
    )

    i3 = session.post(
        url="http://dig.chouti.com/link/vote?linksId=8589623",
    )
    print(i3.text)
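Why the session example works can be seen offline: a `Session` keeps its headers and cookies and merges them into every request it prepares. The sketch below is illustrative only; the cookie value and the User-Agent string are placeholders, and nothing is sent over the network.

```python
import requests

session = requests.Session()
session.headers.update({"User-Agent": "notes-example/0.1"})  # hypothetical value
session.cookies.set("gpsd", "abc123")                        # placeholder cookie

# prepare_request merges the session's headers and cookies into the request.
prepared = session.prepare_request(
    requests.Request("POST", "http://127.0.0.1:8000/test/", data={"k1": "v1"})
)
print(prepared.headers["User-Agent"])
print(prepared.headers["Cookie"])
```

In the login flow above the cookie is not set by hand; the session stores it from the first response and sends it back automatically on the later requests.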
