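Dependencies for UA traffic identifier information
1. The crawler maintains UA traffic identifiers
A pool that maps each traffic identifier to its Chinese display name must be maintained, so that the capability can be packaged as a feature and productized. For example: micromessenger maps to 微信 (WeChat); meituan maps to 美团app (the Meituan app).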
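The identifier pool described above can be sketched as a simple lookup table. This is a minimal illustration, not the actual implementation: the names UA_FLAG_NAMES and display_name are hypothetical, the table holds only the two entries named in the text, and case-insensitive lookup is an assumption.

```python
from typing import Optional

# Traffic identifier -> Chinese display name.
# Only the two entries mentioned in the text are listed here; a real pool
# would presumably be loaded from maintained configuration (assumption).
UA_FLAG_NAMES = {
    "micromessenger": "微信",
    "meituan": "美团app",
}


def display_name(ua_flag: str) -> Optional[str]:
    """Return the display name for a traffic identifier, or None if unknown."""
    # Normalizing to lowercase is an assumption, made so that
    # "MicroMessenger" and "micromessenger" resolve to the same entry.
    return UA_FLAG_NAMES.get(ua_flag.strip().lower())
```

Returning None for unknown identifiers keeps unmapped traffic visible, so new identifiers can be added to the pool as they appear.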
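2. The crawler maintains a list of embedded browsers
The traffic source is identified from the special embedded browser named in the UA, as in the following UA: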
Mozilla/5.0 (Linux; U; Android 9; zh-CN; V1824A Build/PKQ1.181216.001) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/57.0.2987.108 UCBrowser/12.8.2.1062 Mobile Safari/537.36
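One way to spot an embedded browser in a UA like the one above is to scan its product/version tokens against a maintained list. The sketch below assumes that approach. EMBEDDED_BROWSERS, PRODUCT_TOKEN and find_embedded_browser are hypothetical names, and the list contains only UCBrowser, taken from the sample UA.

```python
import re
from typing import Optional, Tuple

# Matches "Name/1.2.3"-style product tokens; the version must start with a digit.
PRODUCT_TOKEN = re.compile(r"([A-Za-z][A-Za-z0-9_.]*)/([0-9][0-9A-Za-z_.]*)")

# Lowercased token -> canonical embedded-browser name.
# Only UCBrowser (from the sample UA) is listed; the real list is maintained
# by the crawler and is not given in the text.
EMBEDDED_BROWSERS = {"ucbrowser": "UCBrowser"}


def find_embedded_browser(ua: str) -> Optional[Tuple[str, str]]:
    """Return (browser name, version) for the first known embedded browser."""
    for name, version in PRODUCT_TOKEN.findall(ua):
        canonical = EMBEDDED_BROWSERS.get(name.lower())
        if canonical is not None:
            return canonical, version
    return None
```

Generic tokens such as Chrome or Safari are deliberately left out of the list, since nearly every Android WebView UA carries them and they say little about the traffic source.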
Sample parsing results are listed below:
Example 1
user_agent:
Mozilla/5.0 (Linux; Android 10; PCKM00 Build/QKQ1.190915.002; wv) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/78.0.3904.62 XWEB/2693 MMWEBSDK/200601 Mobile Safari/537.36 MMWEBID/6373 MicroMessenger/7.0.16.1700(0x27001035) Process/appbrand0 WeChat/arm64 NetType/4G Language/zh_CN ABI/arm64 miniProgram
Ua_key:wx_micromessenger
Example 2
user_agent:
Mozilla/5.0 (Linux; Android 10; V1911A Build/QP1A.190711.020; wv) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/78.0.3904.96 Mobile Safari/537.36 hap/1.8/vivo com.vivo.hybrid/1.8.4.701 com.yslqo.bettersaying/1.5.0
Ua_key:vivo.hybrid
Example 3
user_agent:
Mozilla/5.0 (Linux; U; Android 9; zh-CN; COR-AL10 Build/HUAWEICOR-AL10) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/69.0.3497.100 UWS/3.22.1.66 Mobile Safari/537.36 AliApp(Youku/9.7.0) UCBS/2.11.1.1 TTID/227200@youku_android_9.7.0 WindVane/8.5.0 Youku/9.7.0 (Android 9; Bridge_SDK; GUID de9539ad1c176e0b42a3ce1218a65428; UTDID XB4qYTG10BgDAISVtvbDch5C; packageName com.youku.phone; appKey 23570660;)
Ua_key:youku
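The three examples above are consistent with a simple ordered rule table that maps a marker found in the UA to a Ua_key. The sketch below is inferred from those examples only. The markers, their order, and the names UA_KEY_RULES and resolve_ua_key are assumptions; the actual parsing rules are not stated in the text.

```python
from typing import List, Optional, Tuple

# Ordered (marker, ua_key) rules; the first marker found in the UA wins.
# Markers are lowercase substrings inferred from the three examples.
UA_KEY_RULES: List[Tuple[str, str]] = [
    ("micromessenger", "wx_micromessenger"),
    ("com.vivo.hybrid", "vivo.hybrid"),
    ("youku", "youku"),
]


def resolve_ua_key(user_agent: str) -> Optional[str]:
    """Return the Ua_key for a UA string, or None if no rule matches."""
    ua = user_agent.lower()
    for marker, ua_key in UA_KEY_RULES:
        if marker in ua:
            return ua_key
    return None
```

Keeping the rules in an ordered list lets more specific markers be placed ahead of generic ones when a UA carries several app tokens at once.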