Fetching the 驾校一点通 question bank with requests and multiple processes in Python 3

Published: 2024/1/8 python 57 豆豆

Use the browser's developer tools to find the URL that serves the questions.

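Analyzing the question URL shows that index is the question id. The r parameter is a random number that can be generated, and the full list of URLs can be produced with range or by filling down a column in Excel.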

Analyze the fields of the returned JSON data.
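The URL pattern and the field inspection described above can be sketched roughly as follows. This is a minimal sketch under stated assumptions: build_urls and describe_fields are hypothetical helper names, the id range is a placeholder, r is assumed to need nothing more than a random float, and the sample payload is made up (only "question" appears in the script's comments).

```python
import json
import random

# Host and path come from the example URL in the script's comment. The scheme
# is left off because the fetch script prepends "http://" itself.
BASE = "mnks.jxedt.com/get_question"


def build_urls(first_id, last_id):
    """Return one scheme-less URL per question id in [first_id, last_id]."""
    urls = []
    for index in range(first_id, last_id + 1):
        r = random.random()  # assumption: r only needs to be some random float
        urls.append(f"{BASE}?r={r:.16f}&index={index}")
    return urls


def describe_fields(raw):
    """Map each top-level key of a JSON object to the type name of its value."""
    record = json.loads(raw)
    return {key: type(value).__name__ for key, value in record.items()}


# Placeholder id range; the article does not state the real one.
for url in build_urls(1, 3):
    print(url)

# Made-up payload for illustration only; the real field names may differ.
sample = '{"id": 3, "question": "example text", "options": ["A", "B"], "answer": "A"}'
for name, kind in describe_fields(sample).items():
    print(f"{name}: {kind}")
```

Writing the generated lines to a text file, one per line, gives the kind of input list the fetch script reads.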

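I wrote two separate scripts here: one fetches the data and the other converts it to Excel. This article mainly covers fetching the data with multiple processes.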
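The conversion script is not shown in the article. As a stand-in, here is a minimal sketch that turns the saved JSON lines into CSV, which Excel can open. It swaps in the standard csv module rather than writing a real .xlsx file, and json_lines_to_csv is a hypothetical name.

```python
import csv
import io
import json


def json_lines_to_csv(lines, out):
    """Write one CSV row per JSON object; columns are the union of all keys."""
    records = [json.loads(line) for line in lines if line.strip()]
    # Collect column names in first-seen order across all records.
    columns = []
    for record in records:
        for key in record:
            if key not in columns:
                columns.append(key)
    writer = csv.DictWriter(out, fieldnames=columns, restval="")
    writer.writeheader()
    for record in records:
        writer.writerow(record)
    return len(records)


# Demo with in-memory data so the sketch runs without any files.
demo_lines = ['{"id": 1, "question": "first"}', '{"id": 2, "answer": "B"}']
buffer = io.StringIO()
count = json_lines_to_csv(demo_lines, buffer)
print(count, "rows written")
print(buffer.getvalue())
```

For real data, pass the open data.json file as lines and an output file opened with newline="" and encoding="utf-8-sig", so that Excel detects the encoding of Chinese text.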

Development environment: Python 3.9.1 / Windows 10 / VS Code.

# coding: utf-8
import json
from concurrent.futures import ProcessPoolExecutor

import requests

# Example of a full question URL:
# url = 'http://mnks.jxedt.com/get_question?r=0.5376675619396274&index=3'

# Browser-like request header
hea = {'User-Agent': 'Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/77.0.3865.90 Safari/537.36'}


def load_urls(path='D:/YYFX/ip.txt'):
    # One URL per line, stored without the 'http://' prefix
    with open(path, 'r') as f:
        return [line.strip() for line in f if line.strip()]


def get_page(url):
    # Runs in a worker process; requests can also parse JSON itself via response.json()
    response = requests.get('http://%s' % url, headers=hea, timeout=30, verify=False)
    # Convert the bytes body to a string
    return response.content.decode('utf-8')


def read_data(future):
    # Callback: runs in the parent process once a request has finished
    try:
        state = json.loads(future.result())
    except Exception as exc:
        # A failed request or invalid JSON should not stop the other tasks
        print('request failed:', exc)
        return
    print(state)
    # Append one JSON object per line; ensure_ascii=False keeps Chinese text readable
    with open('data.json', 'a', encoding='utf-8') as f:
        f.write(json.dumps(state, ensure_ascii=False) + '\n')


def main():
    urls_list = load_urls()
    # Pool of 20 worker processes; leaving the with block waits for all tasks
    with ProcessPoolExecutor(20) as pool:
        for url in urls_list:
            done = pool.submit(get_page, url)
            done.add_done_callback(read_data)


if __name__ == '__main__':
    main()
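In the script, an error raised by get_page only surfaces when the callback calls future.result(). One possible variant, sketched here under assumptions, collects results with as_completed and records which URLs failed so they can be retried. fetch_all and fake_fetch are hypothetical names, and the demo swaps in ThreadPoolExecutor with a stand-in fetch function so that it runs without network access.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def fetch_all(executor, fetch, urls):
    """Submit fetch(url) for every url; return (results, failures) keyed by url."""
    futures = {executor.submit(fetch, url): url for url in urls}
    results, failures = {}, {}
    for future in as_completed(futures):
        url = futures[future]
        try:
            results[url] = future.result()
        except Exception as exc:  # one failed request should not stop the rest
            failures[url] = exc
    return results, failures


def fake_fetch(url):
    """Stand-in for get_page: fails for any url containing 'bad'."""
    if "bad" in url:
        raise ValueError("simulated failure for " + url)
    return '{"url": "%s"}' % url


with ThreadPoolExecutor(max_workers=4) as pool:
    ok, failed = fetch_all(pool, fake_fetch, ["a", "bad", "c"])
print("succeeded:", sorted(ok))
print("failed:", sorted(failed))
```

A ProcessPoolExecutor can be passed to fetch_all in the same way, provided the fetch function is defined at module level so it can be pickled, and the pool is created under the if __name__ == '__main__' guard on Windows.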
