遍历id
安装依赖
pip3 install -r requirements.txt
修改config.ini
[config]
ERR_FILE = err_list.txt
WKHTML2PDF_PATH = /usr/local/bin/wkhtmltopdf
PDF_PATH = wooyun
[log]
LOG_FILE = wooyun.log
ERR_FILE
:处理出错的id号WKHTML2PDF_PATH
:wkhtmltopdf二进制文件所在路径PDF_PATH
:存储pdf的目录log
:日志文件
启动爬虫
python3 wooyun_spider.py