scrapy爬虫框架
初始化项目
1 | scrapy startproject <project-name> |
创建并运行爬虫
1 | scrapy genspider <spider-name> <spider-domin> |
Extra
- 日志
1 | # setting.py |
- headers
1 | # setting.py |
- pipelines
1 | # setting.py |
- 处理详情
1 | yield scrapy.Request( |
- 下一页
1 | yield scrapy.Request( |
All articles in this blog are licensed under CC BY-NC-SA 4.0 unless stating additionally.