scrapy + selenium 爬取数据范例

2023-02-18 itknight Comments 0 Comment

从 stackoverflow上抄的，还没有验证可行性

import scrapy
from selenium import webdriver

class ProductSpider(scrapy.Spider):
    name = "product_spider"
    allowed_domains = ['ebay.com']
    start_urls = ['http://www.ebay.com/sch/i.html?_odkw=books&_osacat=0&_trksid=p2045573.m570.l1313.TR0.TRC0.Xpython&_nkw=python&_sacat=0&_from=R40']

    def __init__(self):
        self.driver = webdriver.Firefox()

    def parse(self, response):
        self.driver.get(response.url)

        while True:
            next = self.driver.find_element_by_xpath('//td[@class="pagn-next"]/a')

            try:
                next.click()

                # get the data and write it to scrapy items
            except:
                break

        self.driver.close()

我是能行CTO

因为喜欢，所以热爱！

scrapy + selenium 爬取数据范例

2023-02-18 itknight Comments 0 Comment

发表回复取消回复

发表回复 取消回复

发表回复取消回复