作者garlic774 (蒜头)
看板Python
标题[问题] 请问批次抓取 哪边可以做修正呢?
时间Sun Sep 19 20:36:05 2021
各位板大好,目前小弟想要爬取1-n页的特定标题,可目前都只能抓到第n页
请问哪边可以做修正呢? 程式码如下:
page = 10
products = []
keyword = "益生菌"
for page in range(1, page+1):
url =
'
https://tw.mall.yahoo.com/search/product?p={}&pg={}'.format(keyword, page)
headers = {'User-Agent':"Mozilla/5.0 (Windows NT 10.0; Win64; x64)
AppleWebKit/537.36 (KHTML, like Gecko) Chrome/93.0.4577.63 Safari/537.36"}
r = requests.get(url, headers = headers )
soup = BeautifulSoup(r.text,"html.parser")
s = soup.find_all('span',class_="BaseGridItem__title___2HWui")
s
--
※ 发信站: 批踢踢实业坊(ptt.cc), 来自: 111.248.146.206 (台湾)
※ 文章网址: https://webptt.com/cn.aspx?n=bbs/Python/M.1632054967.A.4AF.html
1F:推 s0914714: 每次抓到的s要append到products 09/19 23:41
2F:→ garlic774: 成功了!!!谢谢 09/19 23:55