Author: ctr1 (【积π】)
Board: Python
Title: [Question] Getting the arguments passed inside onclick when scraping
Time: Thu Feb 20 10:00:37 2020
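A question for the scraping veterans on this board:
how can I extract each of the values inside tr > onclick="goDetail()" separately,
such as the part highlighted in red in the code below?
Thanks in advance to all the scraping veterans.
---Code---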
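import requests
from bs4 import BeautifulSoup

url = '(omitted)'
params = '(omitted)'
# POST the form data and parse the returned page
html = requests.post(url, data=params)
soup = BeautifulSoup(html.text, 'html.parser')
# Grab the <tbody> nested under the element with id="result"
result = soup.select_one('div #result tbody')
print(result)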
[out:]
<tbody>
<tr class="td_odd" onclick="goDetail(
'101','70-1-44-1(035)')"
style="cursor:pointer;">
<td align="center" valign="middle">1</td>
<td align="center" class="td-line" valign="middle">效期内</td>
</tr>
<tr class="td_event" onclick="goDetail(
'301','40-107-4-1(9918)')"
style="cursor:pointer;">
<td align="center" valign="middle">2</td>
<td align="center" class="td-line" valign="middle">效期内</td>
</tr>
<tr class="td_odd" onclick="goDetail(
'502','53-1-42-1(531)')"
style="cursor:pointer;">
<td align="center" valign="middle">1</td>
<td align="center" class="td-line" valign="middle">效期内</td>
</tr>
</tbody>
--
※ Origin: 批踢踢实业坊 (ptt.cc), From: 210.242.44.65 (Taiwan)
※ Article URL: https://webptt.com/cn.aspx?n=bbs/Python/M.1582164039.A.948.html
1F:→ vi000246: Just brute-force it with a regular expression 02/20 10:46
2F:push yushes920179: Pull that string out first, then run re on it 02/20 17:05
4F:→ froce: 5066ea9f/?m=Saved%20fiddle 02/20 19:04
5F:→ froce: Why brute-force it? onclick is just another attr 02/20 19:05
8F:push jiyu520: Grab the attribute outerHTML 02/21 03:22
9F:→ jiyu520: Grab the attribute, or just look at the outerHTML directly 02/21 03:23
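
The suggestions in the replies can be combined into a minimal sketch: read onclick as an ordinary attribute of each tr, then use re only on that short string. The sample markup is trimmed from the output in the post; the helper name go_detail_args is hypothetical, and the pattern assumes every argument is a single-quoted literal with no embedded quotes.

```python
import re
from bs4 import BeautifulSoup

# Sample markup trimmed from the output shown in the post (two rows only).
html = """
<tbody>
<tr class="td_odd" onclick="goDetail(
'101','70-1-44-1(035)')"
style="cursor:pointer;">
<td align="center" valign="middle">1</td>
</tr>
<tr class="td_event" onclick="goDetail(
'301','40-107-4-1(9918)')"
style="cursor:pointer;">
<td align="center" valign="middle">2</td>
</tr>
</tbody>
"""

soup = BeautifulSoup(html, 'html.parser')


def go_detail_args(soup):
    """Return one tuple of arguments per <tr> that carries an onclick."""
    rows = []
    for tr in soup.select('tr[onclick]'):
        onclick = tr['onclick']  # onclick is read like any other attribute
        # Assumption: each argument is a single-quoted literal.
        rows.append(tuple(re.findall(r"'([^']*)'", onclick)))
    return rows


args = go_detail_args(soup)
print(args)
# [('101', '70-1-44-1(035)'), ('301', '40-107-4-1(9918)')]
```

With the soup from the original post, the same helper should work unchanged as long as the rows keep the same goDetail('...','...') shape; if the arguments can contain quotes or are not all strings, the pattern would need to be adjusted.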