[問題] 爬蟲 拆寫字串問題
各問大大好
小弟新手
最近在用python urllib2 的lxml方式 也就是xpath語法
想爬些資料參考
其中該網頁的某html的語段
onclick="
onProductClick(this,{'id':'2-mb-161017-twzh-1.png','name':'台北直飛馬尼拉/宿霧
*','creative':'http://www.airasia.com/cdn/aa-images/zh-TW/main-banner/2-mb-161017-twzh-1.png?sfvrsn=4','position':'Home
page > zh-TW > main banner 2'})"
我想爬到解析"台北直飛馬尼拉/宿霧"
但不知道該怎解到那段字?
因為這一大串本身就是個大字串
用了許多方式 split atrip 都沒辦法切到重點
請問該如何解了?
python 的 scrapy還沒試過
等會試試看
感謝
--
※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 114.42.230.112
※ 文章網址: https://www.ptt.cc/bbs/Python/M.1476887623.A.888.html
→
10/19 23:56, , 1F
10/19 23:56, 1F
→
10/19 23:57, , 2F
10/19 23:57, 2F
→
10/20 00:05, , 3F
10/20 00:05, 3F
→
10/20 00:05, , 4F
10/20 00:05, 4F
→
10/20 00:05, , 5F
10/20 00:05, 5F
→
10/20 00:05, , 6F
10/20 00:05, 6F
→
10/20 00:06, , 7F
10/20 00:06, 7F
→
10/20 00:06, , 8F
10/20 00:06, 8F
→
10/20 00:07, , 9F
10/20 00:07, 9F
→
10/20 00:08, , 10F
10/20 00:08, 10F
→
10/20 00:09, , 11F
10/20 00:09, 11F
→
10/20 00:09, , 12F
10/20 00:09, 12F
→
10/20 00:10, , 13F
10/20 00:10, 13F
→
10/20 00:10, , 14F
10/20 00:10, 14F
→
10/20 00:11, , 15F
10/20 00:11, 15F
→
10/20 00:11, , 16F
10/20 00:11, 16F
→
10/20 00:13, , 17F
10/20 00:13, 17F
→
10/20 00:14, , 18F
10/20 00:14, 18F
→
10/20 00:18, , 19F
10/20 00:18, 19F
→
10/20 02:14, , 20F
10/20 02:14, 20F
→
10/20 09:13, , 21F
10/20 09:13, 21F
推
10/20 09:47, , 22F
10/20 09:47, 22F
推
10/20 10:26, , 23F
10/20 10:26, 23F
→
10/20 13:38, , 24F
10/20 13:38, 24F
→
10/20 15:19, , 25F
10/20 15:19, 25F
→
10/20 17:10, , 26F
10/20 17:10, 26F
→
10/20 17:10, , 27F
10/20 17:10, 27F
→
10/20 22:55, , 28F
10/20 22:55, 28F
→
10/21 01:52, , 29F
10/21 01:52, 29F
→
10/21 01:52, , 30F
10/21 01:52, 30F
推
10/21 07:31, , 31F
10/21 07:31, 31F
Python 近期熱門文章
PTT數位生活區 即時熱門文章