[Question] Fetching and parsing a Chinese web page

Board: PHP | Author: sheeper (as) | Posted: 2010/04/10 15:22 | Score: 1 (1 push, 0 boo, 2 neutral)

3 comments, 2 participants, thread 1/1

Below is a simple script that extracts a news story. The same logic works on English pages but fails on this 聯合報 (udn.com) page. Could someone please take a look? Thanks in advance.

<?php
$search_url = "http://udn.com/NEWS/WORLD/WOR3/5528622.shtml";

// Fetch the page with cURL, sending a browser-like User-Agent.
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $search_url);
curl_setopt($ch, CURLOPT_USERAGENT, 'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.1.7) Gecko/20091221 Firefox/3.5.7 GTBDFff GTB7.0 (.NET CLR 2.0.50727)');
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($ch, CURLOPT_HTTPHEADER, array("Accept-Language: zh-tw", "Accept-Charset: utf-8"));
$content = curl_exec($ch);
curl_close($ch);
echo $content . "\n";

// Try to capture the contents of the story <div>.
$pattern = "/(<div class=\"story\" id=\"story\">)(.*?)(<\/div>)/";
echo $pattern . "\n";
preg_match($pattern, $content, $matches);
print_r($matches);
?>

--
For want of a nail the shoe was lost,
for want of a shoe the horse was lost,
for want of a horse the knight was lost,
for want of a knight the battle was lost,
for want of a battle the kingdom was lost.
--
※ Origin: 批踢踢實業坊 (ptt.cc)
◆ From: 24.6.21.35
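One possible explanation, offered as a guess and not a confirmed diagnosis: without the `s` modifier, `.` in a PCRE pattern does not match newlines, so `(.*?)` cannot span a story `<div>` that is spread over several lines. The page may also be served in an encoding other than UTF-8 (Big5 is an unverified assumption here). The sketch below uses a hypothetical helper, `extract_story()`, and an invented multi-line sample string in place of the downloaded page.

```php
<?php
// Sketch only: extract_story() is a hypothetical helper name, and the
// idea that the real page is Big5-encoded is an unverified assumption.

/**
 * Return the trimmed inner HTML of <div class="story" id="story">,
 * or null when the pattern does not match.
 */
function extract_story(string $html, string $sourceEncoding = 'UTF-8'): ?string
{
    // Convert to UTF-8 first so the /u modifier receives valid input.
    // (Requires the mbstring extension when a conversion is needed.)
    if (strcasecmp($sourceEncoding, 'UTF-8') !== 0) {
        $html = mb_convert_encoding($html, 'UTF-8', $sourceEncoding);
    }

    // s: let "." match newlines; u: treat pattern and subject as UTF-8.
    $pattern = '/<div class="story" id="story">(.*?)<\/div>/su';

    if (preg_match($pattern, $html, $matches) === 1) {
        return trim($matches[1]);
    }
    return null;
}

// Invented multi-line sample standing in for the downloaded page.
$sample = "<html><body>\n"
        . "<div class=\"story\" id=\"story\">\n"
        . "第一段\n"
        . "第二段\n"
        . "</div>\n"
        . "</body></html>";

var_dump(extract_story($sample));
```

With the script from the post, this would be called as `extract_story($content)`, or `extract_story($content, 'BIG-5')` if the response turns out to be Big5. The non-greedy match stops at the first `</div>`, so a story containing nested `<div>` elements would be cut short; an HTML parser such as `DOMDocument` is likely more robust in that case.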

推 (push)  04/10 15:59, 1F

→  04/10 16:01, 2F

→  04/10 16:35, 3F

Article ID (AID): #1Bm2TFDu (PHP)
