Author: logoin (平衡)
Board: C_Sharp
Title: [Question] How to connect to many web pages and get the first few sentences of each article
Time: Sat Oct 2 05:12:50 2004
hello
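I have a web page that needs to analyze a lot of websites (about 100),
so I need to fetch the first few sentences from each site for analysis.
I looked at them, and every site's structure is completely different,
so I'm not sure where to start with parsing or fetching the data.
Does anyone have a good idea?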
For example:
I want to connect to the CNN web page
http://www.cnn.com/
and get the first five sentences:
Mount St. Helens blows smoke, ash
Mount St. Helens began blowing a large cloud of smoke and steam
Friday following a week in which scientists have closely monitored
the volcano. The volcanic dome within the mountain's crater has
moved about three inches since Monday, U.S. Geological Survey
geologist John Major said.
Geologist Tom Pierson called the vent a small explosion, which
could be the first of larger events, including an eruption.
I'm using ASP.NET + C#.
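To make the task concrete, here is a minimal sketch of one possible starting point in C#. It is not a known solution: the names PageSampler, FetchHtmlAsync and FirstSentences are hypothetical, and the approach (strip tags with regular expressions, then split on sentence-ending punctuation) is only an assumption about what might be good enough. It would very likely return menu or header text first on many sites, and it splits wrongly on abbreviations such as "St." or "U.S.", so per-site rules or a real HTML parser may still be needed.

```csharp
using System;
using System.Collections.Generic;
using System.Net;
using System.Net.Http;
using System.Text.RegularExpressions;
using System.Threading.Tasks;

// Hypothetical helper: the class and method names are illustrative only.
public static class PageSampler
{
    private static readonly HttpClient Client = new HttpClient();

    // Download the raw HTML of one page.
    public static Task<string> FetchHtmlAsync(string url)
    {
        return Client.GetStringAsync(url);
    }

    // Reduce HTML to plain text and return up to `count` leading sentences.
    public static List<string> FirstSentences(string html, int count)
    {
        // Drop script/style blocks and comments; their contents are not visible text.
        string text = Regex.Replace(html, @"<(script|style)\b[^>]*>.*?</\1\s*>", " ",
            RegexOptions.IgnoreCase | RegexOptions.Singleline);
        text = Regex.Replace(text, @"<!--.*?-->", " ", RegexOptions.Singleline);

        // Replace the remaining tags with spaces, decode entities such as &amp;,
        // then collapse runs of whitespace into single spaces.
        text = Regex.Replace(text, @"<[^>]+>", " ");
        text = WebUtility.HtmlDecode(text);
        text = Regex.Replace(text, @"\s+", " ").Trim();

        // Naive sentence split: break at whitespace that follows '.', '!' or '?'.
        var result = new List<string>();
        foreach (string piece in Regex.Split(text, @"(?<=[.!?])\s+"))
        {
            if (result.Count >= count) break;
            if (piece.Length > 0) result.Add(piece);
        }
        return result;
    }
}
```

For the example above, the idea would be to call `await PageSampler.FetchHtmlAsync("http://www.cnn.com/")` and pass the result to `PageSampler.FirstSentences(html, 5)`, repeating that for each of the roughly 100 sites.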
Thanks, everyone.
--
※ Posted from: 批踢踢实业坊 (ptt.cc)
◆ From: 130.212.108.241