W3Cschool
恭喜您成為首批注冊用戶
獲得88經(jīng)驗值獎勵
簡單的爬蟲案例: 將百度的首頁爬取下來并保存在文件中。
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.UnsupportedEncodingException;
import java.net.URL;
public class Test {
public static void main(String[] args) throws UnsupportedEncodingException, IOException {
URL url = new URL("http://www.baidu.com");
BufferedReader bReader =
new BufferedReader(new InputStreamReader(url.openStream(), "utf-8"));
BufferedWriter bWriter =
new BufferedWriter(new OutputStreamWriter(new FileOutputStream("baidu.html")));
String msg = null;
while((msg = bReader.readLine()) != null) {
// System.out.println(msg);
bWriter.append(msg + "\n");
}
bWriter.close();
bReader.close();
}
}
Copyright©2021 w3cschool編程獅|閩ICP備15016281號-3|閩公網(wǎng)安備35020302033924號
違法和不良信息舉報電話:173-0602-2364|舉報郵箱:jubao@eeedong.com
掃描二維碼
下載編程獅App
編程獅公眾號
聯(lián)系方式:
更多建議: