I want to scrap a website.
(https://www.songpa.go.kr/www/index.do)
However, when I import a web page for that site from the Download Tool, a 400 Error appears.
(DownloadData)
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<HTML><HEAD>
<TITLE>400 Bad Request</TITLE>
</HEAD><BODY>
<H1>
400 Bad Request</H1>
</BODY></HTML>
(DownloadHeaders)
HTTP/1.1 400 Bad Request
Date: Tue, 21 Jun 2022 04:18:38 GMT
Content-Type: text/html; charset=EUC-KR
Connection: close
Content-Length: 157
Using the same URL in Python Tool with the "beautifulsoup4" and "requests" packages, page information is imported well.
How can I get the information using the Download Tool?
Solved! Go to Solution.
You're right. When I add User-Agent, the web page source is output well.
Thank you for your help.
User | Count |
---|---|
106 | |
82 | |
72 | |
54 | |
40 |