How to validate whether a URL exists
Hello,
Is there any way to check whether a URL exists? I used the Download tool to check. For some sites this works by looking at the Download Headers: when the Download Headers say "HTTP/1.1 200 OK", I know that the URL exists. However, some sites always return "HTTP/1.1 999 Request denied". Is that because they are private websites?
I attached a sample workflow.
Sincerely,
Kazumi
Labels: Download, Tips and Tricks
Hi Kazumi,
It looks like LinkedIn filters requests based on the user agent. That means that unless you do some serious tweaking of the Download tool to modify the headers before they are sent to the site, you may keep getting the denied message. Here is a link to a Stack Overflow question that covers the 999 error and user agents in more detail: http://stackoverflow.com/questions/27231113/999-error-code-on-head-request-to-linkedin
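For anyone who wants to try the header idea outside the Download tool, here is a rough sketch in plain Python (standard library only). It sends a HEAD request with an explicit User-Agent and reports the status code. The names `BROWSER_UA`, `build_head_request` and `fetch_status` are made up for this example, the User-Agent string is only a placeholder, and there is no guarantee that any particular site will accept it.

```python
import urllib.error
import urllib.request

# Hypothetical User-Agent value; this string is an assumption, not a known-good one.
BROWSER_UA = "Mozilla/5.0 (compatible; url-checker-sketch)"


def build_head_request(url, user_agent=BROWSER_UA):
    """Build a HEAD request that carries an explicit User-Agent header."""
    return urllib.request.Request(
        url, headers={"User-Agent": user_agent}, method="HEAD"
    )


def fetch_status(url, opener=urllib.request.urlopen, timeout=10):
    """Return the HTTP status code for url, or None if no response arrived."""
    request = build_head_request(url)
    try:
        with opener(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # Non-2xx answers (404, 999, ...) still carry a status code.
        return err.code
    except OSError:
        # URLError is a subclass of OSError: DNS failure, refused
        # connection, timeout, and similar all end up here.
        return None
```

Calling `fetch_status("http://example.com/")` should give back an integer such as 200 when the server answers, and `None` when nothing could be reached. A site that blocks the request would still show up as its own code (for example 999), so the caller can tell "denied" apart from "unreachable".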
Henry
Hi Kazumi,
The reason some sites do not work is the security the website applies. Some websites only allow access to public profiles, and some do not allow access from third-party tools.
It's late, but I'm posting this for future reference.
I have written a Python-based workflow/tool that validates whether a URL is valid or not. It is based on the 'gethostbyname' function in the Python socket library. It is one of the fastest solutions I have come across, and it can validate any number of records. As a rough estimate, validating 1000 records takes about 7 minutes.
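The workflow itself is not reproduced here, but a minimal sketch of the `gethostbyname` idea might look like the following. The function name `host_resolves` and the `resolver` parameter are assumptions made for illustration. Note that a DNS lookup only shows that the host name resolves; it does not confirm that the specific page exists or returns "200 OK".

```python
import socket
from urllib.parse import urlparse


def host_resolves(url, resolver=socket.gethostbyname):
    """Return True if the host part of url resolves to an address."""
    try:
        host = urlparse(url).hostname
        if not host:
            # No host part, e.g. plain text or a URL missing its scheme.
            return False
        resolver(host)
        return True
    except (OSError, ValueError):
        # socket.gaierror is a subclass of OSError (lookup failed);
        # ValueError covers malformed URLs and badly encoded host names.
        return False
```

Usage would be along the lines of `host_resolves("http://example.com/some/page")`, applied to each record in turn. Since each lookup waits on DNS, failed or slow lookups are probably what dominates the run time on large lists.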
