eham06
1
I am using the read_html command and I get the following error message on a specific website.
library('rvest')
library('dplyr')
webpage <- read_html("https://www.edmunds.com/ford/escape/2018/cost-to-own/")
Error in open.connection(x, "rb") : HTTP error 403.
Are specific websites blocking R from being scraped?
mara
2
It's not specific to R, but the site might block webscraping. Two places to look are at the site's robots.txt and the Terms of Service
https://www.edmunds.com/robots.txt
https://www.edmunds.com/about/visitor-agreement.html
1 Like
eham06
3
Thanks for the response...
system
Closed
4
This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.