Hi everyone!
It's been one year since I sarted webscraping and never found a problem like the following:
sensacine_web<- read_html("https://www.sensacine.com/")
sensacine_text<- sensacine_web %>%
html_node(".titlebar-title-lg .titlebar-link") %>%
html_attr("href")
sensacine_link <- sensacine_web %>%
html_node(".titlebar-title-lg .titlebar-link") %>%
html_text()
While html_text works fine in this website, html_attr is unable to extract the hyperlink. It seems to be a problem in this website, but it worked in other ones. Apparently it isn't protected against webscraping, so why does this happen?