I'm working on my master's thesis, looking at the impact of flooding on home values in Florida. I'm using data from Zillow (for property values) and NOAA (for flood and weather-related data). I want to merge these datasets in R, but I'm not sure how to go about it. Has anyone done something similar, or does anyone have tips on how to approach this?
There are a number of "join" functions in dplyr (part of the tidyverse) designed to do exactly this: inner_join(), left_join(), and so on. Basically, you combine two tibbles into one by matching rows on one or more key variables. Of course, you have to know which variables identify a match between the two data sets (for example, a shared location identifier and time period). Take a look at chapter 19 ("Joins") in R for Data Science (2nd edition).
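Here's a minimal sketch of what a join could look like. I don't know how your Zillow and NOAA files are laid out, so the two small tibbles below are made-up stand-ins, and the column names (county_fips, year, home_value, flood_events) are assumptions rather than the real field names. The idea is just that both tables need the same key columns, stored as the same type.

```r
library(dplyr)
library(tibble)

# Hypothetical stand-in for the Zillow data: one row per county and year.
zillow <- tibble(
  county_fips = c("12086", "12086", "12011", "12099"),
  year        = c(2020, 2021, 2021, 2021),
  home_value  = c(350000, 390000, 365000, 340000)
)

# Hypothetical stand-in for the NOAA data: flood event counts per county and year.
noaa <- tibble(
  county_fips  = c("12086", "12011", "12057"),
  year         = c(2021, 2021, 2021),
  flood_events = c(3, 1, 2)
)

# Keep every Zillow row and attach NOAA columns where county and year both match.
merged <- left_join(zillow, noaa, by = c("county_fips", "year"))

# Zillow rows without a NOAA match get NA. Treating NA as zero events is an
# assumption; it only makes sense if a missing row really means "no floods".
merged <- mutate(merged, flood_events = coalesce(flood_events, 0))

# Zillow rows that found no match in NOAA, handy for spotting key problems.
unmatched <- anti_join(zillow, noaa, by = c("county_fips", "year"))
```

left_join() keeps all the rows from the first table, which is usually what you want when the property values are the outcome you're studying. If you only want rows present in both tables, swap in inner_join(). Checking the anti_join() result before going further is a cheap way to catch mismatched keys, such as ZIP codes stored as numbers in one file and as text in the other.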