Hi all!
I have several data sets that I want to compare with each other (df0
with df1
, df1
with df2
and df2
with df3
). When a change is made, a variable identifying the source of the change should be entered for each column in a new column.
Here are my sample data sets:
df0 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2"), "x10" = c(0,NA,2,3,4), "x11" = c(NA,0,1,2,3), "x12" = c(3,4,0,NA,NA))
df1 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2"), "x10" = c(0,NA,1,3,3), "x11" = c(NA,0,1,3,3), "x12" = c(3,2,0,NA,NA))
df2 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2", "D1"), "x10" = c(0,NA,1,3,3,NA), "x11" = c(NA,0,1,3,3,NA), "x12" = c(3,2,0,NA,NA,NA))
df3 = data.frame("ind" = c("A1", "A2", "B1", "C1", "C2", "D1"), "x10" = c(0,1,1,3,3,NA), "x11" = c(4,0,1,3,3,1), "x12" = c(3,2,0,2,1,NA), "x13" = c(0,1,NA,2,1,1))
The result for the comparison of df0
and df1
should look like (x
means, that there was some change):
In the next step, df1
would be compared with df2
. If there are changes, the variable y
should be written into the change_x1.
-column, which would have the following result:
The final comparison of df2
with df3
would lead to the following result (z
indicates a change between the two data sets):
Do you have any idea, how I could solve that problem in R?