yoyong
April 21, 2021, 9:58pm
1
Hi. I have a question.
I have two datasets.
All observations in Data1 are in Data2. But the opposite is not true. Datasets have different variables. (see below). I would like to remove observations from the two lists. So the output should just contain Maria and Jose.
Data1
Name Grade Sport
Jhon 85 Tennis
Alex 75 Golf
Alice 92 Table Tennis
Data2
ID Gender Sector
Jhon M Gov
Alex M Gov
Alice F Health
Maria F Sport
Jose M Business
Appreciate your help.
yoyong
You can use anti_join()
in dplyr
.
It returns all rows from x
where there are not matching values in y
, keeping just columns from x
.
https://dplyr.tidyverse.org/reference/join.html
Also, please provide a reproducible example next time.
A minimal reproducible example consists of the following items:
A minimal dataset, necessary to reproduce the issue
The minimal runnable code necessary to reproduce the issue, which can be run
on the given dataset, and including the necessary information on the used packages.
Let's quickly go over each one of these with examples:
Minimal Dataset (Sample Data)
You need to provide a data frame that is small enough to be (reasonably) pasted on a post, but big enough to reproduce your issue.
Let's say, as an example, that you are working with the iris data frame
head(iris)
#> Sepal.Length Sepal.Width Petal.Length Petal.Width Species
#> 1 5.1 3.5 1.4 0.…
1 Like
system
Closed
May 12, 2021, 11:11pm
3
This topic was automatically closed 21 days after the last reply. New replies are no longer allowed. If you have a query related to it or one of the replies, start a new topic and refer back with a link.