I'm working on a project where I'm analyzing the reviews of a certain product.
I split the sentences into individual words, so in my data frame every row is now a single word.
When I tried to count the most common words in the reviews, I ran into the "problem" that the top 10 basically contains the same word twice: RStudio (rightfully) lists them as different words because they have different endings (conjugations and other suffixes).
Here is what I mean. This is what I get when I run my code:
Review_words %>% count(word, sort = TRUE)
use
using
good
different
differently
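For context, here is a minimal sketch of how a data frame like Review_words could be built so the behaviour can be reproduced. The reviews are made up, the names reviews, review_id and text are placeholders, and I am assuming the tidytext package for the tokenizing step, so the real pipeline may differ.

```r
library(dplyr)
library(tidytext)

# Made-up reviews, only for illustration; the real data is not shown here.
reviews <- tibble(
  review_id = 1:3,
  text = c(
    "I use it every day and it is good",
    "Using it feels different",
    "It works differently but I still use it"
  )
)

# One row per word; unnest_tokens() lowercases the words by default.
Review_words <- reviews %>%
  unnest_tokens(word, text)

# "use" and "using" (and "different" / "differently") end up as separate rows.
Review_words %>%
  count(word, sort = TRUE)
```

With this toy data, "use" and "using" are counted separately, which is the same effect I see on the real reviews.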
How do I strip these endings from the words so that I can work with the "core words", like "use", "different", etc.?