Textual Analysis in R

JLJ99 · April 30, 2021, 3:18pm

Dear RStudio community,

for my bachelor thesis, I am conducting a textual analysis of company-specific newspaper articles and their effect on stock returns.

As I do not have a lot of experience using R, I would like to tap this community in order to get to a starting point. What I have is two spreadsheets: One, containing all company-specific news over a 6-month period with their publication date. The other one contains negative as well as positive keywords (Loughran & McDonald, 2011).

What would be your approach, if you wanted to generate two values: 1) How many of 'hits' are there in a given news article containing a word from the positive list and 2) how many of 'hits' are there in a given news article containing a word from the negative list?

Thanks a lot in advance!

Best,
JLJ

technocrat · April 30, 2021, 8:36pm

The [{tidytext} package] (https://www.tidytextmining.com/) is suitable to this purpose.

JLJ99 · May 1, 2021, 10:53am

Thanks a lot, this provided me with a good starting point!

system · April 24, 2024, 3:54pm

This topic was automatically closed 42 days after the last reply. New replies are no longer allowed.

If you have a query related to it or one of the replies, start a new topic and refer back with a link.