I am trying to use the following TreeTagger package on RStudio Workbench:
https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/
I have followed the instructions as per the above and tested with the following code in RStudio Workbench:
library(textstem)
x <- c(
'the dirtier dog has eaten the pies',
'that shameful pooch is tricky and sneaky',
"He opened and then reopened the food bag",
'There are skies of blue and red roses too!',
NA,
"The doggies, well they aren't joyfully running.",
"The daddies are coming over...",
"This is 34.546 above"
)
Default lexicon::hash_lemmas dictionary
lemmatize_strings(x)
Hunspell dictionary
lemma_dictionary <- make_lemma_dictionary(x, engine = 'treetagger')
The above then prompt me for the following:
TreeTagger does not appear to be installed.
Would you like me to open a download browser?
1: Yes
2: No
At this stage I stop and don't proceed further.
If I provide the path to the treetagger package location, which houses all the R packages, with the following:
lemma_dictionary <- make_lemma_dictionary(x, engine = 'treetagger', path = '/opt/repo/CRAN/treetagger')
I get the following error:
Error: None of the following files were found, please check your TreeTagger installation!
/opt/repo/CRAN/treetagger/cmd/utf8-tokenize.perl
/opt/repo/CRAN/treetagger/cmd/tokenize.perl
In addition: Warning message:
NA is replaced by empty string
We have physically checked the above path and the files do exist, which is why the error message is strange. In an attempt to get it working, we tried on Windows and it's works fine with no issues. However, on Ubuntu, we have found the above errors.
Any help, support and advise on next steps would be greatly appreciated.