'Text Analysis and dealing with grammar, tense in R
I am trying to do text analysis in R. I am able to do the frequency counts and wordcloud. But I could not figure out how to work with the words which are same but different tense such as "enjoy", "enjoyed". I want these words to count as single word "enjoy" rather than 2 separate words. Is there are way I can fix these words or change to present tense?
Solution 1:[1]
You can either use stemming as a pre-processing step or use the Quanteda package and employ a wildcard pattern match by specifying "enjoy*" to include variations such as "enjoyed" and "enjoying"
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|---|
| Solution 1 | Jeanette |
