I have a long list of item descriptions. Some of these descriptions are duplicates but the text is not exactly matching. For example: descriptions are like "The Brawn White Bolt Laser 10W" and "Laser for 10 Watt White Bolt". Though these are the same items but having different descriptions. There are many items as such in the list. My objective is to find out such common items based on their descriptions.
I have a thought to find the nouns in each description field and match them across each other. Segregate those item descriptions that matches with most common nouns. Just wondering if it is making any sense? Is there any algorithm for that in R? Is there any other way to do this kind of matching?