The Natural Language Toolkit (developerWorks)
Posted Jun 25, 2004 22:15 UTC (Fri) by cpeterso (guest, #305)
In reply to: The Natural Language Toolkit (developerWorks) by iabervon
Parent article: The Natural Language Toolkit (developerWorks)
Pardon my ignorance <:) but what does "top 5 porter-stemmed words by term-frequency-inverse-document-frequency" mean? Does it mean the words that are most common in a particular email but least common across ALL emails? In other words, which words make this email special compared to all the other emails?
Wow, that is a very interesting idea you had! :D
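
To make the question concrete, here is a minimal sketch of one common tf-idf variant: a word's count in one email divided by that email's length, multiplied by ln(N / number of emails containing the word). This is only an illustration and not necessarily the weighting iabervon used. The function names (`tokenize`, `top_tfidf_words`) and the sample emails are made up for this example. Stemming is left as a pluggable `stem` argument that defaults to doing nothing; with NLTK installed, `nltk.stem.PorterStemmer().stem` could be passed in to get Porter-stemmed words.

```python
import math
import re
from collections import Counter


def tokenize(text, stem=lambda w: w):
    """Lowercase, split into alphabetic words, and apply the stemmer."""
    return [stem(w) for w in re.findall(r"[a-z]+", text.lower())]


def top_tfidf_words(docs, index, n=5, stem=lambda w: w):
    """Return the n highest-scoring (word, score) pairs for docs[index].

    Hypothetical helper: assumes tf = count / document length and
    idf = ln(N / document frequency), one common convention among several.
    """
    tokenized = [tokenize(d, stem) for d in docs]

    # Document frequency: in how many documents each word appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    tf = Counter(tokenized[index])
    total = len(tokenized[index])
    scores = {
        word: (count / total) * math.log(len(docs) / df[word])
        for word, count in tf.items()
    }
    # Highest score first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:n]


# Invented sample data, purely for illustration.
emails = [
    "the kernel patch fixes the scheduler bug",
    "the meeting is moved to friday",
    "the scheduler patch needs review before the merge",
]

for word, score in top_tfidf_words(emails, 0):
    print(f"{word}: {score:.3f}")
```

In this sample, "the" is the most frequent word in the first email, but it appears in every email, so its idf is ln(3/3) = 0 and it drops out of the top five. Words found only in the first email ("bug", "fixes", "kernel") score highest, which matches the reading in the question: the words that make this email stand out from the rest.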