There will be five videos, with a sample application based on a popular job posting board:
- loading text into RapidMiner (paste, file, group of files in folders, database)
- processing text in RapidMiner (strip html, tokenize, n-grams, stemming, stopwords, frequency tables)
- word vectorization and association rules with text
- calculating the similarity between documents, clustering
- automatically classifying documents and determining which words are important
Oh I can't wait for this!
ReplyDelete