Document Similarity and Clustering in RapidMiner

Loading...

Sign in or sign up now!
Alert icon
Upgrade to the latest Flash Player for improved playback performance. Upgrade now or more info.
7,789
Loading...
Alert icon
Sign in or sign up now!
Alert icon

Uploaded by on Nov 12, 2010

This is part 4 of a 5 part video series on Text Mining using the free and open-source RapidMiner.

This video describes how to calculate a term's TF-IDF score, as well as how to find similar documents using cosine similarity, and how to cluster documents using the K-Means algorithm.

  • likes, 0 dislikes

Link to this comment:

Share to:

Top Comments

  • I like about your video that you also discuss theory and not only tell us what to click

  • Even if I don't use RapidMiner your video is fantastic. Nice explanation of the theory! Nice Idea showing the diagrams on Google Docs!

see all

All Comments (4)

Sign In or Sign Up now to post a comment!
  • Thanks for your help! Excellent job!!

    From Chile

  • Great tutorial !! But I do have one question where does your Simalrity Measure Object View come from. followed your tutorial step by step but cant get this view..

    cheers

Loading...

Alert icon
0 / 00Unsaved Playlist Return to active list
    1. Your queue is empty. Add videos to your queue using this button:
      or sign in to load a different list.
    Loading...Loading...Saving...
    • Clear all videos from this list
    • Learn more