Search this blog

Tuesday, 22 January 2013

New in 5.3: Annotation operators

I just downloaded RapidMiner 5.3 and some shiny new things caught my eye. One thing I noticed was some new operators that allow annotations to be created within a process. Annotations are associated with IOobjects like data, cluster models, weights and you can store any free text you like with them. The annotation is retained with the IOobject if it is stored in the repository. With these new operators, it means you could add generated information such as a timestamp, the time taken to process the data or any other environmental information.

I have a lot of data lying around in my repositories and it would be very helpful if I had created a simple annotation to give me some clue what the data is and where it came from. Now I can.


  1. Where did the Web Mining and Text Mining operators go in v.5.3?

  2. They are extensions you download in addition to the main program. Go to Help->Updates and Extensions.

  3. I did, but they don't show up in the Operators window like in V.5. How do you bring web mining for example into the Design view?

    1. Are they enabled from the Manage Extensions view? Go to Help->Manage Extensions and enable them there. If they are disabled, the operators will not show up. If they are already enabled then that's a harder problem.

  4. They are all enabled, that's the strange thing. There is no documentation for 5.3 so it's very difficult to troubleshoot the problem. I will have to email RapidMiner.

  5. Hmmm - a harder problem indeed. Not sure why that would be, I would capture some log files.

    Something like this on Windows command prompt...

    RapidMinerGUI.bat 2>err.log