Data Science With RapidMiner: Scaling attribute values using weights

Sunday, 21 July 2013

Scaling attribute values using weights

Here's a process that multiplies each value of an attribute within one example set by a constant in another example set. The constants are specific for each attribute and the process uses weights derived from the example set. In effect, a matrix multiplication is happening.

At a high level, the process works as follows.

The Iris data set is used with weights being produced using "Weight By Information Gain"
These weights are transformed into an example set and stored for later use inside a Loop operator
A subprocess is used to make sure everything works in the right order (this technique is also used inside the Loop).
A "Loop Attributes" operator iterates over all attributes and generates a new attribute based on multiplying the existing value by a weight. The attribute name is required to be contained in the weights example set.
The weight for each example is calculated with a combination of filtering and macro extraction.

Data Science With RapidMiner

Search this blog

Sunday, 21 July 2013

Scaling attribute values using weights

No comments:

Post a Comment

About Me

Labels

Blog Archive