Missing values
Submitted by rajgupta85 on Sun, 2007-04-01 07:07
HI
I'm implimenting decision tree on .Net platform..can any one suggest the best and most effective way to handle missing values in dataset.
HI
I'm implimenting decision tree on .Net platform..can any one suggest the best and most effective way to handle missing values in dataset.
re: missing values
In the tutorial part of this I mentioned a couple of things you can do to handle missing attributes if you are coding a decision tree algorithm:
* Simply have another possible value that an attribute can take - 'blank'. This is then treated as any other value, with branches in a tree being labelled by this if necessary. This is useful if some meaning can potentially be attached to missing atributes.
* Replace the black value with the value that occurs most frequently in a similar context.