I am working with Naive Bayesian classifier over PHP (http://www.xhtml.net/php/PHPNaiveBayesianFilter) And there’s a list of words which can be ignored while training the system. Those words are not saved into the database and therefore not used for the classification. I would like to improve the system as much as I can so I was wondering if there’s any rule or