1;3409;0c A Comparable Study Employing WEKA Clustering/Classification Algorithms for Web Page Classification

A Comparable Study Employing WEKA Clustering/Classification Algorithms for Web Page Classification

2011 15th Panhellenic Conference on Informatics, 2011
Pages: 235-239DOI: 10.1109/PCI.2011.52

PCI

bibtex

Documents and web pages share many similarities. Thus classification methods used in documents can be applied to advanced web content, with or even without modifications. Algorithms for document and web classification are presented as an introduction. One out of many tools that can be used in method evaluation, application and modification is WEKA (Waikato Environment for Knowledge Analysis). Testing results and conclusions strengthen the principles and bases of classification, while demonstrating the need for a new interlayer in the evaluation of classification methods.