A simple and efficient algorithm for automatic classification of web pages