Muralidhar Pantula, K S Kuppusamy, A Machine Learning-Based Model to Evaluate Readability and Assess Grade Level for the Web Pages, The Computer Journal, Volume 65, Issue 4, April 2022, Pages 831–842, https://doi.org/10.1093/comjnl/bxaa113
Abstract
Evaluating the readability of web documents has gained attention for several reasons, such as improving the effectiveness of writing and reaching a wider audience. Current practice relies on several statistical measures to evaluate the readability of a document. In this paper, we propose a machine learning-based model to compute the readability of web pages. The model also computes the grade level, that is, the minimum educational standard required to understand the contents of a web page. The proposed model classifies web pages as highly readable, readable or less readable using a specified feature set that includes sentence count, word count, syllable count, type-token ratio and lexical ambiguity. To increase the usability of the proposed model, we have developed an accessible browser extension that performs these assessments on every web page loaded into the browser.