Aye, Nilar; San, Pan Ei
(Fourteenth International Conference On Computer Applications (ICCA 2016), 2016-02-25)
The web content classification system
classifies the noise or content from HTML web pages.
The system proposes the Content Extraction
algorithm using content features to remove the
boilerplate and to extract the main ...