Exploring Large HTML Documents on the Web