Large scale acquisition and maintenance from the web without source access

T Leonard, H Glaser - 2001 - eprints.soton.ac.uk
Although different web sites structure their pages differently, the pages within a single site
are often generated from a database and have a regular layout from which it is possible to
extract information automatically. Dome is a visual tool for manipulating tree-structured
documents. It can import and export in XML or HTML formats, making it ideal for harvesting
information from web pages. Editing is performed using a direct manipulation interface and
the operations are recorded for later playback. The knowledge extracted from a web page …

[CITATION][C] Large scale acquisition and maintenance from the Web without source access. In proceedings of workshop on knowledge markup and semantic …

T Leonard, H Glaser - 2001
Showing the best results for this search. See all results