A warehouse or local cache for web data
Loading...
Date
2005
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Sokoine University of Agriculture
Abstract
The internet web is becoming important resource as a source of information. Various
companies make the information from their databases available through search web
page forms. The information ranges from indexed documents to product information.
This information becomes more valuable if can be available to other software
applications for further processing. In this project I present the efficient mechanism of
extracting web data. The approach which I am using is based on anal) sing the patterns
of the HTML tags. The customised general model of the web page is produced and is
used for extracting data produced by subsequent web queries. The data extracted is
populated to the database. The motivation behind this project is that the information is
available to other software applications and hence can be used for decision making
systems.
Description
Dissertation
Keywords
Web page analysis, Internet, Web data