A warehouse or local cache for web data

Loading...
Thumbnail Image

Date

2005

Journal Title

Journal ISSN

Volume Title

Publisher

Sokoine University of Agriculture

Abstract

The internet web is becoming important resource as a source of information. Various companies make the information from their databases available through search web page forms. The information ranges from indexed documents to product information. This information becomes more valuable if can be available to other software applications for further processing. In this project I present the efficient mechanism of extracting web data. The approach which I am using is based on anal) sing the patterns of the HTML tags. The customised general model of the web page is produced and is used for extracting data produced by subsequent web queries. The data extracted is populated to the database. The motivation behind this project is that the information is available to other software applications and hence can be used for decision making systems.

Description

Dissertation

Keywords

Web page analysis, Internet, Web data

Citation