E-Commerce alerter for web information changes & semi automatic web wrapper
Date
2005
Authors
Publisher
University of Essex
Abstract
Data extraction is an area of computer science that will come to play an increasingly
important role in the near future. This project provides two separate software applications
that can help in the understanding of the various aspects of data extraction and the analysis
of the different problems and tentative solutions. First is the E-Commerce alerter system, a
program developed to monitor changes to products on an e-commerce site
(www.eBay.com Auctions). The program extracts items from the www.eBay.com site and
stores them in a relational database. It is scheduled to check the site for updated
information every twelve hours. It also monitors the stored data so that it can report
updates to the users who request them. If any changes are found, the user is informed via
electronic mail. Second, a semi-automatic wrapper has been developed: a tool that
assists system developers in wrapping HTML pages into XML documents. The tool helps in
extracting the items of interest from an HTML source page. The extracted data can be used by
another application because it is stored in XML format, which is well structured. The
technique used in developing the semi-automatic wrapper is the Tag Set Progression Grid (TpGrid),
which is a fingerprint representation of an HTML page.
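
The change-monitoring step described in the abstract can be illustrated with a minimal sketch. It is not the dissertation's implementation: the `items` table, its columns, and the `detect_changes` function are hypothetical names chosen for illustration, and fetching pages from the site, the twelve-hour scheduling, and the e-mail notification are left out. It only shows how freshly extracted items might be compared against rows held in a relational database.

```python
import sqlite3


def detect_changes(conn, fetched):
    """Compare freshly extracted items with stored rows and update the store.

    `fetched` maps a (hypothetical) item id to a (title, price) tuple.
    Returns a list of (item_id, kind, old_value, new_value) tuples.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS items "
        "(item_id TEXT PRIMARY KEY, title TEXT, price REAL)"
    )
    changes = []
    for item_id, (title, price) in fetched.items():
        row = conn.execute(
            "SELECT title, price FROM items WHERE item_id = ?", (item_id,)
        ).fetchone()
        if row is None:
            # Item has not been seen before.
            changes.append((item_id, "new", None, (title, price)))
        elif row != (title, price):
            # Stored values differ from the freshly extracted ones.
            changes.append((item_id, "changed", row, (title, price)))
        # Keep the stored snapshot in step with the latest extraction.
        conn.execute(
            "INSERT OR REPLACE INTO items (item_id, title, price) VALUES (?, ?, ?)",
            (item_id, title, price),
        )
    conn.commit()
    return changes


conn = sqlite3.connect(":memory:")
first_run = detect_changes(conn, {"A1": ("Lamp", 10.0)})
second_run = detect_changes(conn, {"A1": ("Lamp", 12.5)})
third_run = detect_changes(conn, {"A1": ("Lamp", 12.5)})
```

In a system like the one described, a scheduler would call such a function after each extraction run, and any non-empty result would be passed on to the notification step.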
Description
Dissertation
Keywords
Web, Web wrapper, Data warehouse, Information and Communication Technology