Ommerce alerter for web information changes & semi automatic web wrapper

Loading...
Thumbnail Image

Date

2005

Journal Title

Journal ISSN

Volume Title

Publisher

University of Essex

Abstract

Data extraction is an area of computer science that will come to play an increasingly important role in the near future. This project provides two separate software applications that can help in the understanding of the various aspects of data extraction and the analysis of the different problems and tentative solutions. First is the E-Commerce alerter system, a program which is developed to monitor the change of products on an E-commerce site (www.eBay.com Auctions). The program extracts items from the www.eBay.com site and stores them in a Relational database. It is scheduled to check for updates in information on the site every twelve hours. Also, it monitors the stored data for the purpose of reporting updates to the users who request them. If any changes are found, the user is informed via electronic mail. Secondly, a semi automatic wrapper has been developed, which is a too! to assist system developer to wrap HTML pages into XML Documents. The tool helps in extracting the items of interest from an hind source page. The extracted data can he used by another application because it is stored in XML formal, which is well structured. The technique used in developing semi automatic wrapper is tag Set Progression Grid (TpGrid) which is the fingerprint representation of an hind page.

Description

Dissertation

Keywords

Web, Web wrapper, Data warehouse, Information and Communication Technology

Citation