Filtering compressed XML streams
LE3 .A278 2007
2007
Muldner, Tomasz
Acadia University
Master of Science
Masters
Computer Science
'Information Filtering' is the problem of extracting data we desire from a corpus of data. The task of 'filtering' depends heavily on the format of that corpus. For example, the corpus may be a stationary relational database, flat-file, collection of files, or, in the case of a news feed, may be continually 'streamed' to clients. In this research, we examine the problem of filtering data from a continual ' stream' that has been compressed with an 'online', XML-conscious compressor. We introduce a filtering system that leverages the format of compressed XML streams to provide 'subscription-oriented ' filtering of the data contained in the stream. Additionally, the system provides 'persistence' by efficiently storing the ' compressed' results in relational tables, allowing subscribers to received filtered content even if they are disconnected when filtering occurs. These features make our system useful in applications where XML data is continually streamed such as news-feeds, scientific document notification services or in XML routing applications.
The author retains copyright in this thesis. Any substantial copying or any other actions that exceed fair dealing or other exceptions in the Copyright Act require the permission of the author.
https://scholar.acadiau.ca/islandora/object/theses:3062