Exact: A queryable XML compressor
LE3 .A278 2009
Bachelor of Computer Science
Extensible Markup Language (XML) is a popular language for storing data and accessing data. Unfortunately, it includes a large amount of redundant information. Specialized XML compressors can compress this redundant data better than general purpose compressors. This thesis presents Exact, a novel technique for compressing and decompressing XML documents, as well as querying a compressed XML document in such a way that the entire document does not need to be decompressed. Exact uses the grammar of an XML document - when it is available and otherwise creates and stores similar information during the compression phase - to provide better compression. Exact gives the user the choice of getting a better compression rate or faster compression.
The author grants permission to the University Librarian at Acadia University to reproduce, loan or distribute copies of my thesis in microform, paper or electronic formats on a non-profit basis. The author retains the copyright of the thesis.