Font Size: a A A

Creation of encoding schemes to reduce markup language-based overhead

Posted on:2008-05-16Degree:Ph.DType:Thesis
University:Northcentral UniversityCandidate:Larson, Theodore LFull Text:PDF
GTID:2448390005479746Subject:Computer Science
Abstract/Summary:
The HTML and XML languages have a language based overhead. This quantitative study proposed several encoding methods of the contents of HTML and XML files. The files were parsed for similar words and encoded with shorter character representations. A Web server and Web client were developed to test the hypothesis that the encoding of HTML and XML files using these methods prior to transmission by the server and decoding prior to rendering by the client might produce a reduction in overall transmission time when compared with files not encoded. Presence of Zip compression and conversion to Binary using Huffman Compression were considered. The results suggested that while there was a reduction in the size of the files after encoding, no reduction in transmission time was found. The data suggest that there is an endemic cause, and further experimentation is suggested to find the source.
Keywords/Search Tags:Encoding, HTML, XML
Related items