A Transcoding Proxy for HTML Web Pages: Web Page Sampling and Conversion Evaluation
Andrew Stone
CS525m
Worcester Polytechnic Institute
Overview
• Proxy Goal and Scope
• Related Work
• Project Scope
• Testing Methodology
• Demo
• Conclusions
• Future Work
Proxy Goal
• Reduce data traffic
  – Get content displayed faster
  – Save bandwidth (and money)
  – Reduce power consumption
• Change content to suit device
  – Browser properties
Related Work
• HTML to WML Transcoding Proxy
  – http://zoo.cs.yale.edu/classes/cs490/00-01b/dugas.robert.rfd8/rfd8cs490.pdf
• iMobile EE
  – http://portal.acm.org/citation.cfm?id=778492&coll=portal&dl=ACM&CFID=71256236&CFTOKEN=91425173
• RSVP Browser
  – http://portal.acm.org/citation.cfm?id=591429&coll=portal&dl=ACM&CFID=71256236&CFTOKEN=91425173
• Navigating a Mobile XHTML App
  – http://portal.acm.org/citation.cfm?id=642669&coll=portal&dl=ACM&CFID=71256236&CFTOKEN=91425173
• http://www.skweezer.net
Project Scope
• Create component to transcode web pages using HTML Tidy and XML Stylesheets
• Measure web page size reduction
• Evaluate web page readability on PC with IE and Firefox and on Windows Mobile 5 Pocket IE
[Figure: proxy architecture diagram. Labels: Issue Get Request, Proxy, Get Request, Internet, Return Content, HTML Tidy, xHTML, XSLT Transform, Transformed Content]
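The stages named in this diagram (fetch content, clean it up into well-formed XHTML, transform it, return the result) can be sketched in a few lines. This is a minimal illustration, not the project's implementation: it uses only the Python standard library, so `html.parser` stands in for HTML Tidy and a hand-written tree filter stands in for the XSLT stylesheet. The names `transcode`, `strip_elements`, `VOID` and `DROP`, and the choice of which elements to drop, are assumptions made for the example.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

VOID = {"br", "img", "hr", "meta", "link", "input"}  # elements with no end tag
DROP = {"script", "style"}  # hypothetical choice of elements to strip


class _Builder(HTMLParser):
    """Builds a well-formed element tree from loose HTML (stand-in for HTML Tidy)."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.root = ET.Element("html")
        self.stack = [self.root]  # currently open elements
        self.last = None          # last closed or void child, receives tail text

    def handle_starttag(self, tag, attrs):
        if tag == "html":
            return  # the root element already exists
        el = ET.SubElement(self.stack[-1], tag, {k: v or "" for k, v in attrs})
        if tag in VOID:
            self.last = el
        else:
            self.stack.append(el)
            self.last = None

    def handle_endtag(self, tag):
        # Close the nearest matching open element; stray end tags are ignored.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                self.last = self.stack[i]
                del self.stack[i:]
                break

    def handle_data(self, data):
        if self.last is not None:
            self.last.tail = (self.last.tail or "") + data
        else:
            top = self.stack[-1]
            top.text = (top.text or "") + data


def strip_elements(root, drop=DROP):
    """Remove unwanted elements but keep the text that follows them
    (stand-in for the XSLT transform)."""
    for parent in list(root.iter()):
        prev = None
        for child in list(parent):
            if child.tag in drop:
                if child.tail:
                    if prev is None:
                        parent.text = (parent.text or "") + child.tail
                    else:
                        prev.tail = (prev.tail or "") + child.tail
                parent.remove(child)
            else:
                prev = child
    return root


def transcode(html):
    """Clean up loose HTML, apply the transform, and return serialized markup."""
    builder = _Builder()
    builder.feed(html)
    builder.close()
    return ET.tostring(strip_elements(builder.root), encoding="unicode")
```

Calling `transcode("<body><p>Hi<br>there<script>x()</script> now</body>")` returns well-formed markup with the unclosed `<p>` closed and the `<script>` element removed. A real proxy would fetch the page first and would use the actual HTML Tidy and XSLT tools in place of these stand-ins.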
Web Page Reduction
• Data Set: 5852 pages from 403 domains
  – From Paul Timmins and Sean McCormick’s “Characteristics of Today’s Mobile Web Content”
• HTML Tidy produced 2730 transformed pages
  – 2417 successful XSL Transformations from 266 domains
• Before
  – Average Page Size including images: 46.9 KB
  – Average Page Size excluding images: 23.3 KB
• After
  – Average Page Size including images: 43.0 KB
  – Average Page Size excluding images: 19.4 KB
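The figures on this slide are easier to compare as percentages. The short script below derives them from the numbers above; the helper names `percent` and `reduction_percent` are illustrative only, and nothing here is new data.

```python
# Figures copied from the Web Page Reduction slide.
TOTAL_PAGES = 5852
TIDY_OK = 2730   # pages HTML Tidy transformed
XSLT_OK = 2417   # pages with a successful XSL transformation

SIZES_KB = {  # label: (average before, average after)
    "including images": (46.9, 43.0),
    "excluding images": (23.3, 19.4),
}


def percent(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


def reduction_percent(before, after):
    """Return the size reduction from before to after as a percentage."""
    return percent(before - after, before)


if __name__ == "__main__":
    print("HTML Tidy success rate:", percent(TIDY_OK, TOTAL_PAGES), "%")
    print("XSLT success rate (of tidied pages):", percent(XSLT_OK, TIDY_OK), "%")
    print("End-to-end success rate:", percent(XSLT_OK, TOTAL_PAGES), "%")
    for label, (before, after) in SIZES_KB.items():
        print("Reduction", label + ":", reduction_percent(before, after), "%")
```

With these inputs, HTML Tidy handled about 46.7% of the pages and about 41.3% made it through both stages. The average saving is 3.9 KB in both rows, which is about 16.7% of the markup alone but only about 8.3% once images are counted.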
Web Page Layout Demo
Conclusions
• Real gains are in image manipulation
• ~50% of web pages have non-standard HTML or could not be processed by HTML Tidy
• Another HTML fixing tool should be tested
• Image compression should be evaluated