anita w. huang, neel sundaresan presented by allan spale – eecs 578
DESCRIPTION
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services. Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578. Introduction. The World Wide Web is a place for information and commerce - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/1.jpg)
Aurora: A Conceptual Model for Web-content Adaptation
to Support the Universal Accessibility of Web-based
ServicesAnita W. Huang, Neel Sundaresan
Presented by Allan Spale – EECS 578
![Page 2: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/2.jpg)
Introduction
• The World Wide Web is a place for information and commerce
• Electronic information distribution removes previous accessibility barriers– Flexible presentation of information
• The use of HTML on web pages removes its meaning and functionality
![Page 3: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/3.jpg)
Problems Using HTML
• Makes “comprehension, navigation, and input difficult or completely impossible”– Literal content
– Services
![Page 4: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/4.jpg)
Improving Web Accessibility
• Low-level accessibility– Provide alternatives to different media
types
• High-level accessibility– Make Web services in a service domain
accessible to a large audience
![Page 5: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/5.jpg)
Description of Aurora• Aurora provides high-accessibility
– Analyzes web objects according to their functions within a particular domain of Web pages
– Based on a transaction model• Provides a framework for encapsulating
general goals within a service domain• “[P]rovides a set of schemas that describes
how a user obtains the identified services”
![Page 6: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/6.jpg)
Transaction Model and XML
• Converts web data in service domains into XML
• XML data is input to interface adaptors
• Each interface adaptor creates the new Web page
![Page 7: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/7.jpg)
User Scenario• Blind user visits an on-line auction site• Semantic obstacles appear on the page
to “hide” information necessary to make a bid– Aurora can improve this situation
• Access the site using Aurora• Aurora will render the page in a format
acceptable to the user
![Page 8: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/8.jpg)
Electronic Information Accessibility
• Web Accessibility– “…add provisions to existing Web pages.”
– Focus on low-level issues• Provide alternate presentation forms for
different electronic media
![Page 9: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/9.jpg)
Electronic Information Accessibility
• Adaptive Hypermedia– Offers adaptive measures for “new Web-
based information systems”
– Challenges• Incorporating each user’s goals into a user
model
• Structuring of information to permit translation across presentation formats
![Page 10: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/10.jpg)
Electronic Information Accessibility
• Wrapper generation– Wrapper applications have two roles
• Extract information from data
• Reorganize the data into structured forms
![Page 11: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/11.jpg)
Aurora’s Transaction Model• Specifies the user’s abstract goals
• “Scrapes” information from the Web page relevant to the user goals
• Relevant to a specific service domain– Common services
– Sequences of tasks to receive services
– Declaration of specific steps to accomplish tasks
![Page 12: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/12.jpg)
Transaction Model Specifics
• Services– Analyze a service domain to determine “a
discrete, common set of abstract user goals”
• Transactions– Tasks to be done to receive a service
![Page 13: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/13.jpg)
Transaction Model Specifics
• Task Hierarchical Work-Flow Model– Create a node for each step
– Connect the nodes according to sequences of steps
– Label transitions between nodes where appropriate
![Page 14: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/14.jpg)
Web Content Classification
• Transaction model tracks each page’s function in relation to the user goal– Model applies to sets of web pages
• Transaction model used to transform content without altering Web pages
• Can provide additional structure to data at the source
![Page 15: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/15.jpg)
Benefits for Universal Usability
• Consistency– Currently web sites differ in many ways
from one another
– Transaction model reduces this problem• Consistent interface
• “[M]odel specifies a common set of goal-orientated transactions for each service domain”
![Page 16: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/16.jpg)
Benefits for Universal Usability
• Simplicity– Web pages usually contain some irrelevant
information in relation to user goals
– Solution• “Scrape” information from the Web page
according to the transaction model that encapsulates the user goal
![Page 17: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/17.jpg)
Benefits for Universal Accessibility
• Adaptability– Semantic information is implicit according
to its appearance
– Solution• Use a transaction model to extract functional
semantics and add semantic markup
• Interface adaptors take output from transaction model to create presentations for a user group
![Page 18: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/18.jpg)
XML Framework• eXtensible Markup Language used for
creating structured data• The transaction model that maps web
objects can be stored using a DTD (Document Type Definition)
• XML data will maintain the functional semantics which will allow interface adaptation
![Page 19: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/19.jpg)
Using DTDs for Translation Schemas
• Describe abstract tasks for every service goal
• Contains the semantics and sequence of task steps
• Together with the “scraped” web page data, Aurora can write the transaction document in XML
![Page 20: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/20.jpg)
Operation of AuroraUsing XML
• User requests a Web page
• Aurora downloads Web page and recreates the page– Downloads the web page
– Extracts information and objects
– Creates XML document using a DTD
• Aurora uses the interface adaptor on the XML document to create the new HTML page
![Page 21: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/21.jpg)
Example of User-Aurora Interaction
• User views Aurora-generated Web page of an auction site converted from XML to be displayed in HTML– This is a node in the current transaction
document– Current node is the item for bidding– Hyperlinks in the generated HTML page
lead to other nodes in the XML document
![Page 22: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/22.jpg)
Example of User-Aurora Interaction
• User selects a hyperlink– Each XML hyperlink will link to a Web page
and a transformation rule
– Aurora will download the web page and apply the transformation rule
– A hyperlink links to a downloaded HTML document and the extracted XML segment
![Page 23: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/23.jpg)
Example of User-Aurora Interaction
• Present the downloaded page to the user– XML segment serves as input to the interface
adaptor
– Aurora will use the XML segment as input to the interface adaptor
– The result is a displayable web page typically in HTML
• The process repeats for future interactions
![Page 24: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/24.jpg)
Aurora’s Method ofContent Extraction
• Uses PatML* XML transformation tool that…match[es] and
transform[s] patterns in XML documents– Three parts to a PatML transformation rule
* XML pattern to match (source)* Way to transform the matched pattern (target)* Java code block to invoke on the pattern (action)
*Items quoted directly from the paper
![Page 25: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/25.jpg)
Aurora’s Method ofContent Extraction
• A transaction step has one PatML rule– This rule will be used for all pages on a single
Web site
• Three parts of PatML rule, specifically– Source: matches HTML tag patterns
– Target: turns matched part into an XML part
– Action: gets the XML part and returns its output
![Page 26: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/26.jpg)
Aurora Architecture UsingWeB Intermediaries (WBI)
• “…enables applications to manipulate HTTP streams during a Web transaction.”
• Three components– Request editor
• Interface adaptor translates user actions
– Document generator• Downloads web pages, applies transformation rules to web
pages, returns XML parts
– Document editor• Interface adaptor adapts requested Web pages
![Page 27: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/27.jpg)
Extensible Architecture• Interface Adaptors
– “[S]ends XML data and receives user responses.”
– Transforms XML data into a low-level presentation format
– DTDs used to help generate additional semantic meaning
– Two types of adaptors• HTML text-only, Icon-enhanced HTML
![Page 28: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/28.jpg)
Extensible Architecture• Service Domains and Web Sites
– XML configuration document stores all service domain definitions
• Adding new domains or sites into a domain only involves editing the XML document
• Transcoding Engine– Aurora can use other “transcoding and/or
extraction technologies”• Currently uses PatML within its transformer interface
![Page 29: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/29.jpg)
Implementation Details• Java plug-in for “WBI using PatML as the
transcoding tool”• Schemas include auction and search engine
service domains• PatML rules written for specific sites
– Auctions: eBay, Yahoo! Auctions– Search Engines: AltaVista, Yahoo!, Google
• Uses previously mentioned interface adaptors
![Page 30: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/30.jpg)
Summary• Transaction Model
– Extracts semantics of web sites within selected service domains
– Uses XML to maintain structured data from Web pages
• Semantic Transcoding System– “Scrapes” and adapts web pages to help the
user accomplish abstract goals within an XML framework
![Page 31: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/31.jpg)
Summary
• Extensible Structure– Supports custom adapters that convert
XML data into some presentation format
• Improvements– Needs “to support semi-automated or
automated rule generation and maintenance”
![Page 32: Anita W. Huang, Neel Sundaresan Presented by Allan Spale – EECS 578](https://reader038.vdocuments.us/reader038/viewer/2022110104/5681679e550346895ddce66b/html5/thumbnails/32.jpg)
Resources• XML 1.0
– http://www.w3c.org/TR/REC-xml
• Web Content Accessibility Guidelines 1.0– http://www.w3c.org/wai-webcontent
• PatML– http://www.alphaworks.ibm.com/aw.nsf/techmain
/00ADAB375888BDD2882566F300703F7F?OpenDocument
• WeB Intermediaries (WBI) Development Kit– http://www.almaden.ibm.com/cs/wbi/doc