web resources work

14
INTRODUCTION Web resource are what make up the World Wide Web. These documents are written in HTML (hypertext mark-up language) and are translated by your Web browser. Web pages can either be static or dynamic. All the information is returned as HTML code, so when the page gets to your browser, all the browser has to do is translate the HTML. As the number of web resources has increased explosively, organizing web resources has become an important topic for information professionals and researchers. Unlike traditional printed materials or electronic periodicals, most of web resources are not organized with traditional methods . The concept of a web resource is primitive in the web architecture, and is used in the definition of its fundamental elements. The term was first introduced to refer to targets of uniform resource locators (URLs), but its definition has been further extended to include the referent of any uniform resource identifier (RFC 3986), or internationalized resource identifier (RFC 3987). In the Semantic Web, abstract resources and their semantic properties are described using the family of languages based on Resource Description Framework (RDF). HISTORY The concept of a web resource has evolved during the web history, from the early notion of static addressable documents

Upload: dinesh-naik

Post on 18-Jul-2016

1 views

Category:

Documents


0 download

DESCRIPTION

about web resource

TRANSCRIPT

Page 1: Web Resources Work

INTRODUCTION

Web resource are what make up the World Wide Web. These documents are written in

HTML (hypertext mark-up language) and are translated by your Web browser. Web pages

can either be static or dynamic.

All the information is returned as HTML code, so when the page gets to your browser, all the

browser has to do is translate the HTML.

As the number of web resources has increased explosively, organizing web resources has

become an important topic for information professionals and researchers. Unlike traditional

printed materials or electronic periodicals, most of web resources are not organized with

traditional methods .

The concept of a web resource is primitive in the web architecture, and is used in the

definition of its fundamental elements. The term was first introduced to refer to targets of

uniform resource locators (URLs), but its definition has been further extended to include the

referent of any uniform resource identifier (RFC 3986), or internationalized resource

identifier (RFC 3987). In the Semantic Web, abstract resources and their semantic properties

are described using the family of languages based on Resource Description Framework

(RDF).

HISTORY

The concept of a web resource has evolved during the web history, from the early notion of

static addressable documents or files, to a more generic and abstract definition, now

encompassing every 'thing' or entity that can be identified, named, addressed or handled, in

any way whatsoever, in the web at large, or in any networked information system. The

declarative aspects of a resource (identification and naming) and its functional aspects

(addressing and technical handling) were not clearly distinct in the early specifications of the

web, and the very definition of the concept has been the subject of long and stil open debate

involving dif icult, and often arcane, technical, social, linguistic and philosophical issues.

Page 2: Web Resources Work

FROM DOCUMENTS AND FILES TO WEB RESOURCES:

In the early specifications of the web (1990–1994), the term resource is barely used at al .

The web is designed as a network of more or less static addressable objects, basical y files

and documents, linked using uniform resource locators (URLs). A web resource is implicitly

defined as something which can be identified. The identification deserves two distinct

purposes: naming and addressing; the latter only depends on a protocol

RFC 1738 (December 1994) further specifies URLs, the term 'Universal' being changed to

'Uniform'. The document is making a more systematic use of resource to refer to objects

which are 'available', or 'can be located

and accessed' through the internet. There again, the term resource itself is not explicitly

defined.

FROM WEB RESOURCES TO ABSTRACT RESOURCES :

The first explicit definition of resource is found in RFC 2396, in August 1998: 'A resource

can be anything that has identity. Familiar examples include an electronic document, an

image, a service (e.g., "today's weather report for Los Angeles"), and a col ection of other

resources .

METHODOLOGY

To examine whether users' collaborative terms can be applied to classify web resources, we

Selected a consumer health information domain on the web as a test set. As a dataset, we

Selected 34 consumer health information websites suggested by caphis (consumer and

Patient health information section). Tags were collected from delicious.com and the Selected

sites had at least fifty tags. Caphis offers a list of health information sites and Classifies them

into different groups, such as women, men, seniors, kids & parenting, and Drugs. The caphis

classification scheme, which is manually created, was used to assess the Performance of tag-

based classification. The collected data were trimmed by excluding both Extremely general

terms and specific terms. In total, 6416 unique terms were observed across The selected sites.

Among them, we first selected 654 terms that occurred at least five sites Simultaneously, and

then excluded 79 general terms that most frequently occurred across the Sites. Finally, 575

terms were selected in the analysis.

Page 3: Web Resources Work

A reduced set of variables is much Easier to analyze and interpret. In this study, we first

explored dimensions of the selected web .

Resources to ensure the possibility of tagging-based classification .Then, hierarchical

clustering was carried out to actually test the classification using user Terms, which is the

primary purpose of this study .

HOW WEB RESOURCES WORK ?

Web Resources rely on a special handler that is named WebResource.axd, which is designed

to retrieve assembly resources and serve them to the Web browser. The handler type for

WebResource.axd is Assembly Resource Loader.

When a request comes in from the client for Web Resource.axd, the handler looks for the

Web Resource identifier in the Query String method of the Request object. Based on the

value of the Web Resource identifier, the handler then tries to load the assembly that contains

this resource. If this operation is successful, the handler will then look for the assembly

attribute and load the resource stream from the assembly. Finally, the handler will grab the

data from the resource stream and send it to the client together with the content type that you

specify in the assembly attribute.

RESOURCES IN RDF AND THE SEMANTIC WEB

First released in 1999, RDF was first intended to describe resources, in other words to

declare metadata of resources in a standard way. A RDF description of a resource is a set of

triples (subject, predicate, object), where subject represents the resource to be described,

predicate a type of property relevant to this resource, and object can be data or another

resource.

The predicate itself is considered as a resource and identified by a URI. Hence, properties

like "title", "author" are represented in RDF as resources, which can be used, in a recursive

way, as the subject of other triples . RDF also specifies the definition of anonymous

resources or blank nodes, which are not absolutely identified by URIs .

Page 4: Web Resources Work

USING HTTP URIS TO IDENTIFY ABSTRACT RESOURCES :

URLs, particularly HTTP URIs, are frequently used to identify abstract resources, such as

classes, properties or other kind of concepts. Examples can be found in RDFS or OWL

ontologies.

Since such URIs are associated with the HTTP protocol, the question arose of which kind of

representation, if any, should one get for such resources through this protocol, typical y using

a web browser, and if the syntax of the URI itself could help to dif erentiate "abstract"

resources from "information" resources .

The URI specifications such as RFC 3986 left to the protocol specification the task of

defining actions performed on the resources and they don't provide any answer to this

question. It had been suggested that an HTTP URI identifying a resource in the original

sense, such as a file, document, or any kind of so-cal ed information resource, should be

"slash" URIs — in other words, should not contain a fragment identifier .

RESOURCE OWNERSHIP, INTELLECTUAL

PROPERTY AND TRUST :

In RDF, "anybody can declare anything about anything". Resources are defined by formal

descriptions which anyone can publish, copy, modify and publish over the web. If the content

of a web resource in the classical sense (a web page or on-line file) is clearly owned by its

publisher, who can claim intel ectual property on it, an abstract resource can be defined by an

accumulation of RDF descriptions, not necessarily control ed by a unique publisher, and not

necessarily consistent with each other. It's an open issue to know if a resource should have an

authoritative definition with clear and trustable ownership, and in this case, how to make this

description technical y distinct from other descriptions. A paral el issue is how intel ectual

property may apply to such descriptions.

Page 5: Web Resources Work

HOW IT HELP IN ORGANISATION

Provide product information to drive local sales 

Sell advertising.

Sell products or services directly over the Internet.

Provide customer service and support.

Build your brand.

Make effective your marketing campaign

Create automated mailing lists for maximum out reach.

Automate your back office functions for efficient documents storage and retrieve.

Conduct market research on products, end-user reviews, customer preferences.

ADVANTAGE OF W EB RESOURCE

Customers and prospects can download information and graphics directly, thus

reducing the need for printed matter.

The cost is only a fraction of that of traditional advertising media.

People love to share of information, jokes, cool websites and a variety of

miscellaneous "gems" of knowledge.

Quicker services delivery.

Knowledge about customer preferences .

DISADVANTAGE OF WEB RESOURCE

Loss of Data (less backups and keep your data secure).

Competition

Time

Accessibility

Cost

Page 6: Web Resources Work

CONCLUSION

This study examined whether user generated terms can be used to classify web resources.

We collected user terms from consumer health information sites, and classified them

using hierarchical clustering. By comparing the clustering result with the human-created

classification scheme, we found that the tagging-based clustering showed 100% precise

classification performance. Although this study conducted with a domain-specific

dataset, the 6 findings suggest that user-generated terms can be useful to automatically

organize web resources that are impossible to do manually due to size.

The PCA analysis reveals that the tag information clearly and quite precisely account for

the hidden structure of web resources. In this analysis, the collected tag information can

generate a reasonable clustering of the websites with appropriate quantitative methods.

Hierarchical clustering confirmed that user terms can classify web resources precisely

into the six groups predefined by human-created classification scheme (“women,”

“seniors,” “kids/parenting,” “drugs,” “men,” and “research.”). The results obtained by the

two analyses are closely akin to the classification of websites suggested by CAPHIS.

Therefore, it can be suggested that user’s contribution to describing online web resources

using tags has appropriate credibility in classification.

To expand its usefulness and examine which extend tagging is applied as an organizing

tool, further studies will be conducted for exploring more possibilities of user tag's

organizing web sources with a broader range of interest and terminology . Social tagging

as an organizing tool provide a simpler and easier way to classify web resources than

machine learning methods that require more data traffic, training process, and cost. This

study introduced two data mining methods that can be applicable to classify web

resources in a specific domain. Both PCA and hierarchical clustering methods validated

that social tags can be used as a tool to organize resources on the Web. The preliminary

findings showed that we can apply user-generated terms to automatically classify

resources on the Web. Our study demonstrates that a number of different statistical

methods can be applied to generate a reasonable clustering of web resources based on

tags, requiring less analysis or system training than machine learning techniques .

Page 7: Web Resources Work

REFERENCES

Jackson, J. E. (1991).A user's guide to principal components (Vol. 244). Wiley-

Interscience.

Kipp, M. E. I. (2005). Complementary or Discrete Contexts in Online Indexing: A

Comparison of User, Creator, and Intermediary Keywords. Canadian Journal of

Information and Library Science, 29(4):419–436.

Kipp, M. E. I. (2011). Tagging of Biomedical Articles on CiteULike: A

Comparison of User, Author and Professional Indexing. Knowledge Organization

38(3): 245-261.

Yoon, J. (2009). Towards a user-oriented thesaurus for non-domain-specific

image collections. Information Processing & Management 45(4): 452-468.

Xie, I. and Joo, S. (2012). Factors affecting the selection of search tactics: Tasks,

knowledge, process, and systems. Information Processing & Management, 48(2):

254-270.

Web Characterization Terminology & Definitions Sheet

(http://www.w3.org/1999/05/WCA-terms/), editors: Brian Lavoie and Henrik

Frystyk Nielsen, May 1999.

A Short History of "Resource" in web architecture.

(http://www.w3.org/DesignIssues/TermResource.html), by Tim Berners-Lee .

What do HTTP URIs Identify? (http://www.w3.org/DesignIssues/HTTP-

URI.html), by Tim Berners-Lee .