Woogle4MediaWiki - From Searchers to Contributors

Woogle4MediaWiki: From Searchers to Contributors. Hans-Jörg Happel, FZI Karlsruhe, Germany. 2010-07-08 @ WikiSym 2010; Gdansk, Poland

Upload: hans-joerg-happel

Post on 12-May-2015


DESCRIPTION

This demo presents Woogle4MediaWiki, an extension for MediaWiki, which improves its core search functionality and introduces Wiki-style collaboration about information needs (each search query may have its own Wiki page) and result lists. Furthermore, aggregate statistics from the search logs are shown in pop-up windows attached to “red links” to guide users contributing new information.

TRANSCRIPT

Page 1: Woogle4MediaWiki - From Searchers to Contributors

Woogle4MediaWiki From Searchers to Contributors

Hans-Jörg Happel, FZI Karlsruhe, Germany

2010-07-08 @ WikiSym 2010; Gdansk, Poland

Page 2: Woogle4MediaWiki - From Searchers to Contributors

Agenda
• Motivation
• Features / Demo
• Outlook & Summary

Woogle4MediaWiki: From Searchers to Contributors - WikiSym 2010; Gdansk, Poland 2

Page 3: Woogle4MediaWiki - From Searchers to Contributors

MOTIVATION


Page 4: Woogle4MediaWiki - From Searchers to Contributors

Wikis
• Web-based repositories for sharing consensual knowledge in a collaborative fashion
• In enterprises, Wikis often act as a layer of "glue" code relating content from other information systems and capturing all the rest
  – Wikis are a typical entry point for information seeking
  – People are often confused about why and what to put into the Wiki

Page 5: Woogle4MediaWiki - From Searchers to Contributors

There are two kinds of people in a Wiki…

Readers
• Have information needs
  – But only few means for articulating such needs (e.g. search)
• Have collaboration needs
  – Search is an inherently social activity (40.4% of users interact with others before and during search; 58.7% share information after search [EC08])

Contributors
• Have limited resources (time/effort) and face further barriers (motivation, privacy)
• Choices on contributions are seldom based on demand of readers (lack of guidance)
• Wiki communities stress the relevance of content (“what is worth documenting”)

• Knowledge sharing is a communication process
• Artificial decoupling of readers and contributors in Wikis
• Only few readers become contributors

Page 6: Woogle4MediaWiki - From Searchers to Contributors

SEARCHING


Page 7: Woogle4MediaWiki - From Searchers to Contributors

• Search in MediaWiki
  – No query logs
  – “Red links” (links to non-existing pages) are the basic means to “demand” content
  – No collaboration

Page 8: Woogle4MediaWiki - From Searchers to Contributors

Woogle4MediaWiki
• Can replace MediaWiki built-in search or act as an additional/parallel search engine (user configurable!)
• Features
  – Search suggestions based on full text
  – Can index Wikitext or rendered text
  – Can index certain files (e.g. PDF)
  – Shows previous search attempts based on the query log
  – Social search (Wiki page per query, social ranking)
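The "previous search attempts" feature can be illustrated with a minimal sketch. It assumes a hypothetical in-memory `QueryLog` class and simple prefix matching; neither is taken from the Woogle code base, which may well store and match queries differently.

```python
from collections import Counter


class QueryLog:
    """Hypothetical in-memory query log; a real one would live in a database."""

    def __init__(self) -> None:
        self._counts: Counter = Counter()

    def record(self, query: str) -> None:
        # Normalise case and whitespace so near-identical attempts are merged.
        normalized = " ".join(query.lower().split())
        if normalized:
            self._counts[normalized] += 1

    def previous_attempts(self, prefix: str, limit: int = 5) -> list[str]:
        # Logged queries starting with the prefix, most frequent first.
        prefix = " ".join(prefix.lower().split())
        matches = [(q, n) for q, n in self._counts.items() if q.startswith(prefix)]
        matches.sort(key=lambda item: (-item[1], item[0]))
        return [q for q, _ in matches[:limit]]


log = QueryLog()
for q in ["install woogle", "Install  Woogle", "install lucene", "index pdf"]:
    log.record(q)

suggestions = log.previous_attempts("inst")
print(suggestions)
```

Ranking by frequency is one plausible choice; recency or click-through data from the log could be used instead.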

Page 9: Woogle4MediaWiki - From Searchers to Contributors

Every query can be a Wiki page…
• Allows for
  – Wiki text (“Social Did-You-Mean”)
  – Discussion page
  – Watches/Notifications (WoogleAlerts)
• Shows basic statistics about query
  – “How many people searched for that?”
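A rough sketch of the "one page per query" idea follows. The `Woogle` namespace is borrowed from the example URL on the backup slides; the title normalisation mimics MediaWiki's default behaviour (underscores for spaces, upper-case first letter), and the function names and the log format are assumptions made for illustration.

```python
QUERY_NAMESPACE = "Woogle"  # namespace as it appears in the deck's example URL


def query_page_title(query: str) -> str:
    """Map a search query to a wiki page title (hypothetical helper)."""
    words = query.strip().split()
    if not words:
        raise ValueError("empty query has no page")
    text = "_".join(words)
    # MediaWiki capitalises the first letter of a title by default.
    return f"{QUERY_NAMESPACE}:{text[0].upper()}{text[1:]}"


def query_statistics(log: list[tuple[str, str]], query: str) -> dict:
    """Answer "how many people searched for that?" from (user, query) pairs."""
    title = query_page_title(query)
    users = [user for user, logged in log if query_page_title(logged) == title]
    return {"page": title, "searches": len(users), "distinct_users": len(set(users))}


log = [
    ("alice", "lucene index"),
    ("bob", "Lucene  index"),
    ("alice", "lucene index"),
    ("carol", "pdf"),
]
stats = query_statistics(log, "lucene index")
print(stats)
```

Because each query resolves to a stable title, the usual wiki machinery (discussion page, watchlist) can attach to it without extra bookkeeping.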

Page 10: Woogle4MediaWiki - From Searchers to Contributors

Social Search
• Add, remove and re-rank query results
• Add comments
• Beyond that:
  – How to support truly collaborative information seeking (for researchers, lawyers etc.!)?
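Adding, removing and re-ranking results could be modelled as a thin layer over the engine's result list. The sketch below is an assumption about how such a layer might look; the class name, the vote-based ordering and the tie-breaking rule are illustrative and not taken from Woogle.

```python
from dataclasses import dataclass, field


@dataclass
class SocialResultList:
    """User edits layered on top of the search engine's results for one query."""

    engine_results: list[str]
    added: list[str] = field(default_factory=list)
    removed: set[str] = field(default_factory=set)
    votes: dict[str, int] = field(default_factory=dict)

    def add(self, page: str) -> None:
        if page not in self.engine_results and page not in self.added:
            self.added.append(page)
        self.removed.discard(page)  # re-adding undoes an earlier removal

    def remove(self, page: str) -> None:
        self.removed.add(page)

    def vote(self, page: str, delta: int) -> None:
        self.votes[page] = self.votes.get(page, 0) + delta

    def ranked(self) -> list[str]:
        candidates = [p for p in self.engine_results + self.added if p not in self.removed]
        # sorted() is stable, so pages with equal votes keep the engine's order.
        return sorted(candidates, key=lambda p: -self.votes.get(p, 0))


results = SocialResultList(["Install", "FAQ", "Changelog"])
results.add("Troubleshooting")
results.remove("Changelog")
results.vote("FAQ", 2)
results.vote("Troubleshooting", 1)
ranking = results.ranked()
print(ranking)
```

Keeping the engine results untouched and storing only the edits means the social layer can be reset or audited per query.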

Page 11: Woogle4MediaWiki - From Searchers to Contributors

Two flavours of Woogle

• Native PHP (using Zend Lucene Search)

• Can handle Wikis up to 50,000 pages (Wikipedia size) and serves normal queries in >1 sec

• Accesses Java (Lucene) backend via Webservices

• Supports searching additional sources (e.g. Ticket systems, SVN, other Wiki engines)

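Searching additional sources alongside the Wiki amounts to querying several backends and merging their hits. The following sketch assumes each source is a plain callable returning `(title, score)` pairs; the stand-in functions and the score-based merge are hypothetical and do not reflect the actual web service interface.

```python
from typing import Callable, Iterable

Hit = tuple[str, str, float]  # (source name, title, score)
SearchFn = Callable[[str], Iterable[tuple[str, float]]]


def federated_search(query: str, sources: dict[str, SearchFn], limit: int = 10) -> list[Hit]:
    """Query every source and merge the hits by descending score."""
    hits: list[Hit] = []
    for name, search in sources.items():
        for title, score in search(query):
            hits.append((name, title, score))
    # Assumes scores are comparable across sources; real systems usually normalise first.
    hits.sort(key=lambda hit: -hit[2])
    return hits[:limit]


# Stand-ins for real backends (Wiki index, ticket system); purely illustrative.
def wiki_search(query: str):
    return [("Build instructions", 0.9), ("Build FAQ", 0.4)]


def ticket_search(query: str):
    return [("Ticket 12: build fails", 0.7)]


merged = federated_search("build", {"wiki": wiki_search, "tickets": ticket_search})
for source, title, score in merged:
    print(f"{score:.1f}  [{source}] {title}")
```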

Page 12: Woogle4MediaWiki - From Searchers to Contributors

CONTRIBUTING


Page 13: Woogle4MediaWiki - From Searchers to Contributors

Contributing in MediaWiki

• Are Wikis really contribution-friendly?
  – “Red links” do not tell how much this information is required
  – The “empty-page” phenomenon (how to start an article)

Page 14: Woogle4MediaWiki - From Searchers to Contributors

Demand guidance
• Woogle uses queries and further metadata to qualify desired content
• Users are informed how strong the information need is, and in which context it arises
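To make "how strong the need is, and in which context it arises" concrete, here is a minimal sketch that aggregates a search log for a set of missing page titles. The record fields (`user`, `from_page`) and the exact-match rule between query and title are assumptions for illustration; the deck does not specify which metadata Woogle stores.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class LoggedSearch:
    query: str
    user: str
    from_page: str  # page the user was on when searching (assumed metadata)


def demand_summary(log: list[LoggedSearch], missing_titles: list[str]) -> dict:
    """Aggregate demand per missing page: strength and most common context."""
    wanted = {title.lower(): title for title in missing_titles}
    searches: Counter = Counter()
    users = defaultdict(set)
    contexts = defaultdict(Counter)
    for entry in log:
        title = wanted.get(entry.query.strip().lower())
        if title is None:
            continue  # query does not correspond to a missing page
        searches[title] += 1
        users[title].add(entry.user)
        contexts[title][entry.from_page] += 1
    return {
        title: {
            "searches": searches[title],
            "users": len(users[title]),
            "top_context": contexts[title].most_common(1)[0][0] if contexts[title] else None,
        }
        for title in missing_titles
    }


log = [
    LoggedSearch("Backup policy", "alice", "IT Handbook"),
    LoggedSearch("backup policy", "bob", "IT Handbook"),
    LoggedSearch("backup policy", "alice", "Main Page"),
    LoggedSearch("vacation", "carol", "HR"),
]
summary = demand_summary(log, ["Backup policy", "Release plan"])
print(summary)
```

A summary like this is the kind of data a pop-up on a red link could display to a prospective contributor.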

Page 15: Woogle4MediaWiki - From Searchers to Contributors

ADDITIONAL FEATURES


Page 16: Woogle4MediaWiki - From Searchers to Contributors

Statistics


Page 17: Woogle4MediaWiki - From Searchers to Contributors

Woogle Configuration


Page 18: Woogle4MediaWiki - From Searchers to Contributors

Preferences & Instrumentation
• Preferences might be defined globally or by each individual user
• Scientific instrumentation framework
  – Logs certain kinds of user interactions
  – Customize system/override configuration for experimental groups
  – Randomized group assignment
  – Informed consent (user acceptance dialog)
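The interplay of global preferences, per-user preferences and experimental overrides might be sketched as follows. The setting names, the two group names, the hash-based assignment and the precedence order (group overrides win, but only after consent) are all assumptions for illustration, not Woogle's documented behaviour.

```python
import hashlib

# Hypothetical settings; the real extension's option names are not given in the deck.
GLOBAL_DEFAULTS = {"show_need_popups": True, "suggestions": True}
GROUP_OVERRIDES = {"control": {"show_need_popups": False}, "treatment": {}}


def assign_group(user_id: str, groups: tuple[str, ...] = ("control", "treatment")) -> str:
    """Pseudo-random but stable assignment: the same user always gets the same group."""
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    return groups[int.from_bytes(digest[:4], "big") % len(groups)]


def effective_config(user_prefs: dict, group: str, consented: bool) -> dict:
    """Global defaults, then user preferences, then experimental overrides."""
    config = {**GLOBAL_DEFAULTS, **user_prefs}
    if consented:
        # Only users who accepted the participation dialog are reconfigured.
        config.update(GROUP_OVERRIDES.get(group, {}))
    return config


group = assign_group("alice")
participant = effective_config({"suggestions": False}, "control", consented=True)
non_participant = effective_config({"suggestions": False}, "control", consented=False)
print(group, participant, non_participant)
```

Hashing the user ID avoids storing the assignment, at the cost of not being able to rebalance groups later; drawing a random group once and persisting it is the obvious alternative.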

Page 19: Woogle4MediaWiki - From Searchers to Contributors

OUTLOOK & SUMMARY


Page 20: Woogle4MediaWiki - From Searchers to Contributors

Outlook
• Roadmap for Woogle4MediaWiki
  – Implement social search backend
  – Request feature
  – Support users creating redirects
  – Lightweight integration with SMW (i.e. using semantics in result snippets)
• Application of the same principles to Semantic MediaWiki (“Semantic Need” extension)

Page 21: Woogle4MediaWiki - From Searchers to Contributors

Summary
• Woogle addresses the artificial separation of information seeking and information provision in Wikis
  – Offers information seekers means to express needs
  – Guides contributors by aggregating demands
• Find downloads, demos, slides etc. at
  – http://www.mediawiki.org/wiki/Extension:Woogle4MediaWiki
  – http://www.teamweaver.org
  – http://amazonas.fzi.de/wooglenative/ (Demo)
  – Feedback & additional evaluation partners sought!
• Questions?

Page 22: Woogle4MediaWiki - From Searchers to Contributors

BACKUP SLIDES


Page 23: Woogle4MediaWiki - From Searchers to Contributors

(Media)Wikis can do better with Woogle
• Improve search with collaborative features
• Allow people to express information needs
• Provide a first-order representation for information needs
• Guide contributions by aggregating information needs (“need-driven knowledge sharing”)

Page 24: Woogle4MediaWiki - From Searchers to Contributors

Collaborative description of the information need

Notifications concerning search activities

Discussion page

Directly create new knowledge from the search dialog

Search has a URI and can be linked from within the Wiki (one page per search term)

Search across different data sources/types (WoogleRemote)

Additional clues with meta-information about a need

http://localhost/wiki/Woogle:Woogle4MediaWiki

Page 25: Woogle4MediaWiki - From Searchers to Contributors

Demand guidance


Page 26: Woogle4MediaWiki - From Searchers to Contributors

Literature
• [EC08] Evans, B. M. and Chi, E. H. 2008. Towards a model of understanding social search. In Proceedings of the ACM 2008 Conference on Computer Supported Cooperative Work (San Diego, CA, USA, November 08 - 12, 2008). CSCW '08. ACM, New York, NY, 485-494.
• [Hap08a] Hans-Jörg Happel: Growing the Semantic Web with Inverse Semantic Search. In Proceedings of the 1st Workshop on Incentives for the Semantic Web (INSEMTIVE '08).
• [Hap08b] Hans-Jörg Happel: Closing Information Gaps with Inverse Search. In Proceedings of the 7th International Conference on Practical Aspects of Knowledge Management (PAKM2008).
• [Hap09a] Hans-Jörg Happel: Woogle – On Why and How to Marry Wikis with Enterprise Search. In Proceedings of the 2nd Workshop on Integrated Knowledge Management Systems (IKMS2009).
• [Hap09b] Hans-Jörg Happel: Towards Need-driven Knowledge Sharing in Distributed Teams. In Proceedings of the 9th International Conference on Knowledge Management (I-KNOW 2009).
• [Hap09c] Hans-Jörg Happel: Social search and need-driven knowledge sharing in Wikis with Woogle. In Proceedings of the 5th International Symposium on Wikis and Open Collaboration (Orlando, Florida, October 25 - 27, 2009). WikiSym '09. ACM, New York, NY, 1-10.

Page 27: Woogle4MediaWiki - From Searchers to Contributors

Aggregate query & click logging


Page 28: Woogle4MediaWiki - From Searchers to Contributors

Implementation Trivia
• Search UI
  – Completely intercepts MW search process, independent of MW search interfaces
• Red Links
  – Realized via JavaScript/jQuery (parses red links, retrieves needs from backend batch-wise) to avoid patching MW
• Very basic implementation of access rights (via MediaWiki groups)
• Instrumentation addon
  – Allows re-configuring most system features based on user groups
  – Provides evaluation participation dialog
  – Logs various user behaviour data
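The batch-wise retrieval of needs for red links is done in JavaScript/jQuery according to the slide; the idea itself is language-independent and is sketched here in Python. The `lookup` callable stands in for the backend request, and the batch size of 20 is an arbitrary assumption.

```python
from typing import Callable, Iterator


def batches(items: list[str], size: int) -> Iterator[list[str]]:
    """Yield consecutive chunks of at most `size` items."""
    if size < 1:
        raise ValueError("batch size must be positive")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def fetch_needs(
    red_link_titles: list[str],
    lookup: Callable[[list[str]], dict[str, int]],
    batch_size: int = 20,
) -> dict[str, int]:
    """Ask the backend about all red links on a page with few round trips."""
    unique = list(dict.fromkeys(red_link_titles))  # drop duplicates, keep order
    needs: dict[str, int] = {}
    for chunk in batches(unique, batch_size):
        needs.update(lookup(chunk))
    return needs


calls: list[list[str]] = []


def fake_backend(titles: list[str]) -> dict[str, int]:
    # Stand-in for the real backend; records each request for inspection.
    calls.append(list(titles))
    return {title: len(title) for title in titles}


needs = fetch_needs(["A", "B", "A", "C"], fake_backend, batch_size=2)
print(calls, needs)
```

One request per batch rather than per link keeps page rendering cheap on pages with many red links.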

Page 29: Woogle4MediaWiki - From Searchers to Contributors

Big Picture

[Architecture diagram. Components: User 1 and User 2 with their documents, twIS Eclipse, twIS Eclipse RCP, JSF Frontend, Web Browser, Woogle, Push Indexer, twIS Backend (Public Index), twIS Backend 2, twIS Backend 3. Connection labels: DirectAuth, Auth via Wiki, Search, Crawl, Push Index.]

TeamWeaver Integrated Search: Simple Lucene-based toolkit for searching (crawling + Web and Eclipse frontends) different data sources (e.g. files, SVN, CVS, ticket systems, Wikis); plus provides DB-based query logging and partially semantic features (metadata extraction, ontology-based working context model in Eclipse)

Inverse Search is an Eclipse-based desktop search tool for selecting local/private documents for sharing based on a TeamWeaver Integrated Search backend

Woogle is an extension for MediaWiki, which can access a TeamWeaver Integrated Search backend. It leverages query statistics to guide knowledge sharing in the Wiki.

Woogle has further collaboration features upcoming (e.g. Google SearchWiki-style social ranking).