© ncsr, frascati, july 18-19, 2002 wp1: plan for the remainder (1) ontology ontology use of...

14
© NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology Use of PROTÉGÉ to generate ontology and lexicons for the 1 Use of PROTÉGÉ to generate ontology and lexicons for the 1 st st domain compatible to the current ones (RTV) (end of August) domain compatible to the current ones (RTV) (end of August) Enrich the lexicons with all the necessary lexical Enrich the lexicons with all the necessary lexical information (partners give the lexicons to RTV) (end of July) information (partners give the lexicons to RTV) (end of July) Use of PROTÉGÉ to generate ontology and lexicons for the 2 Use of PROTÉGÉ to generate ontology and lexicons for the 2 nd nd domain, taking into account the existing draft ontology (RTV) domain, taking into account the existing draft ontology (RTV) (end of August) (end of August) Report on ontologies (related research, use of PROTÉGÉ in Report on ontologies (related research, use of PROTÉGÉ in CROSSMARC, ontology maintenance task) (RTV, NCSR) (draft mid CROSSMARC, ontology maintenance task) (RTV, NCSR) (draft mid September, final end September) September, final end September) Corpus formation for the needs of page filtering Corpus formation for the needs of page filtering Corpus formation for the 1 Corpus formation for the 1 st st domain according to the domain according to the methodology agreed (partners send to NCSR) (end of August) methodology agreed (partners send to NCSR) (end of August) Corpus formation for the 2 Corpus formation for the 2 nd nd domain according to the domain according to the methodology agreed (partners send to NCSR) (end of September) methodology agreed (partners send to NCSR) (end of September) Report on corpus formation task (NCSR) (early October) Report on corpus formation task (NCSR) (early October) Web spidering (NCSR) Web spidering (NCSR) Finalise page filtering experiments for the 1 Finalise page filtering experiments for the 1 st st domain (mid- domain (mid- September) September) Finalise link scoring experiments for the 1 Finalise link scoring experiments for the 1 st st domain (end domain (end September) September) Finalise site navigator (end September) Finalise site navigator (end September) Documentation (end September) Documentation (end September)

Upload: erica-mcdonald

Post on 20-Jan-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP1: Plan for the remainder (1) OntologyOntology

Use of PROTÉGÉ to generate ontology and lexicons for the 1Use of PROTÉGÉ to generate ontology and lexicons for the 1stst domain domain compatible to the current ones (RTV) (end of August)compatible to the current ones (RTV) (end of August)

Enrich the lexicons with all the necessary lexical information (partners Enrich the lexicons with all the necessary lexical information (partners give the lexicons to RTV) (end of July)give the lexicons to RTV) (end of July)

Use of PROTÉGÉ to generate ontology and lexicons for the 2Use of PROTÉGÉ to generate ontology and lexicons for the 2ndnd domain, domain, taking into account the existing draft ontology (RTV) (end of August)taking into account the existing draft ontology (RTV) (end of August)

Report on ontologies (related research, use of PROTÉGÉ in Report on ontologies (related research, use of PROTÉGÉ in CROSSMARC, ontology maintenance task) (RTV, NCSR) (draft mid CROSSMARC, ontology maintenance task) (RTV, NCSR) (draft mid September, final end September)September, final end September)

Corpus formation for the needs of page filteringCorpus formation for the needs of page filtering Corpus formation for the 1Corpus formation for the 1stst domain according to the methodology agreed domain according to the methodology agreed

(partners send to NCSR) (end of August)(partners send to NCSR) (end of August) Corpus formation for the 2Corpus formation for the 2ndnd domain according to the methodology agreed domain according to the methodology agreed

(partners send to NCSR) (end of September)(partners send to NCSR) (end of September) Report on corpus formation task (NCSR) (early October)Report on corpus formation task (NCSR) (early October)

Web spidering (NCSR)Web spidering (NCSR) Finalise page filtering experiments for the 1Finalise page filtering experiments for the 1stst domain (mid-September) domain (mid-September) Finalise link scoring experiments for the 1Finalise link scoring experiments for the 1stst domain (end September) domain (end September) Finalise site navigator (end September)Finalise site navigator (end September) Documentation (end September)Documentation (end September)

Page 2: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP1: Plan for the remainder (2) Focused Crawling Tool Focused Crawling Tool

Use of existing search engines (EDIN) (end of August)Use of existing search engines (EDIN) (end of August) Language identification module (EDIN) (end of August)Language identification module (EDIN) (end of August) Use of the page filtering module (NCSR) (mid September)Use of the page filtering module (NCSR) (mid September) Evaluation methodology (EDIN) (early September)Evaluation methodology (EDIN) (early September) Evaluation results (EDIN) (end September) Evaluation results (EDIN) (end September) Report on the Methodology for focused crawling (EDIN) (end of Report on the Methodology for focused crawling (EDIN) (end of

September)September) Other tools for web pages collection Other tools for web pages collection

Meta-TIDY (NCSR, RTV, LingWay)Meta-TIDY (NCSR, RTV, LingWay) Documentation (RTV) (end of August)Documentation (RTV) (end of August)

Final version of the pre-demarcation toolFinal version of the pre-demarcation tool Evaluation for the 1Evaluation for the 1stst domain (NCSR) (mid-September) domain (NCSR) (mid-September) Documentation (NCSR) (end September)Documentation (NCSR) (end September)

Integration of all the tools involved in the collection Integration of all the tools involved in the collection process (NCSR) (end September)process (NCSR) (end September)

Page 3: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP1: Plan for the remainder (3) Corpus collection for the needs of NERC and FECorpus collection for the needs of NERC and FE

URL collection for the 2URL collection for the 2ndnd domain (partners send to domain (partners send to NCSR) (end of July)NCSR) (end of July)

Collection of web pages for the 2Collection of web pages for the 2ndnd domain according to domain according to the methodology (NCSR) (early September)the methodology (NCSR) (early September)

Report on the corpus collection task (NCSR) (mid Report on the corpus collection task (NCSR) (mid September)September)

Web Annotator final version for both domainsWeb Annotator final version for both domains Documentation Documentation (NCSR) (end September)(NCSR) (end September)

Other ToolsOther Tools Cross-merge final version, Cross-merge final version, documentation (NCSR) documentation (NCSR)

(early September)(early September) Deliverable D1.3 (NCSR)Deliverable D1.3 (NCSR)

Integration of the various reports and documentation Integration of the various reports and documentation (mid October)(mid October)

Page 4: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP2: Plan for the remainder (1) NERC DTDNERC DTD

Report on NERC DTD for both domains (EDIN) (mid September)Report on NERC DTD for both domains (EDIN) (mid September) Corpus annotation for the needs of NERCCorpus annotation for the needs of NERC

Corpus annotation for the 1Corpus annotation for the 1stst domain according to the annotation domain according to the annotation methodology agreed (Velti-EL, RTV-I, NCSR-F, EDIN-EN) (mid methodology agreed (Velti-EL, RTV-I, NCSR-F, EDIN-EN) (mid September for RTV, end August for the others) September for RTV, end August for the others)

Final NERC annotation guidelines for the 1Final NERC annotation guidelines for the 1stst domain (NCSR) (mid domain (NCSR) (mid September)September)

Report on corpus annotation task (NCSR) (end September)Report on corpus annotation task (NCSR) (end September) Corpus annotation for the 2Corpus annotation for the 2ndnd domain according to the annotation domain according to the annotation

methodology agreed (Velti-EL, RTV-I, Lingway-F, EDIN-EN) (mid methodology agreed (Velti-EL, RTV-I, Lingway-F, EDIN-EN) (mid November)November)

NERC v.2NERC v.2 Finalise NERC v.2 development (name normalisation and matching) Finalise NERC v.2 development (name normalisation and matching)

(ENERC, INERC, HNERC) (end September)(ENERC, INERC, HNERC) (end September) Finalise NERC v.2 development (name normalisation and matching) Finalise NERC v.2 development (name normalisation and matching)

(FNERC) (end October )(FNERC) (end October ) Evaluation results for the 1Evaluation results for the 1stst domain using the final corpus (end domain using the final corpus (end

September, end October for French) September, end October for French) Documentation (EDIN) (end October)Documentation (EDIN) (end October)

Page 5: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP2: Plan for the remainder (2) NERC-based demarcator (NCSR)NERC-based demarcator (NCSR)

New version for 1New version for 1stst domain (end September) domain (end September) Evaluation results for the 1Evaluation results for the 1stst domain (mid domain (mid

October)October) Documentation (end October)Documentation (end October)

Deliverable D2.3 (EDIN)Deliverable D2.3 (EDIN) Integration of the various reports and Integration of the various reports and

documentation (end October)documentation (end October)

Page 6: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP3: Plan for the remainder (1) FE schemaFE schema

FE schema for the 2FE schema for the 2ndnd domain (RTV) (end August) domain (RTV) (end August) Report on FE schema for both domains (RTV) (mid Report on FE schema for both domains (RTV) (mid

September)September) Corpus annotation for the needs of FECorpus annotation for the needs of FE

Corpus annotation for the 1Corpus annotation for the 1stst domain according to the domain according to the annotation methodology agreed (Velti-EL, RTV-I, annotation methodology agreed (Velti-EL, RTV-I, NCSR-F, EDIN-EN) (early October)NCSR-F, EDIN-EN) (early October)

Final FE annotation guidelines for the 1Final FE annotation guidelines for the 1stst domain domain (NCSR) (mid October)(NCSR) (mid October)

Report on corpus annotation task (NCSR) (mid Report on corpus annotation task (NCSR) (mid October)October)

Corpus annotation for the 2Corpus annotation for the 2ndnd domain according to the domain according to the annotation methodology agreed (Velti-EL, RTV-I, annotation methodology agreed (Velti-EL, RTV-I, Lingway-F, EDIN-EN) (mid December)Lingway-F, EDIN-EN) (mid December)

Page 7: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP3: Plan for the remainder (2) Wrapper InductionWrapper Induction

STALKER-based wrapper induction (NCSR)STALKER-based wrapper induction (NCSR) Evaluation results for the 1Evaluation results for the 1stst domain and 4 languages (EL, F end of domain and 4 languages (EL, F end of

August, En mid September, I early October)August, En mid September, I early October) Provide the trained monolingual modules to the partners (same dates)Provide the trained monolingual modules to the partners (same dates) Documentation (mid October) Documentation (mid October)

Boosted Wrapper Induction (EDIN)Boosted Wrapper Induction (EDIN) Evaluation results for the 1Evaluation results for the 1stst domain and 4 languages (mid September domain and 4 languages (mid September

for 3 languages, early October for Italian)for 3 languages, early October for Italian) Provide the trained monolingual modules to the partners (same dates)Provide the trained monolingual modules to the partners (same dates) Documentation (mid October)Documentation (mid October)

WHISK (RTV)WHISK (RTV) Evaluation results for the 1Evaluation results for the 1stst domain and 4 languages (check Greek, domain and 4 languages (check Greek,

system end September, mid October)system end September, mid October) Provide the trained monolingual modules to the partners (end Provide the trained monolingual modules to the partners (end

October)October) Documentation (end October)Documentation (end October)

Page 8: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP3: Plan for the remainder (3) Handling of images Handling of images

Report on the techniques that can be used Report on the techniques that can be used (NCSR, Lingway)(NCSR, Lingway) (draft end September, final (draft end September, final end October)end October)

version 0.x of the techniques (Lingway, NCSR)version 0.x of the techniques (Lingway, NCSR) Evaluation for the 1Evaluation for the 1stst domain (Lingway, NCSR) domain (Lingway, NCSR)

Deliverable D3.1 (NCSR)Deliverable D3.1 (NCSR) Integration of the various reports and Integration of the various reports and

documentation (end October)documentation (end October)

Page 9: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP4: Plan for the remainder (1) System ArchitectureSystem Architecture

Specify the multi-agent aspects of the architecture (Velti, NCSR, Specify the multi-agent aspects of the architecture (Velti, NCSR, EDIN, RTV, Lingway) (end July draft for discussion)EDIN, RTV, Lingway) (end July draft for discussion)

Report on the refined architecture (Velti) (end September)Report on the refined architecture (Velti) (end September) End-User InterfaceEnd-User Interface

Help texts, localisation, End-user interface for the 1Help texts, localisation, End-user interface for the 1stst prototype prototype (Velti) (end September)(Velti) (end September)

Link to the web page from which information was extracted Link to the web page from which information was extracted (NCSR sends draft for discussion by end of July) (NCSR sends draft for discussion by end of July)

Enrichment of the products database (Velti) (early October) Enrichment of the products database (Velti) (early October) Separate the UI (data server, XML server) (Velti) (end September)Separate the UI (data server, XML server) (Velti) (end September) Documentation (Velti) (early October)Documentation (Velti) (early October)

Page 10: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP4: Plan for the remainder (2) System IntegrationSystem Integration

IE Remote Invocation (NCSR) (end August)IE Remote Invocation (NCSR) (end August) DB Inserter (EDIN) (conversion to Java) (end DB Inserter (EDIN) (conversion to Java) (end

September)September) Integrate Web pages collection with IE remote Integrate Web pages collection with IE remote

invocation, DB insertion (NCSR) (end September)invocation, DB insertion (NCSR) (end September) Documentation for the 1Documentation for the 1stst integrated prototype (Velti) integrated prototype (Velti)

(mid October)(mid October) EvaluationEvaluation

Evaluation report from SCIE event (Velti) (end July)Evaluation report from SCIE event (Velti) (end July) Other evaluation tasks – reports (by EDIN students and Other evaluation tasks – reports (by EDIN students and

RTV students, during October)RTV students, during October) Final report for the evaluation of the 1Final report for the evaluation of the 1stst prototype prototype

(Velti) (end October)(Velti) (end October)

Page 11: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP4: Plan for the remainder (3) Deliverable D4.2 (Velti)Deliverable D4.2 (Velti)

Integration of the various reports and Integration of the various reports and documentation (end October)documentation (end October)

Page 12: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP5: Plan for the remainder (1) Contract amendment signContract amendment sign Consortium agreement signConsortium agreement sign Management reportsManagement reports

55thth Quarterly report (missing data send by partners to Quarterly report (missing data send by partners to NCSR after the meeting, NCSR sends final to EC end NCSR after the meeting, NCSR sends final to EC end of July) of July)

33rdrd Cost statement (March-August 2002) (partners send Cost statement (March-August 2002) (partners send input to NCSR mid September, NCSR sends the final to input to NCSR mid September, NCSR sends the final to EC end September)EC end September)

33rdrd Semestrial Report (partners send input to NCSR mid Semestrial Report (partners send input to NCSR mid September, NCSR sends the final to EC end September, NCSR sends the final to EC end September)September)

66thth Quarterly report (partners send input to NCSR mid Quarterly report (partners send input to NCSR mid September, NCSR sends the final to EC end September, NCSR sends the final to EC end September)September)

Page 13: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP5: Plan for the remainder (2) Formation of a user group. Final decisions Formation of a user group. Final decisions

will be taken during next meeting. NCSR will be taken during next meeting. NCSR will initiate a discussion.will initiate a discussion.

Publication in Conferences, Journals … Publication in Conferences, Journals … 2003 Intern. Conference on "Intelligent Agents, 2003 Intern. Conference on "Intelligent Agents,

Web Technologies and Internet Commerce", Web Technologies and Internet Commerce", WWW, IJCAI, ACL, ???????WWW, IJCAI, ACL, ???????

Participation in events Participation in events LangTech 2002LangTech 2002 IST-2002IST-2002 ………………

Page 14: © NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain

© NCSR, Frascati, July 18-19, 2002

WP5: Plan for the remainder (3) Updated Deliverable 5.2 “Exploitation and Updated Deliverable 5.2 “Exploitation and

Use Plan” (end October) Use Plan” (end October) Date and place of next meeting (Paris, 5-6 Date and place of next meeting (Paris, 5-6

December 2002) December 2002)