The web of what? Exploring the web of data with Dave Tarrant
TRANSCRIPT
Web of what?
Dr David Tarrant
@davetaz
The Open Data Institute
Content created by The Open Data Institute
Course aim
Explore how the web of data can revolutionise the way we use computers
Outcomes
Explain how to publish open data on the web of documents
Explain the importance of web based identifiers in data
Explain the difference between the web of documents and the web of data
Analyse the future of the web of data
How do we access open data?
The web as we know it
Data
Document & Data
http://bbc.co.uk/news
http://feeds.bbci.co.uk/news/rss.xml
How do we switch between document and data?
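To make the document/data contrast concrete, here is a minimal sketch of what a program can do once it has the data version of a page. It does not use the BBC's real feed: `SAMPLE_RSS` is a made-up two-item feed in the general RSS 2.0 shape, the `example.org` links are placeholders, and `headlines` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Made-up feed for illustration only; not the content of the BBC feed.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Example news</title>
    <item>
      <title>First headline</title>
      <link>http://example.org/news/1</link>
    </item>
    <item>
      <title>Second headline</title>
      <link>http://example.org/news/2</link>
    </item>
  </channel>
</rss>"""


def headlines(rss_text):
    """Return (title, link) pairs for every <item> in an RSS document."""
    root = ET.fromstring(rss_text)
    return [
        (item.findtext("title"), item.findtext("link"))
        for item in root.iter("item")
    ]


for title, link in headlines(SAMPLE_RSS):
    print(title, "->", link)
```

The same extraction from the HTML page would depend on how that page happens to be laid out, which is the practical difference between the two addresses.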
Identifiers
What is this the identifier for?
978-0471781172
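The number above appears to be in ISBN-13 form (an assumption; the slide does not say). A short sketch of the standard ISBN-13 check-digit rule shows how much a machine can learn from the identifier alone: it can confirm the number is well formed, but nothing in it says where to look the book up. `is_valid_isbn13` is a hypothetical helper name.

```python
def is_valid_isbn13(text):
    """Check an ISBN-13 using its check digit.

    Digits are weighted 1, 3, 1, 3, ... from the left; the weighted
    sum of all 13 digits must be a multiple of 10.
    """
    digits = [int(ch) for ch in text if ch.isdigit()]
    if len(digits) != 13:
        return False
    total = sum(d * (1 if i % 2 == 0 else 3) for i, d in enumerate(digits))
    return total % 10 == 0


print(is_valid_isbn13("978-0471781172"))
```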
Identifiers
What is this the identifier for?
http://swtrains.co.uk/train/1W75
Authority: swtrains.co.uk
Database table?: train
Identifier: 1W75
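The three labels on this slide (authority, database table, identifier) can be read straight off the URL. The sketch below splits a URL of this shape with Python's standard library; `describe_identifier` and the key name `collection` are hypothetical, and treating the first path segment as a table is the slide's own guess, not a rule of the web.

```python
from urllib.parse import urlsplit


def describe_identifier(uri):
    """Split a web identifier into authority, collection and local id."""
    parts = urlsplit(uri)
    segments = [s for s in parts.path.split("/") if s]
    return {
        # The host name: who is responsible for this identifier.
        "authority": parts.netloc,
        # First path segment, which may hint at a table or type.
        "collection": segments[0] if len(segments) > 1 else None,
        # Last path segment: the local identifier.
        "identifier": segments[-1] if segments else None,
    }


print(describe_identifier("http://swtrains.co.uk/train/1W75"))
```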
Dataset without identifiers
Payment Date | Expense Type | Expense Area | Supplier Name | Transactions Number | Amount (£)
12-May-10 | Assets Under Construction (Oracle PA Control Account) | NON CASH & UNALLOCATED COSTS | GIBS LTD | 1002189 | 251,542.26
12-May-10 | Corporate Strategy Consultancy | DG FINANCE | HAYS SPECIALIST RECRUITMENT LTD | 1163726 | 46.27
12-May-10 | VAT (Input) (Oracle Sub-Ledger Control Account) | NON CASH & UNALLOCATED COSTS | HAYS SPECIALIST RECRUITMENT LTD | 1163726 | 8,130.49
12-May-10 | Fco Healthcare Scheme | DG CHANGE & DELIVERY | HEALIX INTERNATIONAL LTD | 1163727 | 113,130.70
12-May-10 | Programme Spend (Oracle Projects Control Account) | DG EUROPE AND GLOBALISATION | THE CARBON TRUST | 1163753 | 34,500.00
12-May-10 | P.A.Y.E Income Tax (Oracle PAY Control Account) | NON CASH & UNALLOCATED COSTS | HM REVENUE & CUSTOMS | 4006772 | 4,000,783.16
12-May-10 | Rent and Condominium Charges Non-Residential | DG CHANGE & DELIVERY | COLLIERS MEREDITH & GREW / TECHNOLOGY AGENCY ACCT | 2000026560 | 38,948.10
13-May-10 | Programme Spend (Oracle Projects Control Account) | DG POLITICAL | NATO | 1001549 | 4,920,263.14
13-May-10 | Assets Under Construction (Oracle PA Control Account) | NON CASH & UNALLOCATED COSTS | GIBS LTD | 1001551 | 39,127.92
13-May-10 | Research Budget | UK TRADE AND INVESTMENT DIRECTORATE | ERNST & YOUNG LLP | 1163794 | 25,000.00
13-May-10 | Research Budget | UK TRADE AND INVESTMENT DIRECTORATE | ERNST & YOUNG LLP | 1163794 | 4,375.00
13-May-10 | Information Systems Maintenance | DG CENTRAL GROUP | MICROSOFT LIMITED | 1163825 | 385,500.00
13-May-10 | VAT (Input) (Oracle Sub-Ledger Control Account) | NON CASH & UNALLOCATED COSTS | MICROSOFT LIMITED | 1163825 | 67,462.50
Dataset with identifiers
Identifiers
What is this the identifier for?
https://opencorporates.com/companies/gb/01123045
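A rough sketch of how a supplier column could carry a web identifier instead of free text. It assumes the URL above follows the pattern `https://opencorporates.com/companies/<jurisdiction>/<company number>`, which is inferred from this single example only. `company_uri`, `add_supplier_identifier`, the `Supplier URI` column and the example row are all hypothetical, and the row is not taken from the spending dataset.

```python
def company_uri(jurisdiction, company_number):
    """Build a company URI in the shape shown on the slide (assumed pattern)."""
    return (
        "https://opencorporates.com/companies/"
        f"{jurisdiction.lower()}/{company_number}"
    )


def add_supplier_identifier(row, jurisdiction, company_number):
    """Return a copy of a spending row with a supplier URI column added."""
    enriched = dict(row)
    enriched["Supplier URI"] = company_uri(jurisdiction, company_number)
    return enriched


# Placeholder row for illustration; the supplier name is invented.
row = {"Supplier Name": "EXAMPLE SUPPLIER LTD", "Amount": "25,000.00"}
print(add_supplier_identifier(row, "gb", "01123045"))
```

With a URI in the row, two spellings of the same supplier name no longer matter: rows that share the identifier refer to the same company.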
Identifiers
What is this the identifier for?
http://id.southampton.ac.uk/building/32
Document or data?
Neither. It identifies the physical building, not the document or the data!
Identifiers
Building: http://id.southampton.ac.uk/building/32
Document: http://data.southampton.ac.uk/building/32.html
Data: http://id.southampton.ac.uk/building/32.rdf
How does a machine know to make the changes?
Negotiation
Same book
Different language
Different format (hardback, paperback, eBook)
Different size (pocket vs full)
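On the web, the equivalent of asking for the same book in a different format is HTTP content negotiation: the client names the format it wants in an `Accept` header. The sketch below only shows the client side. `ACCEPT`, `build_request` and `fetch` are hypothetical names, and whether the Southampton server answers each request with a redirect to the matching `.html` or `.rdf` address is an assumption, not something the slides confirm.

```python
from urllib.request import Request, urlopen

# Media types a client might ask for; the keys are labels used in this sketch.
ACCEPT = {
    "document": "text/html",
    "data": "application/rdf+xml",
}


def build_request(uri, want):
    """Prepare a request for one identifier, asking for a specific format."""
    return Request(uri, headers={"Accept": ACCEPT[want]})


def fetch(uri, want):
    """Send the request; return the final URL and the content type received.

    Needs network access, so it is defined here but not called.
    """
    with urlopen(build_request(uri, want)) as response:
        return response.geturl(), response.headers.get("Content-Type")


request = build_request("http://id.southampton.ac.uk/building/32", "data")
print(request.full_url, request.get_header("Accept"))
```

The identifier stays the same in both cases; only the header changes, which is how a machine can ask for data where a browser would ask for a document.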
Layers of the web
Physical
Documents
Data
http://5stardata.info/
5-Stars
5 star data
The future?
Thank-you
Dr David Tarrant
@davetaz
The Open Data Institute
Tools used
graphite.ecs.soton.ac.uk
jsonlint.com
Postman rest client for Google Chrome
Structured and Unstructured
Documents vs Data
For documents, the machine is told where to put different things on screen to suit humans. The output is very fixed.
Given data, the machine can decide how to use it and how to display it best without the need to be told explicitly by a human.
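As a small illustration of that point, the sketch below takes one record and lets the program choose between two presentations. The record, its field names and the helper names `as_text` and `as_html` are all made up for this example.

```python
from html import escape

# Invented record; in practice this would come from a data feed.
record = {"title": "Example headline", "published": "2010-05-12"}


def as_text(item):
    """Render the record as one line of plain text."""
    return f"{item['title']} ({item['published']})"


def as_html(item):
    """Render the same record as an HTML list item."""
    return (
        f"<li><time>{escape(item['published'])}</time> "
        f"{escape(item['title'])}</li>"
    )


print(as_text(record))
print(as_html(record))
```

Neither layout is stored in the data; each is decided by the code that consumes it, so a phone app, a screen reader and a spreadsheet could all present the same record differently.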
Data formats
XML
JSON
CSV
The occasional XLS
XLS and CSV can also be unstructured!
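A minimal sketch of the same record written in three of these formats and read back with Python's standard library. The record, its field names and the `from_*` helper names are invented for illustration; the values are kept as strings so the three results can be compared directly.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# One invented record, serialised three ways.
AS_JSON = '{"supplier": "Example Supplier Ltd", "amount": "25000.00"}'
AS_CSV = "supplier,amount\nExample Supplier Ltd,25000.00\n"
AS_XML = (
    "<payment><supplier>Example Supplier Ltd</supplier>"
    "<amount>25000.00</amount></payment>"
)


def from_json(text):
    """Parse a JSON object into a dict."""
    return json.loads(text)


def from_csv(text):
    """Parse the first data row of a CSV file with a header row."""
    return dict(next(csv.DictReader(io.StringIO(text))))


def from_xml(text):
    """Map each child element's tag to its text."""
    return {child.tag: child.text for child in ET.fromstring(text)}


print(from_json(AS_JSON) == from_csv(AS_CSV) == from_xml(AS_XML))
```

The CSV version only works because it has a header row and one value per cell; a CSV or XLS file laid out for human eyes, with titles and merged cells, would not parse this cleanly.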