index [ptgmedia.pearsoncmg.com]ptgmedia.pearsoncmg.com/images/0201844524/index/chaudhri... · 2009....

48
Index 601 &= operator, 49, 65 |= operator, 49, 65 % wildcard, 128 _ wildcard, 128 3-tuple identifiers, 58 4SuiteServer, 477 A Abstract DTD, 473 Abstract view, 422 AbstractFeatureType element, 368 accessPredicate element, 31 ACS (attitude control system), 380 Ad hoc queries, 170, 248–252, 345 Ad hoc SQL, 225 Adabas, 23 Adapters, 377–378 auto-configuration, 388 automatically generated, 391 building with generated mapping code, 392–393 drill-down methods, 388 as executables, 392 formalizing notation describing, 392 interaction support for communication, 389 interpreting specifications, 392 quality of service support, 395 to support component interactions, 389–392 Adaptive spacecraft software, 380–382 AddDocument() function, 263, 266, 284 addNode method, 162 address element, xxv //address expression, 280 Address_Type object, 137 AddSchema() function, 263, 266 add_..._section procedures, 131–132 Adenine (A), 293 Adjacency List Model, 195 ADLs (Architecture Description Languages), 389, 393–394 ADO, 167, 169–170 AGE attribute, 427, 432 @AGE expression, 427 Aggregation queries, 515 Chaudhri_Index.qxd 2/11/03 12:04 PM Page 601

Upload: others

Post on 19-Feb-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

  • Index

    601

    &= operator, 49, 65|= operator, 49, 65% wildcard, 128_ wildcard, 1283-tuple identifiers, 584SuiteServer, 477

    AAbstract DTD, 473Abstract view, 422AbstractFeatureType element, 368accessPredicate element, 31ACS (attitude control system), 380Ad hoc queries, 170, 248–252, 345Ad hoc SQL, 225Adabas, 23Adapters, 377–378

    auto-configuration, 388automatically generated, 391building with generated mapping code,

    392–393

    drill-down methods, 388as executables, 392formalizing notation describing, 392interaction support for communication, 389interpreting specifications, 392quality of service support, 395to support component interactions, 389–392

    Adaptive spacecraft software, 380–382AddDocument() function, 263, 266, 284addNode method, 162address element, xxv//address expression, 280Address_Type object, 137AddSchema() function, 263, 266add_..._section procedures, 131–132Adenine (A), 293Adjacency List Model, 195ADLs (Architecture Description Languages),

    389, 393–394ADO, 167, 169–170AGE attribute, 427, 432@AGE expression, 427Aggregation queries, 515

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 601

  • 602 Index

    Aliases, starting with at sign (@), 136AllChildren table, 468Alternative splicing of the mRNA, 304Amino acid sequence, 312Amino acids, 294–295, 295, 317Ancestor nesting in ancestor-descendant

    selection queries, 512Ancestor-descendant complex pattern selection

    queries, 514Ancestor-descendant selection queries, 512and logical operator, 158AND (&) operator, 128Annotating schemas, 27–28, 182–183Apache, 149Apache Web site, 192APB-1 benchmark, 478–479APIs

    based on XQuery, 114–115Tamino XML Server, 22–23

    appinfo element, 34APPLICANT element, 430, 433Application-level benchmarks, 500Applications

    ADLs (Architecture Description Language),393–394

    building for embedded XML databases, 73component integration, 375–376components, 390connectors, 390constructing from reusable components, 388DB2 support, 111–112DOM-compliant, 194easily moving across environments, 388embedding XML database, 72eXist development of, 50–56local joins of table data, 112rapid development of, 41reclaiming log records space, 86

    Approval_Status.xml document, 332–333APPROVED flag, 332

    AreaFeatureType type, 368, 372arithmetic operators, 158AS3AP benchmark, 478ASSOCIATING clause, 407Association rules

    aggregation, 406binary model, 414confidence, 405, 414defining, 405, 413–414EVALUATE RULE operator, 406–408extracting and evaluating, 404–408extraction of (XDM), 436–438frequency, 414MINE RULE operator, 405–406PMML model, 418support, 405XDM, 435–447

    AssociationModel element, 418AssociationRules element, 418AssociationRulesData XDM database, 435–447ASSOCIATION-RULE-SET node, 438, 440atomic value, 271AT&T Labs, 541Attribute table, 468attribute table, 525attribute_name table, 201, 226–227Attributes, 5, 6, 307

    applying to all data elements, 7data portion, 10long values, 202mapping names to node identifiers, 62metadata, 7, 10names as tags, 145names exceeding maximum length, 235object types, 136scope, 8starting with at sign (@), 136storing names, 201units, 14used as data elements, 10–11

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 602

  • Index 603

    attribute_value_leaf table, 202, 227author element, 466authorspublications view, 466AUTO_SECTION_GROUP section group, 131Axis SOAP toolkit, 52

    BBase pairs, 293BaseType element, 505–506Basket analysis applications, 402BDGP (Berkeley Drosophila Genome

    Project), 299BEA, 4Benchmark Handbook, The (Gray), 478Benchmarks

    application-level, 500currently proposed, 499data set, 479–482, 500database disk space evaluation, 539database results, 539–540database storage evaluation, 536–538design of, 479directory servers, 535existing XML, 482–497generating synthetic XML data, 501guidelines for databases, 545native XML databases, 535object-oriented databases, 534–535portability, 478queries, 480–482relational databases, 534relevance, 478scalability, 478simplicity, 478specification, 478–479test results of database storage, 536–540Wisconsin benchmark, 500X007, 482–488, 501–502

    XMach-1, 488–490, 501XMark, 491–497, 501XML data, 499–501XML databases, 544–545, 549–550XML document storage, 533–535

    Berkeley DB library, 74Berkeley DB tables, 74Berkeley DB XML applications recovery

    processing after failures, 86Berkeley DB library, 74Berkeley DB tables, 74configuration settings, 75container within environment, 75creating index on collection, 76

    environment, 75indexing containers, 75–76indexing strategies, 76–77Java language bindings, 74opening and using container, 73overview, 73–74running query against collection, 79–80statistical index information, 74threading packages, 85transactional updates, 80–81XPath query processing, 78–80

    Best-effort service networks, 384bib2html.sxl file, 54Binary association rules, 414Bioengineering Institute at the University

    of Auckland, 299Bioinformatics, 291, 296, 311Biological information complexity, 307BIOML (BIOpolymer Markup Language), 298BIOML Web site, 298BIOXML Web site, 299BLAST (Basic Local Alignment Search

    Tool), 312complex query using tBLASTx option, 317integrating into NeoCore XMS, 311–319tBLASTx search, 317–318

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 603

  • 604 Index

    BLASTp, 316blocking operation, 264–265BODY node, 440BODY-SCHEMA element, 437, 443boundedBy element, 368Broader Term operator, 128BSML (Bioinformatic Sequence Markup

    Language), 298B+-tree classes, 61BUCKY benchmark, 479Bulkload object, 186–187Bulkloading, 186Business information systems, 261–262Business process integration support, 112

    CCache sizing, 84California Institute of Technology Web site, 299Callback functions, 33Capabilities XML document, 363CARD element, 437CAR-INSURANCE element, 424, 426, 430Carriage return/newline characters, 239CAR-TYPE attribute, 427, 432@CAR-TYPE expression, 427C/C++ XDK (XML Development Kit), 160CDATA, 220CDATA sections, 204, 239cdata_leaf table, 204, 227, 229CDATASection table, 524–526Celera Genomics, 292Celko, Joe, 195Cell type category, 303CellML, 299CellML Web site, 299Center for Computational Biology, 310Center for Computational Biology Web site, 310Character data, whitespaces as, 238–239

    characters() method, 237–238Checkpoints, 85–86Chemical signaling, 295Children table, 468Cities table, 31City doctype, 29, 33City element, 31/City/Monument/Name query, 38–39Class Generator, 161–162Classification

    rule set, 409–410with XDM, 423–435

    Classification tree, 402, 409ClassificationData XDM database, 423–425CLASSIFICATION-MODEL element, 431Classification_Model table, 411CLASSIFICATION-TREE element, 429CLASSIFICATION-UNIT element,

    426, 427, 431Classified_Applicants table, 412CLASSIFIED-NEW-APPLICANTS element, 433Classifiers, 401–402Classifying data

    classification rule set, 409–410classification tree, 409for data mining, 408–413DEFAULT rule, 410test phase, 408, 409, 412–413training phase, 408, 409–411training set, 408–409

    CLASS-PARAM element, 427, 429CLASSROOM_RECORD table, 329Client/server databases, 71, 72, 82

    checkpoints, 85–86indexing, 76optimizing queries, 76recovery processing after failures, 86

    CLOB document, maintaining link to, 160CLOB system data type, 103CLOB tables, maintaining, 163

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 604

  • Index 605

    CLOBscomparisons, 126storing XML as, 126–132substring search, 126

    ClobTab table, 160Clustering applications, 402Clustering procedures, 402Cocoon, 43, 47, 54–56Cocoon Web site, 54Codons, 295, 314Collation facet, 34collection() function, 48Collections

    doctypes, 25documents included inquiry evaluation, 48eXist, 44mapping, 136searching with indexes, 75–76XML Schema descriptions, 25XML:DB API, 50

    collections.dbx file, 61Collins, Francis, 292Colorado University Health Science Center, 310Colorimeters, 5command-line testing, 213–214comment_leaf table, 204, 227–228comment() method, 242–244Commit, 33Communications satellite

    constellations, 379–380spinning (at night) mode, 381

    COMPARE() function, 273–274Complete view, 422Complex pattern selection queries, 513Component integration, 375–376Components, 390

    adaptation, 381–382adapters, 377–378configuration language, 388difficulty of reuse, 378

    integration with architecture, 383–384multiple interactions between, 391as query terms, 15reducing cost of development, 376–377reusability, 376, 377–378, 392services, 390solving interface problems, 383

    Composite elements, xxvComposite views, 458–459Compositors, xxvComputational Pharmacology Group, 310Concrete DTD, 473Condensed structure index, 36CONDITION element, 429condition element, 464Conditions, describing, 464Confidence, 405, 414CONFIDENCE element, 440, 446Configuration language, 388, 389–392Connect request, 38connection element, 462Connections, multiple interactions between

    components, 391connect() method, 214–215Connectors, 390Constraints, xxviiContainer-managed beans, 346Containers

    common queries, 75indexing, 75–77opening and using, 73–74

    CONTAINS function, 127–131contains operator (~), 37Content, differentiating between types, 237–238Contention, 84–85Context list, 7–8Control actors, 304 tag, 154&copyright entity reference, 238CORBA, 383, 397

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 605

  • 606 Index

    CREATE FEDERATED VIEW statement, 112CREATE NICKNAME statement, 112CREATE SERVER statement, 112CREATE WRAPPER statement, 112createXML method, 132, 133, 135create_xmlrep_db.sql script, 192, 197, 225Crick, Frances, 291CTXSYS.context index type, 131Cursor to walk over results set, 256–257Custom complex types, xxvCUSTOMER attribute, 443Customer_Type object type, 139CustOrd.xdr file, 184Cytosine (C), 293

    DDAD (data access definition) files, 96, 107

    relational database node mapping, 97,99–100

    SQL node mapping, 97–98DADS file, 110–111DADX files, 107, 108, 110Data

    adding context to, 5application view, 113classifying for data mining, 408–413complete context, 5–6exchanging with XML, 332heterogeneity, 387heterogeneous federated access, 111–112mapping to functions, 31–33storage, 113storing with leafing mechanism, 203stronger typing constraints, xxvwithout context, 298

    Data elements, 307attributes used as, 10–11describing, 5

    inappropriate grouping, 12–14interpreting, 5–6parts of forms to file in, 14proper use of, 6–7something about, 5, 6truly representing data, 14used as metadata, 11–12

    Data engines, 116Data entry, 225–231Data management, xxix–xxx

    Web services technology, 107–111Data mining, 450

    background knowledge, 448classifying data, 408–413extracting and evaluating association

    rules, 404–408inductive databases, 413–416, 451integrating algorithms and databases,

    450–451PMML (Predictive Model Mark-up

    Language), 416–419XDM (XML for Data Mining), 419–449

    Data modelsDOM (Document Object Model), 193–194XML documents, 520–523

    Data space, 36Data store, 61, 116DATA table, 272Data types for bioinformatics data, 296Data warehouse applications, 401DATA-AND-RULE-SET element, 446Database attribute, 424Database server, as information repository for

    user interface, 189Database systems, 69

    switching between alternative, 47two-phase locking, 81write-ahead logging, 81–82

    DatabaseManager class, 50DatabaseManager.getCollection() method, 52

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 606

  • Index 607

    DatabaseManager.registerDatabase method, 50Databases

    accessing, 166automatically building from patterns, 9benchmark results, 539–540browsing, 39cache sizing, 84collections, 44converting data into XML, 344disk space evaluation, 539extracting data patterns from, 401–402full functionality, 42indexing, 35, 277inductive, 402, 413–416integration with data-mining tools, 403–416native XML, xxix–xxx, 556–558OAI access, 345–347posting XML documents into, 151–152root collection, 48security, 42storage evaluation, 536–538for storing XML documents, 523–533Tamino XML Server access, 28–31test results for storage benchmarks, 536–540traditional, 308Transact-SQL statement, 170typed DOM (Document Object Model)

    implementation, 527Web access to, 149–155XML data management, xxixXML-related database size, 556–558

    Data-centric documents, xxv, 21, 45fine-grained storage of table attributes,

    124–125queries, 37relational databases, 124–125

    Data-centric XML, xxiii–xxvDataDictionary element, 418Data-mining applications, 401–402

    Data-mining tools, integration with databases, 403–416

    DATA-SECTION element, 442DataSet object, 170Datastore, 129Datawarehouse component, 468–469DAWAX, 461–462, 470–473dawax package, 471dawax.interrogation package, 471dawax.management package, 471dawax.specification package, 471dawax.xml.documentmodel package, 471dawax.xml.parser package, 471/db collection, 48DB2, 69, 92

    access and support for unstructured data, 112–113

    application support, 111–112data management Web services

    architecture, 107data management Web services

    technology, 107–111database managers, 103DB2 family and Informix IDS data

    sources, 102federated systems, 103heterogeneous federated data access,

    111–112"Improve Cash Flow" case study, 327–334"Our Most Valued Customers Come First"

    case study, 322–327processing support, 94relational data stored in, 325searching XML documents, 327shared architecture and technology, 92solution to problem, 322–326SQL XML support, 92, 102–103Web service support, 92WORF (Web services Object Runtime

    Framework) tools, 107XML Extender, 92, 94–95, 323, 324, 477

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 607

  • 608 Index

    DB2 databasesSQL statements executed on relational

    data, 107student and classroom record system, 328

    DB2 for Linux, 92DB2 for UNIX, 92DB2 for Windows, 92DB2 SQL Reference, 104DBAccess bean, 163DBBroker interface, 46, 63DBDOM, 477DBMS hierarchical concepts, 272–273DBMS_XMLQuery, 139DBMS_XMLSave, 139DBMX_XMLGen PL/SQL package, 148–149DBMX_XMLGen.ctxHandle class, 148DBUri path, 157DBUriType objects, 158–160DBUriType subtype, 156–157DBViewer bean, 163dbXML, 477dbXML project, 61DCOM (Distributed Component Object

    Model), 383dc:title element, 56DDBJ Web site, 292Deadlocks, 81–84DEFAULT rule, 410DELETE method, 23, 134deleteXML method, 142Department of Education reimbursement

    request Web service, 328–333Derived XDM data item, 432–433description element, 368Detailed view, 422DEVICEs, 397–398Digital-mapping database system, 353DIPCS (Distributed Interprocess

    Communication System), 397–399Directory servers

    benchmarks, 535nontyped DOM implementation, 530–532

    parent-child relationships of DOM tree,530–531

    typed DOM implementation, 530XML and, 543–544XML documents, 528–533

    disconnect() method, 215Distance education technology, 398Distributed applications, 388, 399–400Distributed instructional software, 384DIVIDE BY clause, 407DLA (Defense Logistics Agency), 338DMAP (Digital Mapping, Charting and Geodesy

    Analysis Program), 353DMG (Data Mining Group), 416DMG Web site, 416, 452DML (Data Manipulation Language), 152DNA (Deoxyribonucleic Acid), 293–294,

    314–315DNA double helix, 291DNA polymerase, 295DNA sequence model, 307–308DNA sequences

    determining set of nucleotides making up, 314

    different types of information, 308genes, 303tBLASTx search, 317–318

    element, 303, 308doc table, 199, 234DocTab table, 127Doctypes, 36

    dynamically created, 25open or closed content, 28

    element, 484Document queries, 480document table, 468Document tree, 57–59Document-centric documents, 21

    addressing human users, 45DTDs (Document Type Definitions), xxvLOBs (Large Objects), 124

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 608

  • Index 609

    queries, 37relational databases, 124

    Document-centric XML, xxiii–xxivdocument() function, 48Documents

    attributes, 26complex, proprietary standards, xxiidata-centric, xxv, 21doctype, 25document-centric, 21grouping, 25hybrid, xxiiimaximum size of indexed, 59–60ordered sequential access to nodes, 62printing, 341rapid serialization, 195root element, 25subtrees, 26text nodes of elements, 251against which expression evaluated, 48

    DoD (Department of Defense), 338DOM (Document Object Model), xxvi–xxvii,

    193–194, 310attributes, 201interface, 160nontyped implementation, 521–522object-oriented databases, 528translating XML document into in-memory

    tree structure, 343typed implementation, 522–523

    DOM API and repository, 258DOM applications, memory limitations, 194DOM Builder bean, 162DOM (Document Object Model) parsers,

    xxvi, 343DOM (Document Object Model) tree,

    xxvi, 359–360DOM-compliant applications, 194dom.dbx file, 61DOMParser class, 162DOM-to-GIDB mapping process, 360Double helix DNA structure, 293

    DPP (Digital Pattern Processing) technology, 309

    Drill-down methods, 388Drivers, 50–51DSx1 data set, 504–505DSx10 data set, 504–505DSx100 data set, 504–505DTD events, 244DTDHandler interface, 220DTDHandler methods, 244DTDs (Document Type Definitions), xxiii,

    xxiv–xxvi, 348, 520document-centric XML, xxvhandling content, 220Java class generation, 161–162lacking information, 24simple XML document, xxvsource file creation from, 161–162

    dxxShredXML stored procedure, 324Dynamic data types, 308Dynamic Web pages, xxii

    EEBI Web site, 292ebXML initiative, 569edge table view, 177–180EJB (Enterprise Java Beans), 23EJB API, 23EJB-based API (Enterprise Java Bean-based

    API), 337Element content selection queries, 511Element name selection queries, 510ELEMENT node, 440Element table, 468element_name table, 200–201, 226Elements

    generating benchmark content, 507–508long names, 201mapping names to node identifiers, 62

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 609

  • 610 Index

    Elements (continued)names exceeding maximum length, 235not containing tags or child elements, xxv

    elements.dbx file, 61, 62Embedded XML databases, 69

    ad hoc queries, 71advantages, 72application design, 71basics, 70–71building applications, 73–86checkpoints, 85–86code requirements, 71component failure, 70configuration, 74–75connecting legacy applications, 71–72deadlocks, 81–84ease of use for end user, 70end-user query tools, 70function call or class method API, 70graphical user interfaces, 70indexing, 75–77latency, 70performance, 71policy decisions by developer, 74–75programming for transactions, 80–81query interfaces, 70recovery processing after failures, 86reducing contention, 84–85reliability, 71report generators, 70running directly in application or system, 70running inside application, 73scalability, 71speed, 70storing and fetching records, 70threading model, 73throughput, 70two-phase locking, 81–84

    Emp element, 106endCDATA() method, 242–244endDocument method, 242

    endDTD method, 241endElement() method, 240–241endEntity() method, 242–244Enterprise Information Portal, 113EntireX Communicator, 31Entity beans, 346Entity references, 203entity_reference table, 203, 228Environment, 75Enzymes, 295equality keys, 78–79Equality operator, 64–65Error checking stored procedures, 205ESQL/C (Embedded SQL for C), 268Eukaryota/Prokaryota breakdown of cell, 303EVALUATE RULE operator, 406–408, 413, 444EVALUATE statement, 407EVALUATED-FOR element, 446–447EVALUATE-RULE operator, 441–443Evaluating with fast path join algorithms, 45EVALUATION-FEATURE element, 442Event-handling functions, 31Exact match attribute value selection

    queries, 510Exact match query expressions, 65Exception-handling methods, 218–219, 223ExecXPath() function, 267ExecXPath() module, 264eXist, 43

    document id, name id pairs for keys, 62abbreviated XPath syntax, 49accessing with SOAP, 52–54application development, 46–47, 50–56architecture overview, 45–47Axis SOAP toolkit, 52broker classes, 46Cocoon, 47collections, 44command-line clients, 47current versions, 47database engine, 46

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 610

  • Index 611

    default keyword tokenizer, 49deployment, 46direct access to local instance, 50document identifiers, 58driver implementations, 50efficient, index-based query processing, 43extensions for full-text searching, 45fast path join algorithms, 45full-text search extensions, 49generated node identifiers, 61hardware requirements, 47index and storage implementation, 61–63index files, 61index-based access to full-text content of

    nodes, 48–49index-based query processing, 44–45indexing system, 58–61installation process, 47integration with Cocoon, 54–56JDK (Java Development Kit), 47native XML database, 45node identifiers, 58numbering scheme, 58path join algorithms, 64pluggable storage backends, 45–46programming Java applications, 50–52query engine, 66query execution approaches, 57–58query language extensions, 47–49query language processing, 63–65query performance, 65–66query processor, 63–65querying text, 48–49relational backend, 45, 46remote database engine using XML-RPC

    calls, 50schema-less XML data storage, 44search engine, 43serializer, 62specifying input document set, 48technical background, 56–66

    testing scalability, 66Tomcat Web server, 47, 52XML nodes, 58XML resources, 50XML:DB API, 55XML-RPC interface, 52XPath query engine, 49

    eXist databases, 51–52eXist Web site, 47existsNode method, 133existsNode SQL function, 144Exons, 304extended data type, 270EXTEND-WITH-CLASS element, 432Extensibility, 4extract function, 133–135extract SQL function, 144EXTRACTING clause, 406extract() method, 146extractXML class, 245–247extractXML.java file, 192

    FFALSE-BRANCH element, 429FASTA, 312featureAttributeDate subtype, 370featureAttributeDecimal subtype, 369featureAttributeInteger subtype, 369featureAttributeString subtype, 369FeatureCollection element, 368featureMembers element, 368Federated queries, 109Fields, separating, 15FILE_DATASTORE property, 131Files, processing by URL, 142Filter expressions, 279Filtering, 129Flat streams, 467Flexibility, 4

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 611

  • 612 Index

    Flight control software, 380FNMOC (Fleet Numerical Meteorology and

    Oceanography Center), 356FOR XML query, 170–171, 181, 184Formatting XML documents, xxviiiForms, 16fragment element, 463Fragments, 459, 463–464, 469Frequency, 414Frise, Erwin, 299Full structure index, 36Full-text searching, 45Fully qualified URI (Uniform Resource

    Identifier), 50fuzzy search (?) function, 128

    GGAME (Genome Annotation Markup

    Elements), 299GemStone, 353GenBank, 300 . . . element, 303Gene Expression RFP (Request For

    Proposal), 300Gene sections, 303Generalized queries, 251Generic hash table, 343Genes, 293–294, 303–305Genetic code, redundancies in coding, 316Genetic Code Table, 295–296Genetic-based information models, 300–308Genome, 293Geographic information, transfer and

    storage, 368Geographical layer object, 363Georeferenced map images, 367George Mason University, 384Georgetown University Web site, 299GET method, 23

    GET request, 152getClob() method, 158getClobVal() method, 132, 133getCollection method, 50getExternalUrl method, 159getNumberVal() method, 132, 133, 134getNumRowsProcessed() method, 140, 148getStringVal() method, 132, 133, 134getStringValue method, 145getUri method, 159getURL method, 142getUrl() method, 158getXML method, 148getXMLDOC method, 140getXMLMetaData method, 141getXMLSAX method, 140getXMLSchema method, 141getXMLString method, 140GIDB (Geospatial Information Data Base), 353

    application schema document (.XSD), 368architecture, 355catalog, 360GML import and export, 368–372hierarchy, 359instance document (.XML), 368integration of METOC (Meteorological and

    Oceanographic) data, 356–362Metcast connectivity, 357Ozone, 354standard interface to data stores, 357success of, 355WMS architecture, 367WMS (Web Map Service) Interface

    Specification, 363GIDB catalog tree, 360–361GIDB Library node, 360GIDB mapping application, 354, 359, 366GIDB Metcast driver, 357, 362gidb namespace, 368, 370–372GIDB Ozone database, 354

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 612

  • Index 613

    GIDB portaldrivers, 355geographical layers, 365retrieving data from many data

    repositories, 354URL request for Capabilities interface, 363WMS driver, 367

    GIDB portal driver, 363GIDB request, 361gidb:_featureAttribute subtype, 368, 369gidb:VectorFeatureType type, 368, 370gidb.xsd namespace application schema, 370GIS (Geographic Information System)

    applications, 355, 372Global sequence alignment algorithm, 312GML (Geographic Markup Language), 355,

    368–372gml namespace, 368gml:AbstractFeatureType feature type, 368Gordon, Paul, 300GROUP BY clause, 147, 406, 407groupby elements, 466Grouping documents, 25GROUPING element, 437, 442Groupings, 7Guanine (G), 293

    HhandleText() method, 238, 239–240Harmon, Paul, 567HASPATH operator, 129–130, 131HASPATH query form, 126HDML (Handheld Device Markup

    Language), 151HEAD method, 23HEAD node, 440HEAD-SCHEMA element, 437, 443hello-world.xml file, 211–213hello-world-xml.sql script, 192, 211

    Heterogeneity, 4Heterogeneous federated data access, 111–112Hierarchical operators, 271–272Hierarchy, 5–6High school reimbursement request

    application, 329–330HLA architecture, 400Hormones, 295HTML, 151

    Cocoon, 54resulting from XSLT transformation, xxvix

    HTML documents, xxi–xxiiHTML files, search templates based on

    structure, xxiHTML pages, embedding JSP tags, 41HTTP, 351

    accessing remote eXist server, 46encoding information, 34Microsoft SQL Server 2000, 167–169security, 169Tamino XML Server, 22

    HttpUriType subtype, 156Human genome, starting and stopping

    points, 314Human Genome Project, 291, 292Hunter, Lawrence, 310Hybrid documents, xxiii

    IIBM Content Management and Enterprise

    Information Portal, 112–113IBM Informix IDS 9.x, 259IBM XML data management

    API goals, 113–116application interface, 113–116compiler updates to support XML, 119data manager goals, 116data type, 113–116engine, 116

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 613

  • 614 Index

    IBM XML data management (continued)indexing, 119logging and locking mechanisms, 120new storage and runtime support, 119storage, 116transaction management, 120utilities, 120vision, 113

    IDREF attributes, 465IETF (Internet Engineering Task Force), 567ignorableWhitespace() method, 238–239IIS ISAPI extension, 167Image repository, 337Images

    converting data into XML, 344–345printing, 341retrieving, 341

    Imielinski, T., 413"Improve Cash Flow" case study

    addressing problem, 328–333company scenario, 327–328Department of Education, 328, 329Department of Education reimbursement

    request Web service, 330–333future extensions, 333–334high school, 328high school reimbursement request

    application, 329–330input data to Web service, 330reimbursing teachers, 327–328response data, 330–331

    Index entries, 8Index space, 36Index-based query processing, 44–45Indexes

    3-tuple identifiers, 58alternating level-order numbering scheme,

    60–61attribute node type, 76Berkeley DB XML specifications, 76–77

    edge node type, 79edge path type, 76edge-element-presence-none type, 79element node type, 76embedded XML databases, 75–77equality key type, 76, 79eXist implementation, 61–63implementation, 61level-order numbering, 60node path type, 76node-attribute-equality-string type, 78–79node-attribute-presence-none type, 78node-element-equality-string type, 78node-element-presence-none type, 78none syntax type, 76number of children node may have, 60number syntax type, 76, 79numbering scheme, 58organization, 61organizing entries by collections, 62–63presence keys, 76, 78searching collections with, 75–76string syntax type, 76substring key type, 76, 79traditional inverted, 63

    Indexingembedded XML databases, 75–77maximum document size, 59–60queries, 280–281

    Inductive databases, 402, 451for binary association rules, 414instance of, 413instances dealing with specific type of

    pattern, 415–416schema, 413types of patterns represented, 415XDM (XML for Data Mining), 403

    Information, 298attributes, 5context list, 7–8

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 614

  • Index 615

    data elements, 5efficient use of storage, 309–310expressing, 5–6hierarchy, 5–6immediate availability, 308–309integration, 92ordering components, 5tags, 5transforming, xxviii

    Information integration-specific architectureand technology, 111–113

    Information modelsgenetic-based, 300–308pitfalls, 9–14

    Information systems, modeling, 4–5Informix, 477Informix Dynamic Server data, 111Informix IDS, 277Inhabitants attribute, 31Inner collection, 136INPATH operator, 129–130, 131INPATH query form, 126Input validation, 347/inquiry expression, 280/inquiry//address expression, 280/inquiry/inquiry-from expression, 280INRIA, 540, 541INSERT, 284insertValue() method, 217–218insertXML method, 142, 143InterestingCustomers table, 407intExecSQL() method, 215–216Introns, 304Invalid characters, 106INVENTORY table, 327ISAPI extension, 169isFragment() method, 132Isolation level, 38

    JJ2EE 1.3 platform, 351Japan International Protein Information

    Database, 299Java

    class generation, 161–162DAWAX packages, 471GIDB database and system code, 354processing XMLType, 135SQL-J standard, 268XDK (XML Development Kit), 160XML, 342XSU (XML SQL Utility), 125

    Java API, 23Java API for XML Processing (JAXP version

    1.1), xxviJava applications, 50–52Java Beans, 162–163Java classes, 192Java programs

    constructing XML documents, 162invoking, 154–155

    Java RMI (Remote Method Invocation), 383, 397

    Java Transaction API, 351Java-based Cloudscape, 277java.lang.String getStringVal() method, 135java.lang.String getXMLString method, 140java.net.URL object, 142JAXB (Java Architecture for XML Binding)

    generating Java classes representing XMLdocument structure, 343–344

    support for XML Schema, 348JAXB compiler, 348Jaxen, 57, 65Jaxen Web site, 65JAXM (Java API for XML Messaging), 351JDBC, 268JDBC database connection, 135, 149–150

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 615

  • 616 Index

    JDBC-ODBC Bridge, 219JDOM, 343JEDMICS (Joint Engineering Data Management

    Information Control System), 337–339inability to select and return different types

    of data, 341mass transfer capability, 350Oracle XSU, 347querying and producing reports on print

    jobs, 351querying repository, 347–348as XML documents, 350

    JMS (Java Messaging System), 351Join conditions, 459, 465join element, 465Journal space, 36JSP (Java Server Pages), 41 JSP mechanism, 155 JSP mechanism, 155

    KKDD (Knowledge Discovery) process, 402keepObjectOpen method, 140 tag, 300Keyword-based expression, 65Keywords, 63, 300Knowledge management system, 292, 298Kweelt, 477

    LLabBook, Inc. Web site, 298Lancaster QoS-A (Quality of Service

    Architecture), 399Lanzi, P.L., 409Layered Queuing Networks, 400LDAP (Lightweight Directory Access

    Protocol), 529LDIF (Lightweight Directory Interchange

    Format), 529

    LegoDB, 473Lewis, Suzanna, 299Lexers, 129Life sciences, modeling information with XML,

    296–300LineFeatureType type, 368, 372Linnaean classification system, 297–298,

    302–303Linnaeus, Carolus, 297LOBs (Large Objects), 124Location element, 28Locking mechanisms, 61Long-time logging, 36–37LRU (Least Recently Used) page-buffering, 61

    MMAGE-ML (Microarray Gene Expression

    Markup Language), 299MAGE-OM (Microarray Gene Expression

    Object Model), 299main() method, 213–214MAML (Microarray Markup Language), 300Mannila, H., 413Map formats, 365–366Map-in functions, 31, 33Map-out functions, 31, 33Mapping, 136

    asymmetric, 33data to functions, 31–33SQL names to XML names, 106symmetric, 33

    Mass transfer capability, 350match function, 49match-all function, 49match-any function, 49MATHML, 388–389MAX() function, 274MBL (Metcast Broker Language), 356MBL requests, 357, 361–362MEASURES element, 438

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 616

  • Index 617

    MedNet, 398Messaging, 350–351Metabolic reactions, 295Metadata, 6, 263

    attributes, 7, 10data elements used as, 11dynamic, 308managing in XML warehouses, 462–466mediated schema definition, 466relational, 118retrieving with JDBC statements, 345unnecessary indirection, 12–14XML, 118

    Metamodeling, 467Metcast

    acquiring products from, 358catalog, 356implementation, 357–362MBL requests, 357, 361–362

    Metcast catalog, 359–360, 362Metcast catalog DTD (Document Type

    Definition), 356Metcast data, 359Metcast data model, 356Metcast driver, 358–359Metcast server, 356Methionine, 314, 316Method query, 53Methods, exception-handling, 218–219METOC (Meteorological and Oceanographic)

    data, 356–357MGED Web site, 300Michigan benchmark, 544–545

    data set used by, 502–508detail table, 516generating string attributes and element

    content, 507–508queries, 508–516scaling, 504–505schema of data, 505–507

    summary table, 516usage, 516–517

    Micro-benchmarks, 500Microsoft, 4Microsoft SQL Server 2000, 477

    access via HTTP, 167–169ad hoc URL query mechanism, 168 ADO, 167database access, 166direct query access, 168IIS ISAPI extension, 167ISAPI extension, 167.NET interface, 167providing relational views over XML,

    176–180querying using XPath, 183–185rowset mechanism over XML, 177serializing SQL query results into XML,

    170–176SOAP access, 169SQL Server ISAPI, 167–168SQLOLEDB provider, 167template access, 168XDR (XML Data Reduced) schema

    language, 182XML access to, 166–170XML features through SQLOLEDB, ADO,

    and .NET, 169–170XML Schema format, 182XPath XML view access, 168

    Microsoft.Data.SqlXml class, 170MINE CLASSIFICATION TEST operator, 413MINE-CLASSIFICATION operator, 409,

    411–413, 426–427, 435, 441MINE-RULE operator, 405–406, 408, 413, 416,

    422, 436–438, 441–443, 452MIN() function, 274Minimizing or eliminating deadlocks, 84MIX (Mediation of Information), 472Modeling information systems, 4–5

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 617

  • 618 Index

    element, 484Molecular biology background, 293–295Molecular machines, 295Monument element, 28@mp:id meta property, 178@mp:parentid meta property, 178@mp:prev meta property, 178@mp:xmltext meta property, 178mRNA (messenger ribonucleic acid), 295, 304Multi-Media MedNet, 398Multiroot B+-tree, 61Munich Information Center for Protein

    Sequences, 299mycollection1 collection, 33MyCustomer view, 142, 146mydb database, 31myelement element, 26myGroup group section, 131myQuery.xsql file, 149MySQL, 470myStorage preferences, 131myXsl.xsl style sheet, 151

    NName attribute, 33name element, 33, 368... tag, 136NAMESPACE_HANDLING Boolean, 221Namespaces, xxv

    long prefixes, 201prefixes exceeding maximum length, 235Tamino XML Server, 28XML Schema, 28

    Name_Type type, 137Nara Institute of Science and Technology, 542Narrower Term operator, 128National Biomedical Research Foundation, 299National Ocean Survey, 354

    Native XML databases, xxix–xxx, 123, 477, 547basic and advanced features, 43benchmarks, 535best performance of, 563–564built and designed for handling XML, 21database size, 556–558direct mapping, 346eXist, 45performance, 347performance-enhancing technologies, 548reporting, 561–562round-tripping, 346SQL operations (mass records), 560–561SQL operations (one record), 558–560storing XML data directly, 548–549Tamino XML Server, 21XMach-1 benchmark, 550–551XML documents, 533XPath or XQL for generating queries, 346

    Native XML type, 114NATIX, 473Naval Oceanographic Office, 354Naval Research Laboratory, 353NCBI (National Center for Biotechnology

    Information), 292, 300NCBI Web site, 292 tags, 313near function, 65NEAR operator, 128Needleman-Wunsch algorithm, 312NeoCore, 4NeoCore DPP technology, 318NeoCore sequence search plug-in, 312NeoCore SSE, 313–319NeoCore XMS (XML Management System)

    accessing, 311adding structural information to protein-

    record, 311dynamic management of metadata, 308efficient use of storage, 309–310

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 618

  • Index 619

    hierarchical vector convergence algorithm, 309

    immediate availability of information,308–309

    indexing data for data-only queries, 310–311information couplets, 308integrating BLAST into, 311–319loading information into, 310powerful plug-ins, 316query retrieval, 311retrieving information, 309scalability, 309schema independence, 309Swiss-Prot protein database, 310well-informed XML, 317

    Nested Sets model, 195Nested Sets property, 252Nested structure queries, 137Nesting tags, 139.NET, 167

    wsdl.exe tool, 52XML features, 169–170

    Networksbest-effort service, 384component integration, 375–376heterogeneity, 388QoS service levels, 383

    New Applications data item, 430NEW distance education, 384NEW-APPLICANTS element, 430, 431New_Applicants table, 412NHGRI (National Human Genome Research

    Institute), 298NIH (National Institutes of Health), 292NIMA (National Imagery and Mapping

    Agency), 354Nimble notation, 392NLM (National Library of Medicine), 292Node identifiers, 61

    node table, 200inserting document node, 234node ID, 226

    Node types, 59Node UDT, 277, 284NodeImpl class, 521Nodes

    adding, 284–285child nodes, 253, 271descendants, 271modifying data, 284names, 279–280names and kinds, 271Nested Sets coordinates, 231–232nulls value of y, 230parent node, 252–253, 271position in hierarchy, 264relative locations, 271unique value, 263–264value of node_id, 230value of y, 230

    node_type lookup table, 198–199Nontyped DOM (Document Object Model)

    implementation, 521–522directory servers, 530–532relational databases, 524–526

    Non-Web based applications interactions, 394nonXML doctype, 25NOT (~) operator, 128not() logical operator, 158NS_PREFIX_HANDLING Boolean, 221Nucleotide sequences, 311–312, 318Nucleotide-Nucleotide Search (Blastn), 313Nucleotides, 293, 295NUM attribute, 138Numeric text, 133

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 619

  • 620 Index

    OOAI

    relational databases, 347OAI (Open Application Interface), 342

    accessing, 339architecture, 339component architecture, 340content validation, 347control over presentation, 342CORBA clients, 342CORBA layer, 338design choices, 341–345DTDs for defining engineering

    documents, 342early implementation of JAXB, 348entity beans, 346extensibility, 342external components, 339generating output DTD and XML

    Schema, 344image metadata, 344Image Query Service, 339, 341, 344–345input and output to external client

    interfaces, 342input validation, 347–348JAXM (Java API for XML Messaging), 351JDBC statements for retrieving metadata, 345JEDMICS, 342JMS (Java Messaging System), 351limited subset of DOM capability, 343mass transfer capability, 350messaging, 350–351middle-layer services components, 339Open Standards-based interface, 338overview, 339–341persistence layer components, 339portability, 342Print Service, 339, 341queries, 340

    Query Service, 339, 340, 348, 349querying JEDMICS repository, 347–348retrieving image data, 344Security Service, 339stored procedures for manipulating

    collections of data, 345–346structure validation, 347unique session identifier, 339updating engineering drawings and

    collection metadata, 346user communities, 345Web Services, 349–350XML, 341–342XML data, 339XSLT (XML Transformations), 349

    OASIS (Organization for the Advancement ofStructured Information Standards), 569

    Oasis Web site, xxxviObject communication de facto standard, 383Object databases, 190Object types, 136–137Object view, 138–139Object-oriented databases

    benchmarks, 534–535DOM (Document Object Model), 528XML and, 543

    Object-oriented programming languages, 190Object-relational databases and XML, 542–543Object-relational extensions, 116Objects, converting XML input into, 343–344ObjectStore, 353OccasionalType element, 505–506ODBC (Open Database Connectivity), 268ODBC data source, 219OGC (OpenGIS Consortium), 363OGC WMS (OpenGIS Consortium Web Map

    Service), 355OLAP (On-Line Analytical Processing)

    applications, 401OMEGA endpoint architecture, 399

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 620

  • Index 621

    OMG (Object Management Group), 300, 569OO (object-oriented), 353Open Source PostgreSQL-DBMS, 260, 277OpenGIS Web Map Server Interface

    Implementation Specification, 363OpenGIS Web site, 363OpenXML, 177–178, 180OpenXML rowset provider, 177Operators, hierarchical, 271–272Optimization technique, 312OR (Object-Relational) approach, 119or logical operator, 158OR (|) operator, 128Oracle, 69, 148, 344

    database access, 345–347full XPath syntax, 130Java Beans, 162–163object types to support URI concept,

    155–156special features, 155–163"thin" JDBC driver, 150

    Oracle Text cartridge, 126–127Oracle XML TransViewer beans, 162–163Oracle XSU (XML SQL Utility), 344Oracle8i, 477Oracle9i, 477

    CLOB (Character Large Object) functionality, 125

    parsers, 160–161URIs (Uniform Resource Identifiers)

    support, 155–160as XML database, 124XMLType object type, 132–135

    Oracle9iAS, 149oracle.sql.CLOBgetClob() method, 135OracleText, 127–130, 133oracle.xdb.XMLType Java class, 135OracleXMLQuery class, 139–141OracleXMLSave class, 139, 142

    oracle.xml.uri.OraDbUriServlet servlet, 158oracle.xml.xsql.XSQLRequest object, 155ORDBMS (Object-Relational Database

    Management System), 259, 268Compare() UDF, 274extended data type, 270indexing queries, 280–281user-defined functions, 276user-defined types, 270

    ORDBMS Cloudscape, 259–260Order-based selection queries, 510ORDERS table, 325Order-sensitive parent-child selection queries,

    511–512ORF (Open Reading Frames), 314org.w3c.dom.Document getXMLDOC

    method, 140OSE (Oracle Servlet Engine), 158OT Land Web site, 567"Our Most Valued Customers Come First"

    case studyacting on problems from most important

    customers first, 322better business decisions for future product

    lines, 327company scenario, 322future extensions, 326–329immediate response to customer, 326more accurate marketing strategies, 327order data, 325problem reports as XML documents,

    323–324solution to problem, 322–326triggered actions, 325–326

    Outbound Web services, 110OUTPUT element, 422, 432Ozone, 354Ozone Web site, 354

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 621

  • 622 Index

    PPalomar Web site, 303Pan-enterprise information systems, 260 element, 484PARAM element, 426, 431Parent node, 271Parent-child complex pattern selection

    queries, 513Parent-child selection queries, 511Parsed character data, xxvparse() method, 221–223Parser Event Objects, 267parser interface, 264Parsers for Oracle9i, 160–161Parsing XML data, 102Partial view, 422Partstock table, 110path expressions, 271Path join algorithms, 64PATH_SECTION_GROUP section, 130–131pattern element, 464Pattern indices, 8–9pattern.node element, 464Patterns, 6–8, 459, 468

    automatically building database, 9describing, 464evaluating, 469

    #PCDATA keyword, xxv#PCDATA nodes, 437PDF files and Cocoon, 54Pepper, Senator Claude, 292 . . . element, 304PHONE elements, 146–147 tag, 146PHONES tag, 138Phone_Tab table, 137Physiome Sciences, Inc., 299pi_data_leaf table, 203, 228PIP (payload interface processor), 380

    PIR (Protein Information Resource) database, 299

    pi_target_leaf table, 203, 228–229PLAY node set, 64PLAY//SPEECH expression, 64PL/SQL

    generating XML documents from queryresults, 148–149

    XDK (XML Development Kit), 160PMML (Predictive Model Mark-up Language),

    403, 416–419, 448–449, 452PMML documents, 416–418PNG image, 366Pointer-based join queries, 514PointFeatureType type, 368, 372POST method, 23posted-xml parameter, 154Preferred Term operator, 128Presence keys, 78"Primer on Molecular Genetics," 293Primitive data types, xxvPrimitive elements, xxvprint method, 161Printing

    documents, 341images, 341

    Printing bean, 350–351Problem reports as XML documents, 323–324PROBLEMS table, 325, 327Procedures, stored for error checking, 205Process hierarchies, 277processContents element, 26Processing instruction, 237processingInstruction method, 237PRODUCT element, 436, 437PROFILE element, 424, 427, 430Programming for transactions, 80–81 . . . element, 304Promoter sections, 304Protein-Protein Search (Blastp), 313

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 622

  • Index 623

    Proteins, 293–295, 304Proteometrics, LLC, 298Proteometrics Canada, Ltd., 298Psaila, G., 409PSDML (Protein Sequence Database Markup

    Language), 299PUT method, 23

    QQoS-aware configuration language, 395–397Quaternary system, 293Queries

    ad hoc, 345aggregation, 515ancestor nesting in ancestor-descendant

    selection, 512ancestor-descendant complex pattern

    selection, 514ancestor-descendant selection, 512benchmarks, 480–482blocking operation, 264–265complex pattern selection, 513components as terms, 15converting single SELECT expression to,

    144–146cursor mechanism on results, 38DADX files, 110data-centric environments, 37describing structure, 139displaying results as XML documents, 163document, 480document-centric environments, 37element content selection, 511element name selection, 510exact match attribute value selection, 510execution, 39eXist execution approaches, 57–58eXist performance, 65–66

    federated, 109full-text searching, 45generalized, 251GET request, 152index-based processing, 43, 44–45indexing, 280–281information about, 248–249Michigan benchmark, 508–516nested structure, 137OAI, 340optimizing, 76order-based selection, 510order-sensitive parent-child selection,

    511–512over hierarchy of type-specific tables, 270parameterization, 152–154parent-child complex pattern selection, 513parent-child selection, 511pointer-based join, 514relational, 480results as well-formed XML, 38–39returned structure, 509–510selection, 509–514session context, 38SQL, 124SQL XML output, 103SQL/XML, 105SSE (sequence search embedded), 312string distance selection, 511structural selection, 511–514Tamino XML Server, 37–39text content, 251text leaves, 250transactions, 38update, 515–516value-based join, 514X007 benchmark, 486–488XMach-1 benchmark, 489–490XMark benchmark, 495–497

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 623

  • 624 Index

    Queries (continued)XML view, 168XPath, 78–80, 262

    queriesstandardization, 125

    Query functions, 31Query language extensions, 47–49Query language processing, 63–65Query languages for XML, 472Querying

    JEDMICS repository, 347–348repository, 248–258using XPath, 183–185XML documents, 254XML messages, 262

    QueryResponse type, 53QueryService class, 53Quilt, 472Quotes, escaping, 216

    RRapid application development, 41Rapide, 393RDBMS (relational database management

    systems)attribute modeling approach, 555–556element modeling approach, 555–556as information store, 190long element names and namespace

    prefixes, 201modeling columns as XML DTD structure,

    555–556RDF (Resource Description Framework), 54Rechenmann, Francois, 296 element, 303registerHandler method, 159Regular path expressions

    decomposing into chain of basic steps, 63–65index structure, 57–58

    re_i_avl procedure, 227Related Term operator, 128Relational data

    building XML documents from, 144–155database support, 124density, 118lack of order, 118providing XML views over, 182–187reusing, 124stored in DB2, 325XML, 165–166

    Relational database systems, 69, 123–124, 166Relational database-mining framework, 413Relational databases, 189–192

    Adjacency List Model, 195benchmarks, 534book sales and reports design, 551–554bulk loading XML data into, 166creation of, 196–213data-centric documents, 124–125data-mining operators built on top of, 413Department of Education, 331document-centric documents, 124DOM storage, 195emulating extensibility, 11–14mapping XML data to, 467–468name/value pairs, 12Nested Sets Model, 195node mapping, 97, 99–100nontyped DOM implementation, 524–526OAI system, 347physical data model, 197–198serializing document out of repository,

    206–211source raw data, 402storing XML documents, 524–526table creation, 198–206user-defined data types, 198XML and, 541–542XML as transport format, 165xmlrep_user login, 197–198

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 624

  • Index 625

    Relational metadata, 118Relational queries, 480Relational storage, supporting along with XML,

    118–119Relational tables, mapping XML data to, 271Relational views, 176–180Relative path expressions, 280rep_gen_xpath procedure, 254–256rep_i_an procedure, 226–227rep_i_cdl procedure, 227rep_i_cl procedure, 228rep_i_d procedure, 225rep_i_en procedure, 226rep_i_er procedure, 228rep_i_n procedure, 226rep_i_pidl procedure, 228rep_i_pitl procedure, 228–229rep_i_tl procedure, 229replace() method, 216–217Reporting hierarchies, 277Repository

    adding document data, 263closing connection to, 215connecting to, 213–220DOM API, 258initializing connection to, 214locking granularity, 284manually building XML documents, 211–213new schema specifications, 263predefined schema, 292querying, 248–258serializing XML documents out of, 206–211SQL definition of schema, 269–270storing XML data in shredded format, 264supporting update operations, 284tables, 270testing connection, 220

    REPS table, 326rep_s_cdlid procedure, 229rep_s_children procedure, 253rep_serialise_nodes procedure, 245

    rep_serialise_nodes stored procedure, 212rep_s_n_last procedure, 230rep_s_parent procedure, 252–253rep_s_tlid procedure, 229rep_u_n_y procedure, 230rep_u_n_y_null procedure, 230Requests and URLs (uniform resource

    locators), 363Resources

    WebDAV management, 40XML:DB API, 50

    ResourceSet, 52Response XML document, 333restart() method, 140restartQuery function, 148result element, 465, 466Result pattern, 459result.getIterator() method, 52result.node element, 466Retrieve method, 54Reusable architecture, 377–378R.G. Edwards Web site, 257–258Ribosomes, 295RISK attribute, 432, 433RMI (Remote Method Invocation), xxvixRollback, 33Rollback transactions, 219Ron Bourret's Web site, xxxviRoot tag, 8 tag, 145–146, 150–151... tag, 136, 138 element, 150ROWSET tag, 146 tag, 151... tag, 136RPCs (Remote Procedure Calls), 349RULE element, 446RULE-SCHEMA element, 437, 443–444RULE-SECTION element, 442Rule_Set1 result set, 406Runtime network control system, 397–398

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 625

  • 626 Index

    SSALSA, 312Sanger, Frederick, 291SAX (Simple API for XML), xxvii, 220

    data as text stream, 343endElement event, 232exception-handling methods, 223LexicalEvent handler methods, 238

    SAX ContentHandler, 220SAX events, 62SAX interface, 160SAX LexicalHandler interface, 220SAX parser, 220

    beginning and end of DTD (document typedefinition), 241–242

    closing tab, 240–241DTD events, 244end of file, 242instantiating and invoking, 221–223lexical events, 242–244parsing entity references, 238startDocument() method, 234well-formed XML files, 224–225whitespace, 238

    SAX1parser, 220–221SAX2parsers, 220–225SBML (Systems Biology Markup Language), 299SBW (Systems Biology Workbench), 299Scalability and NeoCore XMS, 309Scalable tools, 375scanXML class, 224scanXML.java file, 192Schema documents based on W3C XML

    Schema, 39Schema Editor, 40Schema representation language, 389Schemas

    annotating, 27–28arbitrary support for, 118declaring, 178

    default mapping, 182dependence, 309editing, 39–40evolution of, 28flexible approach to, 118independence, 309inductive databases, 413mediated definition of, 466of Michigan benchmark data, 505–507organizing, 39source file creation from definition, 161–162SQL definition of repository schema,

    269–270subsets of, 279

    SCHOOLID value, 332SCHOOL_INFO table, 331SCORE function, 128Searching

    for text, 252XML data, 102

    Security, 42, 169SELECT statement, 107, 406

    concatenation as function call, 145left-to-right appearance of table alias, 173

    Select views, 458SELECT-FROM-WHERE query, 134, 137Selection queries, 509–514Self-constructing XML information system, 8Sequence alignment tools, 311sequence tag, xxvSequoia benchmark, 479Serializing

    document out of repository, 206–211SQL query results into XML, 170–176

    Serine, 316service.query method, 52Services, 50 tag, 154Sessions, 38Set enumeration model, 402Set Query benchmark, 478

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 626

  • Index 627

    setBatchSize method, 143setCommitBatch method, 143setMaxRows method, 140, 148setRowIdAttrName method, 141setRowIdAttrValue method, 141setRowSetTag method, 141, 148setRowTag method, 141, 144, 148setSkipRows method, 140, 148setStylesheetHeader method, 141Setting user permissions, 204–205setXSLT method, 144SEVERE_PROBLEMS table, 325, 326SEV_PROBLEMS trigger, 325SGML (Standard Generalized Markup

    Language), xxivShapefile format, 354shorten function, 160Shorttext view, 160Shredded rowset view, 177–180Sibling relationships, 7Simon, Hank, 300Simple .NET client in C#, 52–54Simple types, defining, xxvsitemap.xmap file, 54–55Sleepycat Software Web site, 73Smalltalk, 353SMIL, 389, 394, 399Smith-Waterman algorithm, 312SNPs (Single Nucleotide Polymorphisms),

    304–305SOAP (Simple Object Access Protocol),

    xxvix, 349, 569accessing eXist with, 52–54accessing remote eXist server, 46client libraries, 52toolkits, 52user-defined types, 52WSDL (Web Services Description Language)

    service description, 52SOAP messages, 351

    Softwarearchitectures and reusability, 376components, 376

    Software AG, 21soundex (!) function, 128SOURCE element, 422, 426, 431, 437, 442, 443source element, 462–463Space applications, 376–378Spacecraft on-board memory, 382Spacecraft software, 380–382Spaces, 36–37SPEAKER="HAMLET" predicate

    subexpression, 64SPEC table, 272, 279SPEECH elements, 64SPEECH[SPEAKER] expression, 64SQL

    ad hoc, 225combining XPath with, 134data entry stored procedures, 225–231data structured as relations, 118definition of repository schema, 269–270executing statements, 215–216expressions, 177node mapping, 97–98Node User-Defined Type Expression,

    275–276queries, 124stored procedures, 252–254update operations, 107

    SQL Node type, 272–274, 276, 284–285SQL queries

    ad hoc, 248–252aliases, 136auto mode, 170–171, 173–174avoiding caching of large XML fragments on

    server, 172checking XPath qualifiers, 282explicit mode, 170–171, 174–176generating arbitrary XML, 174–176nested mode, 170–171, 173–174parameterization, 152–154

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 627

  • 628 Index

    SQL queries (continued)raw mode, 170–172related to any tables or views, 136relational NULL values serialized, 172serialization process as post-processing

    step, 171serializing results into XML, 170–176as string value or as JDBC ResultSet,

    139–140universal table format, 174–176well-formed XML documents, 172

    SQL XMLcomposed data, 113–114data generation, 103development, 114enhanced query results, 112functions, 103information integration functionality, 105output from queries, 103publishing relational data in XML fragments,

    102–103support, 92support architecture, 102–103support technology, 104–106viewing table, 104

    SQL XML View Mapper, 183SQL-3, 281–284

    DBMS engines, 270queries, 264reconstructing XML document from

    repository schema, 276–277sorting data, 280XML_Doc_Trees view, 280

    SQL-3 structured types, 268SQLOLEDB, 169–170SQLOLEDB property, 172SQLOLEDB provider, 167, 168sql:relation annotation, 183SQLXML OLEDB provider, 168, 169SQL/XML query, 105SQLXML templates, 180–181

    SQLXML Web Microsoft Web site, 165SqlXmlAdapter class, 170SqlXmlCommand class, 170SqlXmlParameter class, 170sql:xpath-query element, 184SS7 (Signaling System 7)

    adapters, 386communication mechanism interface,

    385–386MG (manager) component, 385–386MT (monitor) component, 385–386Nimble notation, 386–387potential integration problems, 384–387

    SSE (sequence search embedded), 312SSW (Space Wide Web)

    adaptive spacecraft software, 380–382adaptive Web-based configuration

    language, 378scenario using future, 378–379supporting communications satellites

    constellations, 378–382Standard indexes, 35–36startCDATA() method, 242–244startDocument method, 234startDTD method, 241–242startElement method, 235–237startEntity() method, 242–244starts-with function, 79State transitions, 421–422stem ($) function, 128Stock table, 110Stop codons, 314Stoplists, 128Storage system

    eXist implementation, 61–63high volumes of information, 262XML-enabled repository, 262

    Stored procedures, 252–254, 345–346data entry, 225–231parameter, 170setting permissions, 231

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 628

  • Index 629

    Storing XML documents, 31, 523–533STREAMs, 397–398String distance selection queries, 511String types, 34Strings

    comparisons, 126cutting into leaves, 217–218generating benchmark attributes, 507–508replacing, 216–217searching for, 48splitting into leaves, 237substring search, 126XMLType object, 145

    Structure index, 36Style sheets, 144–145Subelements, xxvSUBJECT_DATA table, 331Substitution groups with feature attributes,

    369–370Substring keys, 79Subtrees, 26Support, 405SUPPORT element, 440, 446SVG (Scalable Vector Graphics), 54, 151Swiss-Prot, 300Swiss-Prot Web site, 292SWW (Space Wide Web), 375

    distributed component-based software applications, 376

    research and boundaries, 387SWWACXML (Space Wide Web Adapter

    Configuration eXtensible MarkupLanguage), 387

    adapters, 376application-level network QoS control,

    399–400building adapters with generated mapping

    code, 392–393components, 390configuration language, 389–392connectors, 390

    formalizing notation adapters describing, 392interaction support for adapter

    communication, 389–400lower-level approach to describing structure

    and properties of components and connectors, 393

    mapping notation, 391–392 section, 395, 396MT and MG interactions, 390–391network autoconfiguration, 395–397Nimble notation, 392parsing specifications, 392performance validation and evaluation, 399QoS, 395–397research approach, 388–389research tasks, 389–400reusable components, 392runtime network control system, 397–398 section,

    395, 396, 398 section, 395, 396, 398 . . . pairs, 390testing, 398XML compliant, 390

    . . . , 391

    .. . , 390

    SWW-XML, 379 element, 327Synonym operator, 128SYS_DBURIGen function, 159–160System Management Hub, 23Systema Naturae, 297SYS_XMLAgg function, 146–147SYS_XMLGen functions, 144–147SYS-XMLGenFormatType type, 146SYS_XMLGenFormatType type, 145SYS_XMLGenFormatType.createFormat

    method, 145

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 629

  • 630 Index

    TTableName-given rowset schema, 178Tables

    alias as element name, 173combination primary key, 200creation of, 198–206repository, 270

    Tagesspiegel Online, 543–544Tags, 5, 271

    attribute names as, 145describing data elements, 7hierarchy, 8inadequate use of, 12–14naming, 148nesting, 139open, 235preprinted parts of forms, 14renaming, 144

    Tamino, 123, 452, 477Tamino databases, 25, 36–37Tamino EJB API, 23Tamino Manager, 23Tamino XML Schema, 31–33Tamino XML Server

    accessing external relational databases, 23accessing other databases, 28–31APIs, 22–23architecture, 22–23collation information, 34–35compression techniques, 36configuration data, 25different types of data, 22doctypes, 25editing schemas, 39–40enhancing standard schematic

    descriptions, 24full database functionality, 42full-text search, 37graphical browser-based administration

    interface, 23

    handling query results, 38–39HTTP, 22identifier for documents or non-XML

    object, 25indexing, 35–36internationalization, 34–35journal space, 36long-time logging, 36–37modified schemas, 28modifying database information, 29namespaces, 28native XML databases, 21non-Unicode encoding, 34open content option, 26propagating information to external

    systems, 23queries, 37–39Schema Editor, 40schema example, 29–31security, 42sessions, 38simple XML schema, 27–28SSL communication, 22standard indexes, 35–36storing and retrieving information, 23storing arbitrary objects, 25structure index, 36supported HTTP methods, 23text indexes, 35–36tools, 39transactional logging, 36transactions, 38Unicode, 34user-defined functions, 37WebDAV (Web-based Distributed Authoring

    and Versioning) access, 40word fragment index, 36X-Application, 41XML documents, 22XML Schema, 26–28XML storage, 23–25

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 630

  • Index 631

    X-Node component, 28–31XPath, 37X-Plorer, 39X-Port component, 22XQuery, 37

    Tamino X-Node, 23Tamino X-Tension, 23Target, storing with leafing mechanism, 203 tag, 303Taylor, Ron, 310, 311Technologies, typical cycle of, 567Technology data management Web services,

    107–111Telephone directory listing, 15, 16Templates, 180–181Test pages, automatically generating, 107TEST-CLASSIFICATION operator,

    430, 432, 435Text, 220

    numeric, 133querying, 48–49searching for, 252splitting into multiple nodes, 239

    Text content queries, 251Text indexes, 35–36Text leaves queries, 250Text search operator (&=), 49text() function, 35, 133, 134, 158text_leaf table, 202–203, 229Thymine (T), 293TIBCO, 4Tiny HTML, 151title element, 466Tokenizing text into keywords, 63Tomcat, 47, 52, 107Topography and Bathymetry layer, 365TPC Web site, 479TPC-D benchmark, 478TPC-H benchmark, 478TPC-R benchmark, 478TPC-W benchmark, 479

    Training set, 408–409Training_Set table, 409–410, 411Transaction log, 85Transaction-handling methods, 219Transactions, 38

    beginning and ending, 80commit or rollback, 33, 81committing, 219locking, 81–82multiple operating on same data, 84programming for, 80–81rollback, 33, 82, 219rolling back deadlocked, 83–84terminating, 82

    TRANSACTIONS element, 436Transactions table, 404–405Transact-SQL statement, 170Transcription, 295Transcription factors, 304 . . ..

    elements, 304Transforming

    information, xxviiiXML data, 101–102

    transform.transformIn function, 33transform.transformOut function, 33Translation, 295Tree Java objects, 343Treeviewer bean, 163tRNA (transfer RNA), 295TRUE-BRANCH element, 429T-SQL FOR XML extensions, 168T-SQL fragment, 178–180T-SQL statements, 168, 180–181Two-phase locking, 81–84Typed DOM (Document Object Model)

    implementation, 522–523databases, 527directory servers, 530

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 631

  • 632 Index

    UUDDI (Universal Discovery Description

    Integration), 349, 569UDFs (user-defined functions), 96, 117

    outbound Web services, 110XML Extender, 94

    UDTs (user-defined types), 94, 272UML (Unified Modeling Language), 521, 569Unicode (Unicode Consortium 2000), 34Universal table format, 174–176University of Wisconsin, 540University Personnel DTD, 520Unstructured data, 112–113UPDATE method, 267, 284Update queries, 515–516UPDATE statement, 134UPDATE WHERE CURRENT OF CURSOR, 266Updategrams, 180–181, 185–186updateXML method, 142, 143Updating XML documents, 142uploadFile method, 233–234Uploading XML documents, 220–247uploadXML class, 231–245uploadXML.java file, 192UriFactory type, 159UriFactoryType class, 156URIs (Uniform Resource Identifiers), 125URIs (Uniform Resource Identifiers) support,

    155–160URIType object, 155UriType object, 156, 159URLs (uniform resource locators), 155, 363urn:oracle-xsql namespace, 149urn:schemas-microsoft-com:xml-sql

    namespace, 168, 182urn:schemas-microsoft-com:xml-updategram

    namespace, 186U.S. Army Corps of Engineers, 354U.S. Census Bureau, 354U.S. Geological Survey, 354

    U.S. Navy, 338useLowerCaseTagNames method, 141User-Agent type, 154User-defined

    data types, 198facets, 34functions, 37, 272object types, 137

    useTypeForCollElemTag method, 141useUpperCaseTagNames method, 141

    VvalidateContent() method, 161, 162Validating XML data, 102Value-based join queries, 514Variables, 459VectorFeatureType type, 372Venter, Craig, 292view element, 462, 463Views, 469

    composite, 458–459defining, 463–466definition example, 460definition of result pattern, 465–466result pattern, 459select, 458XDM databases, 422

    Virtual names, 169voidExecSQL() method, 215VPF (Vector Product Format), 354vroot (virtual root), 167

    WW3C

    DOM methods node types, 59XML Query Language Working Group, 480XML Schema, 24

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 632

  • Index 633

    XPointer specification, 155WAP (Wireless Access Protocol), 54WAS, 107Watson, James, 291Web map servers, 363Web pages

    dynamic, xxiiX-Application, 41

    Web servers, accessing databases from, 149–155Web Services, 92, 107–108, 349–350, 569Web-based interconnection, 394–395WebDAV (Web-based Distributed Authoring

    and Versioning)access, 40accessing remote eXist server, 46

    WebDAV (Web-based Distributed Authoringand Versioning) Server, 40

    WebDAV-based resource management, 40WebLogic, 149WebSphere Studio Application Developer, 107Well-informed XML, 317Whitespaces as character data, 238–239(*) wildcard, 48WIRE, xxiWisconsin benchmark, 478, 500WITHIN operator, 129–130, 131WITHIN query form, 126WML (Wireless Markup Language), xxiii, 151wml-example.xml file, 248WMS (Web Map Service), 363, 367WMS Capabilities DTD, 363WMS driver, 365–367WMS-compliant map servers

    GIDB portal driver, 363image or raster maps, 365PNG image, 366

    Word fragment index, 36words.dbx file, 61, 63WORF (Web services Object Runtime

    Framework) tools, 107World Wide Web (Web), 375

    Wright, 393Write-ahead logging, 81–82WSDL (Web Services Description Language),

    169, 349, 569WSDL documents, 107wsdl.exe tool, 52WWW (World Wide Web), 376

    XX007 benchmark, 479, 501–502, 544

    database parameters, 485queries, 486–488X007 database, 482–485

    XADT type, 543X-Application, 41xdb_g.jar JAR archive, 135XDK (XML Development Kit), 140, 160–162XDM (XML for Data Mining), 403

    association rules, 435–447basic concepts, 419–423benefits, 447–449classification model, 428–429classification with, 423–435comparing with PMML, 448–449derivation dependencies, 447–448extensibility, 449extraction of association rules, 436–438flexibility, 447, 449incremental computations, 449–450open description, 448as open framework, 427user interaction, 450XDM databases, 419–423

    XDM data items, 419containing purchase transaction data,

    435–436derivation with evaluated association rules,

    444–447

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 633

  • 634 Index

    XDM databasesbuilding classification model, 425–427classification process, 434–435classification tree, 429derivation of association rules, 438–441derivation of classified test data, 432–434derivation of evaluated association rules,

    444–447derived data items, 419evaluating association rules, 441–444initial state, 423–425nonderived data items, 419–420partial view, 428state, 419–421state transition, 421–422, 425–427, 436test phase, 430–432training set, 423–425view showing test phase, 433–434views, 422XDM data items, 419XDM element, 420

    XDM element, 420XDM-DATA-ITEM element, 419, 420–421,

    424, 429XDM-DERIVATION element, 419, 421, 429,

    433, 438, 444XDM-STATEMENT element, 422, 437, 442XDM-TRANSITION element, 421–422,

    425–426, 436–437, 441Xerces XML parser, 192–193, 359, 363Xindice, 43Xindices project, 61XMach-1 benchmark, 501, 544, 549

    book sales and reports database design,551–554

    native XML databases, 550–551queries, 489–490

    XMach-1 database, 488–489XML-related databases, 550–551XMark benchmark, 501

    queries, 495–497

    XMark database, 491–494XMAS, 472XMI, 569XML (eXtensible Markup Language),

    xxi, 388–389abuse of, 10advantages, 567business information systems, 261–262complex distributed requests, 112concepts, xxiv–xxviias container for data managed by legacy

    systems, 3control over presentation, 342conversion of Database Data into, 344conversion of Image Data into, 344–345conversion of XML input into objects,

    343–344converting documents from legacy format, 72for data exchange, 332data management, xxix–xxxData Model, 270as data transport for external data, 346database access, 345–347data-centric, xxiii–xxivdefinition of, xxiidirectory servers and, 543–544document-centric, xxiii–xxivDOM (Document Object Model), xxvi–xxviiDTDs (Document Type Definitions),

    xxiv–xxviexisting benchmarks, 482–497expressing information, 5–6extensibility, 4, 16, 342extensions to, 568flexibility, xxiii, 4, 10grammar, 3–4heterogeneity, 4as information domain, 4–5information model, 292–293informationally complete, 4infrastructure to support business, 568

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 634

  • Index 635

    interoperability, 342Java, 342linear/hierarchical structure, 118mapping legacy data formats to, 72mapping to another data model, 21marked-up content, 165Metcast catalog, 362Microsoft SQL Server 2000 access to,

    166–170modeling behavior of entire application

    once, 3namespaces, 368navigating tree structure, 183Nested Sets Model, 195nesting manipulation operations, 114with no data elements, 10OAI, 341–342object-oriented databases, 543object-relational databases and, 542–543patterns, 6–9platform-independent, 165, 190portability, 123, 342potential, 3query languages, 125, 472relational data, 165–166relational databases, 541–542relational views over, 176–180representation for DBUriType subtype,

    156–157research on, 260, 451revolutionary nature, 297–298self-describing, 4semi-structured data, 165serializing SQL query results into, 170–176

    simple data access, 112SQL-centric and XML-centric way to gener-

    ate, 170SQLOLEDB, 169storing, 31storing as CLOB, 126–132structured data, 165

    style, 3–4supporting along with relational storage,

    118–119syntax, 3transforming into another format, 141Unicode (Unicode Consortium 2000), 34Unicode-based syntax, 165use-case scenario, 261–262valid, xxiiiwell-formed, xxiiiXML schemas, xxiv–xxvi

    XML applicationsdata store help with creating, 116request support, 94textual content ignorable, 202

    XML Benchmark Project, 550XML Collections, 92, 99–101, 107XML Columns, 92, 107XML comments, 204XML data

    benchmark data set, 479–482benchmarks, 499–501bulk loading into relational databases, 166Bulkloading, 186changing data schema, 118characteristics, 502–505data set granularity, 504database support, 124depth, 502–503depth and width of tree, 480different approaches to storing, 467document centric, 191edge table view, 177–180fanout, 502–503flat streams, 467graphical interface for management of, 470

    indexing methods, 117mapping to relational databases, 467–468mapping to relational tables, 263–264, 271meta properties, 177–178metamodeling, 467

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 635

  • 636 Index

    XML data (continued)mixed content, 191mixed storage, 467multirow updates, 180OAI, 339parsing, 102posting into database, 151–152searching, 102shredded rowset view, 177–180storage, 472storing in native format, 72storing in shredded format, 264systems for integration, 472–473as text stream, 343transforming, 101–102validating, 102viewing mechanism for, 455–456XML columns, 95–96, 98–101

    XML data component, 468XML data store, 61XML databases

    benchmarks, 544–545, 549–550client/server, 71, 72embedded, 71–73guidelines for benchmarking, 545hierarchy of concepts, 157observations, 427–428position and structure of nodes, 61

    XML document tree, 177XML documents, xxviii

    access privileges, 157adding nodes, 284–285as aggregations of information, 309appending documents, 284atomic types, 270atomic value, 271base64 encoding, 344–345batch size, 151BLOB (Binary Large Object) form, 266building DOM tree from, 162

    building from relational data, 144–155closure property, 458complex attribute values, 157complex nested structure, 137–138composing or decomposing, 96–98composition functions, 326–327data about person, xxii–xxiiidata models, 520–523databases for storing, 523–533data-centric, xxiiidecoding processes, 344defining structure, xxiv–xxvideleting, 151Department of Education, 329describing structure, xxiiidirectory servers, 528–533displaying graphically as tree, 163document-centric, xxiiiDOM representations, 194editing, 163extensible constructs, xxvfine-grained storage, 135–144flow semantics, xxviiifollowing XML rules, 150formatting, xxviiigenerating from PL/SQL queries, 148–149generating Java classes representing struc-

    ture, 343–344Huffman coding, 345implicit order-document order, 118improved scalability, 114indexing schemes, 58–61input and output to external client

    interfaces, 342inserting, 151as linked chunks, 191locating and linking information, xxviimaintaining document order, 284management of large collections, xxiimanaging within databases, 452

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 636

  • Index 637

    manually building, 211–213mapping from side tables to, 96–98memory management in large, xxvi–xxviiMIME type, 158more complex XPath examples, 278native XML databases, 533navigating logical structure, 57–58navigation improvement, 114as node tree, 193node types, 270nodes, 271nontyped DOM implementation, 521–522opening, 234outputting, 31posting into databases, 151–152problem reports as, 323–324qualifying, 133–134query results as, 163querying, 254readable output, 245–247relational database storage, 524–526relational tables with CLOB columns, 126restructuring, 455–456restructuring possibilities, 458–459schema structure, 263schema-less storage, 44schemas, 26–28schematic information, 23–24self-describing, xxiiserializing out of repository, 206–211side tables, 95simple method for designing, 14–16sparcity, 118SQLXML templates, 180–181storage, 125, 163, 190–191storage benchmarks, 533–535storage in one table, 141–142stored as CLOBs (Character Large

    Objects), 94stored as files linked from DBMS, 94

    stored as fragments in collection of table,94–95

    stored in regular relational tables, 191stored intact in relational column, 94stored procedures for data entry, 225–231stored with XML Extender, 94storing as contiguous blocks of data, 284storing data with one node-per-second, 284studies in storing and retrieving, 541subsets of schema, 279table or view, 151tags, 271Tamino XML Server, 22 templates, 168translating into in-memory tree

    structure, 343transparently composed from one or

    many data sources, 102two-character hexadecimal

    representation, 344updating, 142, 151–152uploading, 220–247valid, 151valid characters, 344validating, xxiii, 161–162view model, 458–460views, 141–142visualizing, 163well formed, xxiiiwell-formedness check, 233–234XSL transformations, 163

    XML elements, proper use of, 6–7XML Extender, 92, 107, 323, 324, 477

    architecture, 94–95DAD files, 107DAD (data access definition) files, 96–98data storage, 113federated environments, 102locating elements and attributes, 96processing support for DB2, 94

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 637

  • 638 Index

    XML Extender, (continued)shared architecture and technology, 92subset of XPath standard, 102technology, 95–100UDFs (user-defined functions), 94, 102UDTs (user-defined types), 94XML application request support, 94XML collections, 99–101XML columns, 95–96, 98–101XML object storage, 94XML Schema support, 102

    XML Extender applications, 117XML files

    transforming and creating output file,162–163

    well-formed, 224–225XML fragment

    describing customer, 105output, 105–106

    XML markup languages, 298–300XML metadata, 118XML objects storage, 94–101XML parsers, 125–126, 160XML Path Language. See XPathXML query language, 480, 547–548XML repository prototype

    detailed design descriptions, 268–286high-level system architecture, 262–268overview, 261–262

    XML Schema, xxiii, xxiv–xxvi, 24, 270, 348attributes, 26collation facet, 34complexity, xxvdata items within attributes, 307data-centric, xxvnamespaces, xxv, 28simple XML document, xxvistring types, 34stronger typing constraints on data, xxvsubtrees, 26Tamino-specific notation, 25

    user-defined facets, 34wildcards, 26xs:annotation element, 27–28

    XML Schema Language, 568XML SourceView Bean, 163XML storage, 116–117

    handling objects of predefined type, 24Tamino XML Server, 23–25

    XML tags, 342XML template, 170XML Transform Panel bean, 163XML views, 166

    annotated schema, 182–183defining Customer-Order hierarchy, 182–183providing over relational data, 182–187selecting data from, 183updating using Updategrams, 185–186as virtual views, 184

    XML warehouses, 455–456architecture, 457–458Datawarehouse component, 468–469Datawarehouse implementation

    component, 458Datawarehouse specification component, 458DAWAX, 470–473defining views, 463–466different approaches to storing XML

    data, 467extracting data, 469graphic tool for specifications, 461–462managing, 466–469managing metadata, 462–466mapping XML to relational databases,

    467–468mediated schema definition, 466query manager, 458root element, 462specification, 458–462storage, 466–469View definition example, 460view model, 458–460

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 638

  • Index 639

    view storage, 468–469XML data component, 468

    XML2CLOB statement, 104, 106Xml2Sql component, 469XMLAGG statement, 103, 104XML-as-data model, 264–265XML-as-document model, 264–266, 284XMLATTRIBUTES statement, 103, 104xml_books.xml file, 54XMLCLOB type, 95XML_Data_Type SQL type, 270XML:DB API, 46–47, 50–52, 55xmldb namespace, 56XML:DB XSP logic sheet, 55–56xmldb:collection element, 56xmldb:get-xml function, 56xmldb:xpath tag, 56XML_Doc_Trees view, 276, 280XMLELEMENT statement, 104, 106XML-enabled databases, 477, 547–548XML-enabled repository, 262XMLFile type, 95XML_Node_Type type, 270XmlNote table, 468XML-QL (XML Query Language), 472, 548XML-related databases

    database size, 556–558reporting, 561–562SQL operations (mass records), 560–561SQL operations (single record), 558–560XMach-1 benchmark, 550–551

    XML-related technologiesSOAP (Simple Object Access Protocol), xxvixXPath, xxviiXSL (eXtensible Stylesheet Language),

    xxviii–xxixxmlrep database, 197XMLREP ODBC data source, 214xmlrepDB class, 213xmlrepDB.java file, 192xmlrepSAX class, 220–225

    xmlrepSAX.java file, 192xmlrep_user login, 197–198XML-RPC (XML Remote Procedure Call)

    accessing remote eXist server, 46client libraries, 52

    XML-RPC interface, 52XML_SECTION_GROUP section group, 131XMLSERIALIZE statement, 106xml-stylesheet parameter, 154XMLType object type, 125

    content in serialized format, 133converting, 146example, 133methods, 132processing in Java, 135

    XMLType objectscanonical mapping, 144–146checking well formedness, 133–134strings, 145

    XMLTypeTab table, 133XMLVarchar type, 95, 96XMS (XML Management Systems), 4

    behavior of, 8benchmark specification, 478–479index entries, 8native XML databases, 477XML-enabled databases, 477

    X-Node component, 28–31XPath, xxvii, 37, 264

    arbitrary information access, xxviiaxes operations, 275–276coercion rules of strings, 184combining with SQL, 134comparison operators, 279constraints, xxviiconstructs, 183–184data type coercion, 184eXist extensions, 47–49hierarchical information access, xxviimathematical and string-manipulation

    functions, 279

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 639

  • 640 Index

    XPath (continued)multiple qualifiers, 282node-to-string conversions in predicates, 184queries, 51, 183–185, 262query processing, 78–80regular path expressions, 57searching for strings, 48semantics, 184slow query execution, 48SQLOLEDB, 169–170substring searches, 79supported features, 158variety of queries over container, 76

    XPath expressions, 45, 280converting to SQL-3 queries, 264cursor to walk over results set, 256–257describing path within target document, 278ease of writing and understanding, 427evaluating, 267–268examples, 278–279generating, 254–257handling, 264identifying subsets of XML data

    document, 278parsing, 264paths, 254–256, 279predicate expressions filtering documentlocations, 279qualifiers, 282with range predicates, 270restrictions, 134simplicity, 278SPEC table, 279terminating with qualifying filter

    expression, 282 transparent information sources, 427validating, 178, 279

    XPath format, 157XPath interpreter, 268XPath queries, 53, 78–79, 168, 180–181

    calling SSE plug-in, 312–313parameter, 170

    XPathQueryService service, 52Xperanto project, 92, 124X-Plorer, 39XPointer, 158X-Port component, 22XQL (XML Query Language), 472, 548Xqps (XML queries per second), 550XQuery, 37, 57, 124, 264, 451, 472, 551

    interpreter, 267statements, 115–116support, 116–117

    XRel, 542XRM (eXtended integrated Reference

    Model), 399XRR (XML Repository), 118xs:annotation element, 27–28xs:appinfo element, 28XSD (XML Schema Definitions), 191xsd:anyType type, 271 special null attribute, 157XSL (eXtensible Stylesheet Language), xxiii,

    101–102, 141, 349, 455–456example, xxviii

    XSL FO (XSL Formatting Objects), xxviiiXSL transformations, 163XSL Transformer bean, 162–163XSLT (XSL Transformations), xxviii, 101–102,

    118, 141, 191XSLT style sheets, 54, 154XSP, 55XSQL

    bind-params example, 152default values for bind-params, 153directly executing DML (Data Manipulation

    Language) statements, 152include-request-params example, 153–154interoperability with JSP, 149invoking programs, 154–155lexical substitution parameter, 153parameterizing queries, 152–154query examples, 150–151

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 640

  • Index 641

    .xsql extension, 149

    .xsql filesexecuting, 154registering XSLT style sheet, 151

    XSQL servlet, 149–151XSQLConfig.xml file, 149 tag, 151 forms,

    153–154 forms, 153–154 tag, 154 tag, 151 tag, 151–152 tag, 149 tag, 151xs:schema element, 27

    XSU (XML SQL Utility), 125, 135canonical mapping, 136–139executing query in Java environment and

    producing XML, 139–141performance, 148registering style sheet, 141splitting XML documents into pieces and

    storing, 136transforming data into XML, 136

    X-Tension, 31–33, 37Xyleme, 473

    ZZipf distribution, 508

    Chaudhri_Index.qxd 2/11/03 12:04 PM Page 641

  • Chaudhri_Index.qxd 2/11/03 12:04 PM Page 642

  • Chaudhri_Index.qxd 2/11/03 12:04 PM Page 643

  • Chaudhri_Index.qxd 2/11/03 12:04 PM Page 644

  • Chaudhri_Index.qxd 2/11/03 12:04 PM Page 645

  • Chaudhri_Index.qxd 2/11/03 12:04 PM Page 646

  • Chaudhri_Index.qxd 2/11/03 12:04 PM Page 647

  • Chaudhri_Index.qxd 2/11/03 12:04 PM Page 648