rethinking choices for multi-dimensional point indexing

13

Rethinking Choices for Multi-dimensional Point Indexing You Jung Kim and Jignesh M. Patel University of Michigan

Upload: norman-combs

Post on 03-Jan-2016

13 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

DESCRIPTION

Rethinking Choices for Multi-dimensional Point Indexing. You Jung Kim and Jignesh M. Patel. University of Michigan. Outline. Motivation Index structures Experimental evaluation Conclusion. Motivation. Need for multi-dimensional point indexing in low to medium dimensional space - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Rethinking Choices for Multi-dimensional Point Indexing

Rethinking Choices for Multi-dimensional Point Indexing

You Jung Kim and Jignesh M. Patel

University of Michigan

Page 2: Rethinking Choices for Multi-dimensional Point Indexing

Outline

Motivation Index structures Experimental evaluation Conclusion

Page 3: Rethinking Choices for Multi-dimensional Point Indexing

Motivation

Need for multi-dimensional point indexing in low to medium dimensional space Inherent nature of problems Use of dimensionality reduction techniques, e.g. PCA

Examples Spectral/image search (in feature space) Similarity search in sequence and structure databases Subsequence matching in time-series databases

Frequent choice: R*-tree

Is this the Right Choice?

Page 4: Rethinking Choices for Multi-dimensional Point Indexing

Index Structures

R* tree

Data Partition

Quadtree

Balanced/Disjoint Space Partition

Pyramid-Technique

Unbalanced/Disjoint Space Partition

Balanced Tree Unbalanced Tree Balanced Tree

Page 5: Rethinking Choices for Multi-dimensional Point Indexing

Packed Quadtree

Reduced disk footprint for the index Clustering sibling nodes

Regular QuadtreeRegular Quadtree Packed QuadtreePacked Quadtree

Page 6: Rethinking Choices for Multi-dimensional Point Indexing

Experimental Setup

Three indices and a file scan in SHORE Synthetic and real datasets

Uniformly distributed point dataMAPS Catalog data

Query workload Random and skewed queries following the

underlying data distribution

Page 7: Rethinking Choices for Multi-dimensional Point Indexing

Experiments with uniform data

Uniform-2D Uniform-4D Uniform-8D

Total execution time for varying data dimensionality

Page 8: Rethinking Choices for Multi-dimensional Point Indexing

Experiments with skewed data

MAPS-2D MAPS-4D MAPS-8D

Total execution time for varying data dimensionality

Page 9: Rethinking Choices for Multi-dimensional Point Indexing

Analysis with skewed data

The (relative) poor performance of R*-treeHigh overlap amongst MBRs Skewed data points are spread under several non-le

af nodes The (relative) poor performance of Pyramid-T

echniqueThe unbalanced space split is adversarial for skewe

d data

Page 10: Rethinking Choices for Multi-dimensional Point Indexing

Quadtree

Uses the buffer pool very efficiently Better spatial locality with skewed queries

R*-tree Quadtree

Page 11: Rethinking Choices for Multi-dimensional Point Indexing

Effect of packing in Quadtree

MAPS-2D MAPS-4D MAPS-8D

Total execution time of packed and unpacked Quadtree

Page 12: Rethinking Choices for Multi-dimensional Point Indexing

Conclusion Quadtree outperforms R*-tree and Pyramid-Tech

nique, especially for skewed (real) datasets Efficiency of the Quadtree comes from

Packing technique Regular and disjoint partitioningBetter spatial locality and an efficient use of buffer

Analytical cost model agrees with experimental results i.e. our claims are not due to implementation differences, or dat

aset peculiarities

Page 13: Rethinking Choices for Multi-dimensional Point Indexing

Questions?

RETHINKING DECISION USEFULNESS - … about rethinking decision usefulness. ... rather to shape future choices amongst alternative ... any situation in which a choice must be made involving

Improvements in the FamilySearch Indexing Programmedia.ldscdn.org/pdf/family-history/web-indexing/... · Indexing Training Guide Improvements in the FamilySearch Indexing Program

Marwan Al-Namari Hassan Al-Mathami. Indexing What is Indexing? Indexing is a mechanisms. Why we need to use Indexing? We used indexing to speed up access

INDEXING JURNAL ELEKTRONIK - pdsi.unisayogya.ac.idpdsi.unisayogya.ac.id/.../wp-content/uploads/2015/11/indexing-journal-elektronik.pdf · INDEXING JURNAL ELEKTRONIK-----Workshop Pengelolaan

A20Direct C-axis indexing enables deceleration direct to chosen . ... to minimise processing and calculation times. A20 Rotating Rotating Indexing Indexing Direct C-axis indexing function

CENDI Indexing Workshop NASA Headquarters September · PDF fileCENDI Indexing Workshop NASA Headquarters ... DOE/OSTI Proposal to Develop a Category Indexing ... CENDI Indexing Workshop

INTViewer Indexing Tutorial - INT | 2D/3D Data ... · INTViewer Indexing Tutorial Page 1 INTViewer Indexing Tutorial This tutorial illustrates how to use the Indexing feature of the

Indexing languages

KSR Indexing Chuck · 2019-11-25 · Automatic Indexing chuck KSR Indexing Chuck 2 Configuration diagram of an automatic indexing chucking system Configuration diagram of an automatic

Indexing Techniques for Multimedia Databases Multimedia Similarity Search Structure Image Indexing Video Indexing

Fundamental Indexing

Horizontal Machining Center MA-400HA - Prime MachineCurvic coupling 1˚ indexing (Standard), NC 0.001˚ indexing (Optional) Indexing time (90 ˚/180 ) 1˚ indexing: 1.2/1.5 sec, 0.001˚

Rethinking green infrastructure Rethinking green ... · PDF fileRethinking green infrastructure Rethinking green infrastructure Rethinking green infrastructure Rethinking green

MongoLA - Indexing

INDEXING* INDEXING*

Green Lecture Series: Rethinking Local Energy Choices and Costs

Indexing basics

Multidimensional Indexing

Rethinking food and nutrition science · Rethinking food and nutrition science discussion paper 5 Empowering food choices . Empowering consumers to improve their food and nutrition

File Storage and Indexing. File Organizations Indices Types of index Tree based indexing Hash based indexing

File Processing - Indexing MVNC1 Indexing Jim Skon

Indexing Techniques Indexing Techniques in Warehousing …H.Haddouti/UB_Tree.pdf · Indexing Techniques Indexing Techniques in ... Processing Relational OLAP Queries with UB-

Indexing and Active Fund Management: International Evidence...indexing and lower in countries with more closet indexing. Overall, our evidence suggests that explicit indexing improves

Multidimensional Indexing: Spatial Data Management & High Dimensional Indexing

Rethinking food and nutrition science - Monash University › files › 254638675 › 254638560_oa.pdfRethinking food and nutrition science discussion paper 5 Empowering food choices

Random Indexing

BUSINESS STRATEGY: RETHINKING COLOR CHOICES...Are business and design strategists equipped with the color ... predictable than previously thought, color choices become based less on

Executive Summary Rethinking the Marketing Why is a Modern … Ebook on Marketing Englis… · Rethinking Presence Rethinking Content Approaches Rethinking Marketing Skills Rethinking

manual indexing

FamilySearch Indexing : Indexing - LDS

Indexing & retrieval. Approaches to indexing Key word indexing Concept indexing Social indexing Non-text indexing

STRATEGIC RETHINK CHOICES - RAND Corporation€¦ · Afghanistan and Pakistan: Staying or Going..... 108 CHAPTER TEN Rethinking American ... both parties will probably try to distinguish

Salton2-1 Automatic Indexing Hsin-Hsi Chen. Salton2-2 Indexing indexing: assign identifiers to text items. assign: manual vs. automatic indexing identifiers: