department of computer science city university of hong kong department of computer science city...

31
1 Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong A Statistics-Based Sensor Selection Scheme for Continuous Probabilistic Queries in Sensor Networks Song Han 1 , Edward Chan 1 , Reynold Cheng 2 , and Kam-Yiu Lam 1 Department of Computer Science 1 , City University of Hong Kong 83 Tat Chee Avenue, Kowloon, HONG KONG Department of Computing 2 Hong Kong Polytechnic University PQ706, Mong Man Wai Building Hung Hom, Kowloon, Hong Kong

Upload: sophie-erika-johnson

Post on 31-Dec-2015

218 views

Category:

Documents


5 download

TRANSCRIPT

Page 1: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

1Department of Computer ScienceCity University of Hong Kong

Department of Computer Science

City University of Hong Kong

A Statistics-Based Sensor Selection Scheme for Continuous Probabilistic

Queries in Sensor Networks

Song Han1, Edward Chan1, Reynold Cheng2, and Kam-Yiu Lam1

Department of Computer Science1, City University of Hong

Kong83 Tat Chee Avenue, Kowloon,

HONG KONG

Department of Computing2 Hong Kong Polytechnic UniversityPQ706, Mong Man Wai Building

Hung Hom, Kowloon, Hong Kong

Page 2: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

2

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Agenda

Introduction Objective System Model Methodology Performance Analysis Conclusion

Page 3: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

3

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Introduction

Constantly-evolving Environment

Uncertainty of Sensor Data

Sensor Data are erroneous, unreliable and noisy Database may store inaccurate values Query results can be incorrect

Page 4: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

4

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Introduction

Statistical Model of Sensor Uncertainty

A sensor value can be described more accurately as a Gaussian Distribution

Mean µ Variance σ2

Gaussian Distribution(,2)

Page 5: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

5

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Introduction

Probabilistic Queries [SIGMOD03]

Represent the imprecision in the value of the data as a probability density function. e.g., Gaussian

Augment query answers with probabilities Give us a correct (possibly less precise) answer,

instead of a potentially incorrect answer

Page 6: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

6

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Introduction

Query Quality and Variance

Query quality can be improved with lower variance To obtain a smaller σ2, a simple idea is to use more

sensors Get an average of these readings N(µ,σ2) becomes N(µ,σ2/ns), where ns is the number of

“redundant” sensors

Page 7: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

7

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Introduction

Deploying Redundant Sensors Exploit the fact that sensors are cheap Example: 1000 sensors in the room to obtain average

temperature Variance decreased by a factor of 1000

Resource Limitation Problem Wireless network has limited bandwidth Sensors have limited battery power Can’t afford too many sensors!

Page 8: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

8

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Introduction

The Sensor Selection Problem

How to decide sensors’ sampling period How many sensors to use for the guaranteed

level of query quality? Select which sensors?

Page 9: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

9

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Objective

Adaptive Sampling Period Decision Scheme Find out the minimum variance of each entity

being monitored to meet the probabilistic query quality requirement

Select minimum number of “good” sensors to achieve the required variance

Decide which sensors should be selected

Page 10: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

10

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

System Model

User

Base Station

WirelessNetwork

region region

region

region

Page 11: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

11

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

System Model

User

Base Station

coordinator

Page 12: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

12

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Methodology

Adaptive Sampling Period Decision

Sensor Selection Process1. obtain (, max 2) from sensors in region

2. Derive max 2 for each item to satisfy quality

3. Determine sensor nodes to be used

Page 13: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

13

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Adaptive Sampling Period Decision

The region’s value is changing continuously Periodical Sample will consume excessive

system resource Adaptive Sample Scheme for MAX/MIN query

ESSENCE: To increase the sampling period for the regions whose values have little effect on the query result.

Page 14: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

14

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Adaptive Sampling Period Decision Adaptive Sample Scheme for MAX/MIN query

Predicted Sampling Time (PST)

max max

max

(( 3 ( )),0)i ii

i

MaxPST

v v

Page 15: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

15

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Sensor Selection Process

Types of Probabilistic Queries Factors Affecting Query Quality Probabilistic Query Quality An Example: MAX Query Reselection of Sensors for Continuous

Queries

Page 16: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

16

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Types of Probabilistic Queries

MAX/MIN: Which region has max or min temperature? (A, 60%), (B, 30%), (C, 10%)

AVG/SUM: What is the average temperature of regions A, B and C?

Range Count: How many objects are within 50m from me?

COUNT 1 2 3 4 5

Probability 0.1 0.2 0.5 0.15 0.05

Page 17: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

17

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Factors Affecting Query Quality

Error distribution of each sensor reading

Variance of Gaussian distribution

Each query has its own correctness requirement1. MAX / MIN

2. AVG / SUM

3. Range Count Query

Page 18: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

18

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Probabilistic Query Quality

Probabilistic queries allow specification of answer quality

1. MIN/MAX: highest probability ≥ P

2. AVG/SUM: variance of answer ≤ T

3. Range count: Top K counts contribute total probability ≥ P

Page 19: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

19

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Example: MAX Query

pi ( f i(s)

j1̂ ji

N

( f j (t)dt

s )ds)

Quality requirement: the maximum of pi must be larger than P

Let the probability of the i-th region be p i, where fi(s) is the pdf of N(µ,σ2)

Page 20: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

20

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

1. Set the variance of each region (σ1,σ2,…, σn) to their maximum possible

2. Find pimax, the maximum of pi’s

3. Find jmax, the index of the maximum of

i.e., the sensor with greatest impact to pimax

Finding variance for MAX

1 2( , ,..., )nk

P

Page 21: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

21

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Finding variance for MAX (Cont.)

4. Adjust variance of the jmaxth sensor

σjmax=σjmax-∆σ

5. Keep reducing variances until

pimax(σ1,σ2,…, σn) P

6. Return σ1,σ2,…, σn as the variances for the n regions

Page 22: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

22

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Deciding Set of Sensors

Distribution of ns samples follows normal distribution N(µ,σ2/ns)

Compute ns satisfying σ2/ns ≤ max variance

Compute expected value of E(s) Select ns sensors with the lowest difference of

readings from E(s) Only these sensors send their sampled values to

the coordinator for computing N(µ,σ2/ns)

Page 23: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

23

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Reselection of Sensors for CQ

Sensor selection runs again when:

1. Probabilistic query quality cannot be met (e.g., due to change of mean)

2. Coordinator detects some sensor is faulty (e.g., its value deviates significantly from the majority) or gives no response after some timeout period

Page 24: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

24

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Simulation Model

Continuous query length: 1000 sec Sensor sampling interval: 5 sec Number of regions: 4 Number of sensors per region: U [100,150] Sensor error variance range: 5-25% Difference in the values of different regions: 2-10% Quality requirement for MIN/MAX Query : 95% Variance Change Step (∆σ): 0.3

Page 25: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

25

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Performance Analysis

% in Sensor Selected vs.Difference in Region’s Values

Accuracy vs.Difference in Region’s

Values

Page 26: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

26

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Performance Analysis

Accuracy vs.Sensor Error Variance Percentage

Percentage of Sensors Selected vs.Sensor Error Variance

Page 27: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

27

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Performance Analysis

Changes in Value of Regions over Time Percentage of Sensors Selected over Time for Continuous Changes in Values of Regions

Page 28: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

28

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Conclusion

Accuracy improved through multiple sensors Adaptive Sample Period Decision Scheme Limited network bandwidth allows only limited

number of redundant sensors Sensor selection algorithm selects good sensors

for reliable readings

Page 29: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

29

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Future Work

Region Selection Reducing the Computational Complex of the

sensor selection progress Differentiating bad sensors from “good ones”

that report true surprising events Hierarchical organization of coordinators How to assign coordinators?

Page 30: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

30

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

References

1. [VSSN04] K.Y. Lam, R. Cheng, B. Y. Liang and J. Chau. Sensor Node Selection for Execution of Continuous Probabilistic Queries in Wireless Sensor Networks. In Proc. of ACM 2nd Intl. Workshop on Video Surveillance and Sensor Networks, Oct, 2004.

2. [SIGMOD03] R. Cheng, D. Kalashnikov and S. Prabhakar. Evaluating Probabilistic Queries over Imprecise Data. In Proc. of ACM SIGMOD, June 2003.

3. [Mobihoc04] D. Niculescu  and B. Nath. Error characteristics of adhoc positioning systems. In  Proceedings of the ACM Mobihoc 2004, Tokyo, Japan, May 2004.

4. [WSNA03] E. Elnahrawy and B. Nath. Cleaning and Querying Noisy Sensors. In ACM WSNA’03, September 2003, San Diego, California.

Page 31: Department of Computer Science City University of Hong Kong Department of Computer Science City University of Hong Kong 1 A Statistics-Based Sensor Selection

31

Department of Computer Science

City University of Hong Kong

Statistics-Based Sensor Selection Scheme in Sensor Networks

Thank you!

HAN Song

[email protected]