a clustering method based on repeated trip behaviour to identify road user classes using bluetooth...

Institute for Transport StudiesFACULTY OF ENVIRONMENT

A clustering method based on repeated trip behaviour to identify road user classes using Bluetooth data

F. CrawfordInstitute for Transport Studies, University of LeedsEmail: ts12fc@leeds.ac.uk

Repeated trip making

Often assumed that urban traffic consists of commuters who drive between home and work at the same times each weekdayBut…• increases in part time, flexible and home working?• longer shop opening hours?

What proportion of travellers on the roads are these mythical regular commuters?

Point-to-point sensors e.g. Bluetooth

Methodology overview

Traveller 1: (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) ….

Traveller 3: (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) (s, t) ….…….

Sensor 1 Sensor 2 Sensor 3 ………

………

Traveller 1: freq1, spat1, tod1

…….

Cluster A Cluster DCluster CCluster B ………

Trip frequency

• Simply look at the number of trips per traveller in the data• Assume individual trips missing at random• Using data in this format can we calculate other measures

to provide other types of information?

Spatial variability: Sequence Alignment

- OD pairs?

- OD pairs?- Trip sequences?

Sequence Alignment

Seq1: ABDC

Seq1: ABDCSeq2: BEDF

Spatial variability:Sequence Alignment

Dissimilarity between sequence x and y:

Seq1: A B - D C

Seq2: - B E D F

Time of day variability

- Which are ‘comparable trips’? No information about trip purpose etc.

- Use as much data as possible- Time at most common site (likely to be near home/work?)- Avoid arbitrary cut-offs

The times of day I walk along my street

8am 5pm 8pm4pm7am 1pm

Time of day

The times of day I walk along my street

8am 5pm 8pm4pm7am 1pm

Time of day

Mixture of Gaussian Distributions?

Model-based clustering using Maximum Likelihood Estimation

Which cluster does each observation belong to?What are the parameters associated with each cluster?

Likelihood function:

P(X,Z|Ѳ)

- Expectation-Maximisation algorithm

Overall clustering

…….

Cluster A Cluster DCluster CCluster B ………

Empirical example - Wigan

Data from the 23 fixed Bluetooth detectors in and around the town of Wigan (Figure 3) is analysed for a full year (2015). Data from the 23 fixed Bluetooth detectors in and around the town of Wigan (Figure 3) is analysed for a full year (2015).

A full year of data (2015) from 23 fixed Bluetooth detectors in and around Wigan

Trip frequency

The data for 2015 included:• 7.5 million trips• 327,264 unique MAC addresses• almost 28% of the travellers had only 1 trip• just 2% had greater than or equal to 260 trips (equivalent to

at least one trip per working day in the year)

Spatial variability

15 most common sequences in one spatial cluster

A-B-M-N-R-T-W A-B-G-N-R-T-W A-B-M-R-WA-B-G-M-N-R-T-W A-B-R-T-W A-B-M-N-S-WA-B-N-R-T-W A-B-M-R-T-W A-B-M-N-S-T-WB-G-M-N-R-T-W A-B-R-W A-B-G-M-R-T-WA-B-M-N-R-W A-B-N-R-W A-B-G-M-N-R-W

Road user classes

Using the Elbow Method, decided on 9 road user classes

Approximately 3 groups of 3:• infrequent (< 1 / week), • frequent, and • very frequent (> 1.5 / day)

Trips in 20150

2 4 12 92226

Average trip per person

Infrequent travellers (ABC)

• 92% of travellers• 23% of trips

• Less than 1 trip per week (6 trips per year on average)• Intrapersonal variability?

Trips in 20150

C 12.3

Average trip per person

More frequent travellers

Freq travellers (DEF)

Very freq travellers (GHI)

Total trips observed 57%

Travellers observed 8%

Frequency 1/week to 1.5/day(50-550)

Average trips per spatial cluster

% trips in most common spatial cluster

Average number of time of day clusters

Average time of day cluster variance

More trips -> more clusters with smaller

variance

More frequent travellers

Freq travellers (DEF)

Very freq travellers (GHI)

Total trips observed 57% 20%

Travellers observed 8% 0.5%

Frequency 1/week to 1.5/day(50-550)

>1.5/day(550-6155)

Average trips per spatial cluster

4-10 12-23

% trips in most common spatial cluster

29% 25-20%

Average number of time of day clusters

2-4 4.5-5.5

Average time of day cluster variance

More trips -> more clusters with smaller

variance

Smaller variance on average than DEF, but fairly constant by trips

Conclusions

• A method to identify road user classes was presented• Method was successfully applied to a fairly large case study

area• User classes depend on trip frequency and tell us about

spatial and temporal variability• Future work

Acknowledgements

Supervised by Professor David Watling and Dr Richard Connors at ITS

Funded by

Data from

http://www.its.leeds.ac.uk/people/f.crawford

Thank you for listening!

Any questions?

a clustering method based on repeated trip behaviour to identify road user classes using bluetooth...

Science

10/18/2007 eets 73041 bluetooth bluetooth architecture...

o bluetooth - motorola€¦ · o bluetooth - motorola ... o...

bluetooth presenter bluetooth presenter

bluetooth profile. bluetooth profile a bluetooth profile is...

the repeated-measures anova - rutgers...

1 sta 617 – chp11 models for repeated data analyzing...

12:15-13.30 inspirationsoplæg datasandboxes · consistent...

anova approaches to repeated measures • univariate...

clustering. 2 outline introduction k-means clustering...

bt-micro4 qig v2abluetooth bluetooth bluetooth 77djlŒfž(r)...

repeated reading - lsu human development center | new...

clustering iv. outline impossibility theorem for clustering...

fm10bt - media.djmania.netmini haut-parleur bluetooth mini...

sistema sonido bluetooth 150w 150w bluetooth sound...

bi-clustering. 2 data mining: clustering where k-means...

bluetooth接続マルチメディアリモコン iremote...

a look at repeated readings. agenda what is repeated...

fuzzy clustering 2009/2010. 2 what is data clustering? ...

chapter19 clustering analysis. content similarity...

location-based group clustering using ble beacons in an...