EFFICIENT ALGORITHMS FOR FINDING DIFFERENCES BETWEEN PROCESS MODELS A.V. SKOBTSOV, A.A.KALENKOVA 2019 IVANNIKOV ISP RAS OPEN CONFERENCE DECEMBER 05, 2019




Process mining

The number of information systems around us is constantly growing…

We don't always know how they are used:

• What actions are most often performed?

• In what order?

• Are there bottlenecks?

• How far are we from the expected behavior of the system?


Process mining

User interaction:

• Internet marketing;

• E-government services;

Complex multi-component systems (software process mining)


Process mining

[Figure: process mining overview. Recoverable labels: "Process-Aware Information Systems", "Industry", sample traces "a b", "b a", "a b c" over activities a to e, a process model with nodes a to f, and "Discovery".]

• Discovery


Process mining

[Figure: process mining overview. Recoverable labels: "Process-Aware Information Systems", "Industry", sample traces "a b", "b a", "a b c" over activities a to e, a process model with nodes a to f, and "Enhancement".]

• Enhancement


Process mining

[Figure: process mining overview. Recoverable labels: "Process-Aware Information Systems", "Industry", sample traces "a b", "b a", "a b c" over activities a to e, a process model with nodes a to f, and "Discovery", "Conformance checking", "Enhancement".]

• Conformance checking


Process mining. Conformance checking

L2L (log to log) – Comparison of event structures

L2M (log to model) – Replay techniques

M2M (model to model) – Must be something visual?


Finding Minimal Graph Edit Distance

[Figure: two process models to compare. Model 1 nodes: register, sign in, fill the form, select payment type, pay. Model 2 nodes: register, sign in, fill the form.]
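To make the notion of an edit distance between the two models concrete, the sketch below scores one candidate vertex mapping. It is a minimal illustration and not the authors' implementation: the chain-shaped edges, the constants `NODE_INDEL` and `EDGE_INDEL`, and the function names are assumptions made for this example. Only the node labels and the idea of Levenshtein and deletion costs come from the slides.

```python
# Illustrative only: cost values and the chain-shaped edges are assumptions.
NODE_INDEL = 10  # assumed cost of deleting or inserting one vertex
EDGE_INDEL = 1   # assumed cost of deleting or inserting one edge


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # substitute
        prev = cur
    return prev[-1]


def edit_cost(labels1, edges1, labels2, edges2, mapping):
    """Cost of turning graph 1 into graph 2 under a given vertex mapping.

    mapping[i] is the index of the graph-2 vertex matched to vertex i of
    graph 1, or None if vertex i is deleted.
    """
    cost = 0
    for i, j in enumerate(mapping):
        cost += NODE_INDEL if j is None else levenshtein(labels1[i], labels2[j])
    used = {j for j in mapping if j is not None}
    cost += NODE_INDEL * (len(labels2) - len(used))  # unmatched vertices are inserted

    # Edges of graph 1 whose image is not an edge of graph 2 are deleted.
    deleted = sum(1 for u, v in edges1 if (mapping[u], mapping[v]) not in edges2)
    # Edges of graph 2 that are not the image of any graph-1 edge are inserted.
    images = {(mapping[u], mapping[v]) for u, v in edges1}
    inserted = len(edges2 - images)
    return cost + EDGE_INDEL * (deleted + inserted)


labels1 = ["register", "sign in", "fill the form", "select payment type", "pay"]
edges1 = {(0, 1), (1, 2), (2, 3), (3, 4)}   # assumed sequential flow
labels2 = ["register", "sign in", "fill the form"]
edges2 = {(0, 1), (1, 2)}                   # assumed sequential flow

# Match the three shared activities, delete the two extra ones.
print(edit_cost(labels1, edges1, labels2, edges2, [0, 1, 2, None, None]))  # 22
```

Under these assumed costs the mapping pays for two deleted vertices and two deleted edges. The minimal graph edit distance is the smallest such cost over all mappings.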


A* Algorithm

[Figure: A* search over the two models (register, sign in, fill the form, select payment type, pay; and register, sign in, fill the form), shown over three animation frames. Recoverable labels: the edge "register register"; the state costs cost=0, cost=10 (Levenshtein distance), cost=13 (Levenshtein distance) and cost=10 (deletion cost); further states are elided as "…".]

NP-hard problem
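The A* idea can be sketched on a simplified version of the problem. The code below is a hedged illustration, not the method used in the talk: it searches over partial vertex mappings, counts only vertex costs (edges are ignored for brevity), and uses a lower-bound heuristic chosen for this example. The function names and the `indel=10` default are assumptions.

```python
import heapq
from itertools import count


def levenshtein(a: str, b: str) -> int:
    """Dynamic-programming Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def a_star_mapping(labels1, labels2, indel=10):
    """Optimal vertex mapping (vertex costs only) found with A*.

    A state is a tuple assigning the first k vertices of model 1 to a
    vertex index of model 2 or to None (deletion).
    """
    n1, n2 = len(labels1), len(labels2)
    sub = [[levenshtein(a, b) for b in labels2] for a in labels1]

    def heuristic(k, used):
        # Lower bound: every remaining vertex pays at least its cheapest
        # option, and surplus model-2 vertices must be inserted.
        free = [j for j in range(n2) if j not in used]
        h = sum(min([indel] + [sub[i][j] for j in free]) for i in range(k, n1))
        return h + indel * max(0, len(free) - (n1 - k))

    tie = count()  # tie-breaker so the heap never compares mappings
    heap = [(heuristic(0, set()), next(tie), 0, ())]
    while heap:
        f, _, g, mapping = heapq.heappop(heap)
        k = len(mapping)
        if k == n1:
            # At a goal state the heuristic equals the exact insertion cost,
            # so f is the total cost of the complete mapping.
            return f, list(mapping)
        used = {j for j in mapping if j is not None}
        for j in [None] + [j for j in range(n2) if j not in used]:
            step = indel if j is None else sub[k][j]
            child_used = used | ({j} if j is not None else set())
            child_g = g + step
            child_f = child_g + heuristic(k + 1, child_used)
            heapq.heappush(heap, (child_f, next(tie), child_g, mapping + (j,)))
    return None


model1 = ["register", "sign in", "fill the form", "select payment type", "pay"]
model2 = ["register", "sign in", "fill the form"]
print(a_star_mapping(model1, model2))  # (20, [0, 1, 2, None, None])
```

Because the heuristic never overestimates the remaining cost, the first complete mapping popped from the queue is optimal. The number of states still grows exponentially with model size in the worst case, which motivates the approximate methods that follow.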


Greedy algorithm

[Figure: diagram of the search space, labelled "Possible solutions".]
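A greedy strategy can be sketched as follows. This is a minimal illustration under stated assumptions rather than the authors' algorithm: vertices of the first model are processed in order, each takes the locally cheapest option (a free vertex of the second model or deletion), and no choice is ever revised. Only vertex costs are counted, and `indel=10` and the function names are assumptions.

```python
def levenshtein(a: str, b: str) -> int:
    """Dynamic-programming Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def greedy_mapping(labels1, labels2, indel=10):
    """Pick the cheapest option for each vertex in turn, never backtracking."""
    used, mapping, cost = set(), [], 0
    for label in labels1:
        # Deletion is always possible; matching needs a still-free vertex.
        options = [(indel, None)]
        options += [(levenshtein(label, labels2[j]), j)
                    for j in range(len(labels2)) if j not in used]
        step, choice = min(options, key=lambda option: option[0])
        mapping.append(choice)
        cost += step
        if choice is not None:
            used.add(choice)
    cost += indel * (len(labels2) - len(used))  # leftover vertices are inserted
    return cost, mapping


model1 = ["register", "sign in", "fill the form", "select payment type", "pay"]
model2 = ["register", "sign in", "fill the form"]
print(greedy_mapping(model1, model2))      # (20, [0, 1, 2, None, None])

# A case where the locally best first choice blocks the global optimum:
print(greedy_mapping(["ab", "a"], ["a"]))  # (11, [0, None]); the optimum is 10
```

The second call shows the usual trade-off: the greedy run is fast but can commit early to a match that a later vertex needed.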


Tabu search algorithm

[Figure: diagram of the search space, labelled "Possible solutions". Over three animation frames the solutions x, y and z are marked in turn and added to the tabu list.]

Tabu list = <x>

Tabu list = <x, y>

Tabu list = <x, y, z>
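The growing tabu list on these slides can be illustrated with a small sketch. It is an assumption-laden example, not the implementation from the talk: solutions are complete vertex mappings, a move reassigns one vertex or swaps two assignments, recently visited solutions are kept in a bounded tabu list, and only vertex costs are counted. All names and default parameters are hypothetical.

```python
from collections import deque


def levenshtein(a: str, b: str) -> int:
    """Dynamic-programming Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def total_cost(mapping, labels1, labels2, indel=10):
    """Vertex-only edit cost of a complete mapping (None means deletion)."""
    used = {j for j in mapping if j is not None}
    cost = sum(indel if j is None else levenshtein(labels1[i], labels2[j])
               for i, j in enumerate(mapping))
    return cost + indel * (len(labels2) - len(used))


def neighbours(mapping, n2):
    """All mappings reachable by one reassignment or one swap."""
    used = {j for j in mapping if j is not None}
    for i in range(len(mapping)):
        for j in [None] + [j for j in range(n2) if j not in used]:
            if j != mapping[i]:
                yield mapping[:i] + (j,) + mapping[i + 1:]
    for i in range(len(mapping)):
        for k in range(i + 1, len(mapping)):
            if mapping[i] != mapping[k]:
                swapped = list(mapping)
                swapped[i], swapped[k] = swapped[k], swapped[i]
                yield tuple(swapped)


def tabu_search(labels1, labels2, iterations=50, tabu_size=10, indel=10):
    current = (None,) * len(labels1)          # start by deleting everything
    best, best_cost = current, total_cost(current, labels1, labels2, indel)
    tabu = deque([current], maxlen=tabu_size)  # recently visited solutions
    for _ in range(iterations):
        candidates = [m for m in neighbours(current, len(labels2)) if m not in tabu]
        if not candidates:
            break
        # Move to the best non-tabu neighbour, even if it is worse.
        current = min(candidates, key=lambda m: total_cost(m, labels1, labels2, indel))
        tabu.append(current)
        cost = total_cost(current, labels1, labels2, indel)
        if cost < best_cost:
            best, best_cost = current, cost
    return best_cost, list(best)


model1 = ["register", "sign in", "fill the form", "select payment type", "pay"]
model2 = ["register", "sign in", "fill the form"]
print(tabu_search(model1, model2))  # (20, [0, 1, 2, None, None])
```

Accepting the best non-tabu neighbour even when it is worse is what lets the search leave a local optimum, while the tabu list keeps it from cycling straight back.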


Simulated Annealing algorithm
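The slide gives only the name of the method, so the following is a generic textbook-style sketch of simulated annealing applied to the vertex-mapping problem, not a reconstruction of the authors' variant. The move set, the geometric cooling schedule, and every parameter value (`steps`, `t_start`, `cooling`, `indel`, `seed`) are assumptions, and only vertex costs are counted.

```python
import math
import random


def levenshtein(a: str, b: str) -> int:
    """Dynamic-programming Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def total_cost(mapping, labels1, labels2, indel=10):
    """Vertex-only edit cost of a complete mapping (None means deletion)."""
    used = {j for j in mapping if j is not None}
    cost = sum(indel if j is None else levenshtein(labels1[i], labels2[j])
               for i, j in enumerate(mapping))
    return cost + indel * (len(labels2) - len(used))


def simulated_annealing(labels1, labels2, steps=2000, t_start=10.0,
                        cooling=0.995, indel=10, seed=0):
    rng = random.Random(seed)
    n1, n2 = len(labels1), len(labels2)
    current = [None] * n1                      # start by deleting everything
    current_cost = total_cost(current, labels1, labels2, indel)
    best, best_cost = list(current), current_cost
    temperature = t_start
    for _ in range(steps):
        # Random move: give vertex i a new target j; if another vertex
        # already holds j, the two assignments are swapped.
        candidate = list(current)
        i = rng.randrange(n1)
        j = rng.choice([None] + list(range(n2)))
        if j is not None and j in candidate:
            candidate[candidate.index(j)] = candidate[i]
        candidate[i] = j
        candidate_cost = total_cost(candidate, labels1, labels2, indel)
        delta = candidate_cost - current_cost
        # Always accept improvements; accept worse moves with a probability
        # that shrinks as the temperature drops.
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            current, current_cost = candidate, candidate_cost
            if current_cost < best_cost:
                best, best_cost = list(current), current_cost
        temperature *= cooling
    return best_cost, best


model1 = ["register", "sign in", "fill the form", "select payment type", "pay"]
model2 = ["register", "sign in", "fill the form"]
cost, mapping = simulated_annealing(model1, model2)
print(cost, mapping)
```

The result depends on the random seed and the schedule, so no output is quoted here. The returned cost is never worse than that of the starting mapping, because the best solution seen so far is tracked separately.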


Ant Colony algorithm

1. Initialize the pheromone map with the initial value.

2. Generate N ants; each ant does the following:

   1. Generates all possible vertex replacements for the current state.

   2. Calculates the cost of each vertex replacement by the formula $\frac{\mathit{edgePheromones}^{\mathit{pheromonePower}}}{\mathit{pathCost}^{\mathit{distancePower}}}$, where $\mathit{edgePheromones}$ is the cost of replacing vertices (taken from the pheromone map) and $\mathit{pathCost}$ is the sum of all the edge and vertex replacements.

   3. Selects a random replacement, where the probability of each replacement is $\frac{\mathit{replacementCost}}{\mathit{sumOfAllCosts}}$.

   4. Performs steps 1-3 until all the vertices are processed.

3. Change all the pheromone values by the formula $(1 - \mathit{pheromoneEvaporation}) \cdot \mathit{currentValue}$.

4. For all solutions from step 2, do the following:

   1. Increase the pheromone map's value for the replacement by $\frac{\mathit{distanceCoeff}}{\mathit{replacementCost}}$.

5. Repeat steps 2-4 a predefined number of times.
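The steps above can be sketched in code. This is one possible reading of the slide, not the authors' implementation: only vertex replacements are modelled (edges are ignored), a `+ 1.0` term is added to denominators to avoid division by zero when a cost is 0, and all parameter names and default values are assumptions.

```python
import random


def levenshtein(a: str, b: str) -> int:
    """Dynamic-programming Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def ant_colony(labels1, labels2, ants=10, rounds=20, pheromone_power=1.0,
               distance_power=2.0, evaporation=0.1, distance_coeff=1.0,
               initial=1.0, indel=10, seed=0):
    rng = random.Random(seed)
    n1, n2 = len(labels1), len(labels2)
    options = [None] + list(range(n2))  # None means "delete this vertex"
    # Step 1: pheromone map over (vertex, replacement) pairs.
    pheromone = {(i, j): initial for i in range(n1) for j in options}
    best_cost, best = float("inf"), None
    for _ in range(rounds):                       # step 5
        solutions = []
        for _ in range(ants):                     # step 2
            used, path_cost, moves = set(), 0, []
            for i in range(n1):                   # step 2.4
                # Step 2.1: replacements still available for vertex i.
                free = [j for j in options if j is None or j not in used]
                steps = [indel if j is None else levenshtein(labels1[i], labels2[j])
                         for j in free]
                # Step 2.2: pheromone^pheromonePower / pathCost^distancePower
                # (the + 1.0 is an added assumption to avoid dividing by zero).
                weights = [pheromone[(i, j)] ** pheromone_power
                           / (path_cost + s + 1.0) ** distance_power
                           for j, s in zip(free, steps)]
                # Step 2.3: random choice proportional to the weights.
                k = rng.choices(range(len(free)), weights=weights)[0]
                moves.append((i, free[k], steps[k]))
                path_cost += steps[k]
                if free[k] is not None:
                    used.add(free[k])
            cost = path_cost + indel * (n2 - len(used))  # leftover insertions
            solutions.append(moves)
            if cost < best_cost:
                best_cost, best = cost, [j for _, j, _ in moves]
        for key in pheromone:                     # step 3: evaporation
            pheromone[key] *= 1.0 - evaporation
        for moves in solutions:                   # step 4: deposit
            for i, j, step in moves:
                pheromone[(i, j)] += distance_coeff / (step + 1.0)
    return best_cost, best


model1 = ["register", "sign in", "fill the form", "select payment type", "pay"]
model2 = ["register", "sign in", "fill the form"]
cost, mapping = ant_colony(model1, model2)
print(cost, mapping)
```

The outcome depends on the seed and parameters, so no output is quoted. Cheap replacements receive larger deposits, so later ants are increasingly biased towards them.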


BPMNDiffViz Tool

https://pais.hse.ru/research/projects/CompBPMN


Experimental results. BPMN models discovered from artificial event logs (different algorithms)


Experimental results. BPMN models discovered from real event logs (different parts of logs)


Future work

Industry

New suboptimal methods

New application fields

Theory

Compare with other conformance checking methods

Different discovery algorithms (different structure of process models)


Thank you!


A.V. Skobtsov: [email protected]

A.A. Kalenkova: [email protected]