cs4335:design and analysis of algorithms

58
111/01/16 CS4335 Design and Analysis of Algorithms /WANG Lusheng Page 1 CS4335: Design and Analysis of Algorithms Who we are: Professor Lusheng WANG Dept. of Computer Science office: B6422 phone: 2788 9820 e-mail: [email protected] Course web site: http://www.cs.cityu.edu.hk/~lwang/ccs4335.html I will do tutorial for two groups on Friday. Another two groups of tutorial will be conducted on Monday by: • Mr. ZHAO Zicheng (Ex: 2546) and Mr. WANG Kai (EX:)

Upload: libby

Post on 25-Jan-2016

67 views

Category:

Documents


10 download

DESCRIPTION

CS4335:Design and Analysis of Algorithms. Who we are: Professor Lusheng WANG Dept. of Computer Science office: B6422 phone: 2788 9820 e-mail: [email protected] Course web site: http://www.cs.cityu.edu.hk/~lwang/ccs4335.html I will do tutorial as well. Text Book:. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 1

CS4335: Design and Analysis of Algorithms Who we are:

Professor Lusheng WANG Dept. of Computer Science office: B6422 phone: 2788 9820 e-mail: [email protected] Course web site: http://www.cs.cityu.edu.hk/~lwang/ccs4335.html

I will do tutorial for two groups on Friday. Another two groups of tutorial will be conducted on Monday by:

• Mr. ZHAO Zicheng (Ex: 2546) and Mr. WANG Kai (EX:)

Page 2: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 2

Text Book: • J. Kleinberg and E. Tardos, Algorithm design, Addison-Wesley,

2005.

• We will add more material in the handout.

References:• T. H. Cormen, C. E. Leiserson, R. L. Rivest, Introduction to

Algorithms, The MIT Press.

http://theory.lcs.mit.edu/~clr/

• R. Sedgewick, Algorithms in C++, Addison-Wesley, 2002.

• A. Levitin, Introduction to the design and analysis of algorithms, Addison-Wesley, 2003.

• M.R. Garry and D. S. Johnson, Computers and intractability, a guide to the theory of NP-completeness, W.H. Freeman and company, 1979

Page 3: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 3

Page 4: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 4

Algorithms

Any well-defined computational procedure that takes some value, or set of values, as input and produces some value, or set of values, as output.

A sequence of computational steps that transform the input into output.

A sequence of computational steps for solving a well-specified computational problem.

Page 5: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 5

Example of well-specified problem: Sorting

Input: a sequence of numbers: 1, 100, 8, 25, 11, 9, 2, 1, 200.

Output: a sorted (increasing order or decreasing order) sequence of numbers

1, 2, 8, 9, 11, 25, 100, 200. Another example: Create web page that contains a list of papers using

HTML.-everybody can do it. Not need to design computational steps.

Page 6: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 6

Find a shortest path from station A to station B.

-need serious thinking to get a correct algorithm.

A

B

Page 7: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 7

A Real-Time Driver’s Direction SystemGiven an electronic map (stored on a computer), the

position of your car (provided by GPS), and the destination,

the system can tell you the way to go to the destination.Tell you tern left or right 40 meters before according to

the shortest path. If you did not follow the direction, re-calculate the

shortest path.400 US$ each.

Page 8: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 8

Dijkstra’s Algorithm: (Recall)

Dijkstra’s algorithm assumes that w(e)0 for each e in the graph.

maintain a set S of vertices such that Every vertex v S, d[v]=(s, v), i.e., the shortest-path from s to v has

been found. (Intial values: S=empty, d[s]=0 and d[v]=)

(a) select the vertex uV-S such that

d[u]=min {d[x]|x V-S}. Set S=S{u}

(b) for each node v adjacent to u do RELAX(u, v, w). Repeat step (a) and (b) until S=V.

Page 9: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 9

Descriptions of AlgorithmsFlow chartPseudo-code ProgramsNatural languages The purpose: Allow a well-trained programmer to write a program to

solve the computational problem.Any body who can talk about algorithm MUST have

basic programming skills Also CS3334: Data structures.

Page 10: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 10

What We Cover:

1. Some classic algorithms in various domains Graph algorithms

• Euler path, shortest path, minimum spanning trees, maximum flow, Hamilton Cycle, traveling salesman,

String Algorithms• Exact string matching, Approximate string matching, Applications

in web searching engines Scheduling problems Computational Geometry

2. Techniques for designing efficient algorithms divide-and-conquer approach, greedy approach, dynamic

programming

Page 11: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 11

What We Cover(continued):

3. Introduction to computational complexity NP-complete problems

4. Approximation algorithms Vertex cover Steiner trees Traveling sales man problem

Page 12: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /Shuai Cheng Li

Page 12

Why You have to take this course

Apply learned techniques to solve various problems

Have a sense of complexities of various problems in different domains

Find excuses if you cannot solve a problem?

College graduates vs. University graduates Supervisor v.s. low level working force

Page 13: CS4335:Design and Analysis of Algorithms

Or a great leader and thinker?

Page 14: CS4335:Design and Analysis of Algorithms

Original Thinking

Page 15: CS4335:Design and Analysis of Algorithms

Boss assigns task:

A Real-Time Driver’s Direction System

Page 16: CS4335:Design and Analysis of Algorithms

Um? Tell me what to code.

With more sophisticated software engineering systems,the demand for mundane programmers will diminish.

Your answer:

Page 17: CS4335:Design and Analysis of Algorithms

Your answer:

I learned this great algorithm that will work.

Soon all known algorithms will be available in libraries.

Page 18: CS4335:Design and Analysis of Algorithms

Your answer:

I can develop a new algorithm for you.

Great thinkers will always be needed.

Page 19: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 19

Page 20: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 20

Page 21: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 21

Page 22: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 22

Why You have to take this course

You can apply learned techniques to solve various problems

Have a sense of complexities of various problems in different domains

College graduates vs. University graduates Supervisor v.s. low level working force

Page 23: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 23

A joke: The boss wants to produce programs to solve the following

two problems Euler circuit problem:

• given a graph G, find a way to go through each edge exactly once. Hamilton circuit problem:

• given a graph G, find a way to go through each vertex exactly once.

The two problems seem to be very similar. Person A takes the first problem and person B takes the

second. Outcome: Person A quickly completes the program, whereas

person B works 24 hours per day and is fired after a few months.

Page 24: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 24

Euler Circuit: The original Konigsberg bridge

Page 25: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 25

Hamilton Circuit

Page 26: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 26

A joke (continued):

Why? no body in the company has taken CS4335. Explanation:

Euler circuit problem can be easily solved in polynomial time.

Hamilton circuit problem is proved to be NP-hard. So far, no body in the world can give a polynomial time

algorithm for a NP-hard problem. Conjecture: there does not exist polynomial time algorithm

for this problem.

Page 27: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 27

Evaluation of the Course

Course work: 30% Four assignments

• 6 points for each of the first three assignments

• 12 points for the last assignment

• Working load is Not heavy

A final exam: 70%;

Page 28: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 28

Evaluation of the Course (OBTL)No.CILOs .1. prove the correctness and analyze the running time

of   the basic algorithms for those classic problems in various domains; 

2. apply the algorithms and design techniques to solve problems; 

3. analyze the complexities of various problems in different domains. 

For CILO 1- CILO 3, we have assignments to evaluate.

You have to pass all CILOs in order to pass the course.

Page 29: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 29

How to Teach

Contents are divided into four classes

1. Basic part -- every body must understand in order to pass

2. Moderate part -- most of students should understand. Aim at B or above. 3. Hard part -- used to distinguish students. Aim at A or above.

4. Fun part -- just illustrate that some interesting things can be done if one works very hard. -- useful knowledge that will not be tested. -- I will give some challenging problems for fun.

Page 30: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 30

Challenging Problems

For those who have extra energy, we have challenging problems.• Challenging problems will be distributed at the end of some lectures. •No mark will be given for those challenge problems. •I will record who completely solved the problem. (The records will be used to decide the boundary cases.)•Good Training for solving new problems

•Have high chance to solve the hard problems in the final exam.

Page 31: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 31

How to Learn

1. Attend every lecture (2 hours per week) and tutorial (1 hour per week)

2. Try to go with me when I am talking

3. Ask questions immediately

4. Try to fix all problems during the 1 hour tutorial

5. Ask others.

Page 32: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 32

The Process of Design an Algorithm

• Formulating the problem• with enough mathematical precision• we can ask a concrete question• start to solve it.

•Design the algorithm•list the “precise” steps. (an expert can translate the algorithm into a computer program.)

•Analyze the algorithm•prove that it is correct•establish the efficiency

• the running time or sometimes space

Page 33: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 33

Terminologies A Graph G=(V,E): V---set of vertices and E--set of

edges.  

Path in G: sequence v1, v2, ..., vk of vertices

in V such that (vi, vi+1) is in E.

vi and vj could be the same

Circuit: A path v1, v2, ..., vk such that v1 = vk

.Degree of a vertex: number of edges incident to the vertex.

Page 34: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 34

Euler circuit

Input: a connected graph G=(V, E) Problem: is there a circuit in G that uses each

edge exactly once.

Note: G can have multiple edges, .i.e., two or more edges connect vertices u and v.

Page 35: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 35

Story:

The problem is called Konigsberg bridge problem it asks if it is possible to take a walk in the town shown in

Figure 1 (a) crossing each bridge exactly once and returning home.

solved by Leonhard Euler [pronounced OIL-er] (1736)

The first problem solved by using graph theory A graph is constructed to describe the town. (See Figure 1 (b).)

Page 36: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 36

The original Konigsberg bridge (Figure 1)

Page 37: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 37

Theorem for Euler circuit

Theorem 1 (Euler’s Theorem) A connected graph has an Euler circuit if and only if all the vertices in the graph have even degrees.

Proof: if an Euler circuit exists, then the degree of each node is even.

Going through the circuit, each time a vertex is visited, the degree is increased by 2. Thus, the degree of each vertex is even.

Page 38: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 38

Proof of Theorem 1: if the degree of every node is even, then there is an Euler circuit.

We give way to find an Euler circuit for a graph in which every vertex has an even degree.

Since each node v has even degree, when we first enter v, there is an unused edge that can be used to get out v.

The only exception is when v is a starting node. Then we get a circuit (may not contain all edges in G) If every node in the circuit has no unused edge, all the edges in G have been used since G is connected. Otherwise, we can construct another circuit, merge the two circuits and get a larger circuit. In this way, every edge in G can be used.  

Page 39: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 39

Example 1:

after merge

f

a

d

b

g

h

j

i

c

e

1

4

32

87

6 5

10

9

12

11

13

a b

c

1

3 2

d

feb

4

7 6

5

d

fea b

c

15 4

32

7 6

g

h

j

i

c

e

1213

8

910

11

C1:abca, C2:bdefb, =>C3:abdefbca. C4:eijhcge. C3+C4=>abdeijhcgefbca

Page 40: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 40

An efficient algorithm for Euler circuit

1. Starting with any vertex u in G, take an unused edge (u,v) (if there is any) incident to u2. Do Step 1 for v and continue the process until v has no unused edge. (a

circuit C is obtained)  3. If every node in C has no unused edge, stop. 4. Otherwise, select a vertex, say, u in C, with some unused edge incident to u and do Steps 1 and 2 until another circuit is

obtained. 5. Merge the two circuits obtained to form one circuit 6. Goto Step 3.

 

Page 41: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 41

Example 2:

a b

c d

e

f

First circuit: a-b-d-c-a

Second circuit: d-f-e-c-d.

Merge: a-b-d-f-e-c-d-c-a.

3rd circuit: e-g-h-e

Merge: a-b-d-f-e-g-h-e-c-d-c-a

Note: There are multiple edges.

gh

Page 42: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 42

Representations of Graphs (Fun Part Sildes 42-52)

Two standard ways Adjacency-list representation

Space required O(|E|) Adjacency-matrix representation 

Space required O(n2).

Depending on problems, both representations are useful.

Page 43: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 43

Adjacency-list representation

Let G=(V, E) be a graph. V– set of nodes (vertices) E– set of edges.

For each uV, the adjacency list Adj[u] contains all nodes in V that are adjacent to u.

21

5 4

3

2

1

2

2

4

5 /

5

4 /

5

1

3 4 /

2 /

3 /

1

2

3

4

5

(a) (b)

Page 44: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 44

Adjacency-list representation for directed graph

1

2

3

4

5

6

/

2

5 /

6

2

4 /

4 /

5 /

6 /

6

21

4 5

3

(b)(a)

Page 45: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 45

Adjacency-matrix representation

Assume that the nodes are numbered 1, 2, …, n. The adjacency-matrix consists of a |V||V| matrix A=(aij) such that

aij= 1 if (i,j) E, otherwise aij= 0.

21

5 4

3

(a)

0 1 0 0 1 1 0 1 1 1

0 1 0 1 0

0 1 1 0 1

1 1 0 1 0

1 2 3 4 5

1

2

3

4

5

(c)

Page 46: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 46

Adjacency-matrix representation for directed graphIt is NOT symmetric.

(c)

6

21

4 5

3

(a)

0 1 0 1 0 0 0 0 0 0 1 0

0 0 0 0 1 1

0 1 0 0 0 0

0 0 0 1 0 0

0 0 0 0 0 1

1 2 3 4 5 6

1

2

3

4

5

6

Page 47: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 47

Implementation of Euler circuit algorithm (Not required)

Data structures: Adjacency-list representation Each node in V has an adjacency list Also, we have two lists to store the circuits

One for the circuit produced in Steps 1-2. One for the circuit produced in Step 4

In Step 1: when we take an unused edge (u, v), this edge is deleted from the adjacency-list of node u.

Page 48: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 48

Implementation of Euler circuit algorithm

In Step 2: if the adjacency list of v is empty, v has no unused edge.

1. Testing whether adjacency-list v is empty.

2. A circuit (may not contain all edges) is obtained if the above condition is true.

3. If the adjacency-list for v is empty, DELETE the list.

In Step 3: if all the adjacency-list is empty, stop. (Note we did 3).

In step 4: if some adjacency-list is not empty, use it in step 4.

Page 49: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 49

b

a

a

b

d

b

c

c

e

h

c

c

b

e

f

e

e

j

j

i

d

g

g

f

h

i

a

b

c

d

e

f

g

h

i

j

Figure1: The adjacency-list representation of Example 1(Slide 32)

C1:a-b-c-a,

Page 50: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 50

d

g

b

d

b

c

c

e

h

f

h

e

f

e

e

g

j

i

g i

a

b

c

d

e

f

g

h

i

j

After edge (a,b), (b, c) and (c, d) are used.

C1:a-b-c-a, C2: bdefb,

=>C3:abdefbca

Page 51: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 51

g

g

c

c

e

h

h

i

e

j

j

i

a

b

c

d

e

f

g

h

i

j

After edges (b, d) (d, e), (e, f), (f, b) are used.

C1:abca, C2:bdefb, =>C3:abdefbca. C4:eijhcge. C3+C4=>abdeijhcgefbca

Page 52: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 52

Time complexity (Fun Part) If it takes O(|V|) time to add an edge to a circuit, then the time

complexity is O(|E||V|).

This can be done by using an adjacency list. The implementation is tricky Even O(|V||E|) algorithm is not trivial. Do not be disappointment if you do not completely understand the

implementation The best algorithm takes O(|E|) time. Euler Circuit: A complete example for Designing

Algorithms.

-formulation, design algorithm, analysis.

Page 53: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 53

Euler Path A path which contains all edges in a graph G

is called an Euler path of G. Deleting an edge in a graph having an Euler

Circuit, we obtain an Euler path in the new graph.

 

Corollary: A graph G=(V,E) which has an Euler path has 2 vertices of odd degree.

Deleting the edge

Page 54: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 54

Proof of the Corollary

Suppose that a graph which has an Euler path starting at u and ending at v, where uv.

Creating a new edge e joining u and v, we have an Euler circuit for the new graph G’=(V, E{e}).

  From Theorem 1, all the vertices in G’ have even degree. Remove e.   Then u and v are the only vertices of odd degree in G. (Nice argument, not required for exam.) 

Page 55: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 55

Application 1: In a map, two countries may share a common

border. Given a map, we want to know if it is possible to find a tour starting from a country, going across each shared border once and come back to the starting country.

A B

CD

A B

CD

Page 56: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 56

Application 2:Formulating a graph problem:

Given a map containing Guizhou, Hunan, Jiangxi, Fujian, Guangxi, Guangdong, construct a graph such that if two provinces share some common boarder, then there is an edge connecting the two corresponding vertices.

If we want to find a tour to visit each province once, then we need to find a Hamilton circuit in the graph.

Page 57: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 57

Challenge Problem : (only for those who are interested)(1) Prove that given a graph G=(V, E), one can

always add some edges to the graph such that the new graph (multiple edges are allowed) has an Euler circuit.

(2) Design an algorithm to add minimum number of edges to an input graph such that the resulting graph has an Euler circuit.

(3) Prove the correctness of your algorithm.

Page 58: CS4335:Design and Analysis of Algorithms

112/04/21 CS4335 Design and Analysis of Algorithms /WANG Lusheng

Page 58

Summary of Euler Circuit Algorithm Design a good algorithm needs two parts

1. Theorem, high level part2. Implementation: low level part. Data structures

are important. Our course emphasizes the first part and

demonstrates the second part whenever possible.

We will not emphasize too much about data structures.