centralized switching networks computer architecture amano, hideharu textbook pp. 92~13 0

55
Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Upload: madison-price

Post on 02-Jan-2016

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Centralized switching networks

Computer Architecture

AMANO, Hideharu

Textbook pp. 92~13 0

Page 2: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Switch connected parallel machines Where the switches are used?

PU-Memory connection: UMA Node-node connection: NUMA, NORA Snoop is impossible Directory based methods or compiler assisted

methods are used for UMA/NUMA How to build large scale systems

Page 3: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Switch connected UMA

Switch

Local   Memory

CPU

Interface

Main   Memory

. . . .

… .

Local   Memory is sometimes dispensable

Page 4: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Switch connected UMABlocking

Switch

Local   Memory

CPU

Interface

Main   Memoryn

. . . .

1 … .

0

1

n

0

Shared Memory

Page 5: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Switch connected UMAInterleaving

Switch

Local   Memory

CPU

Interface

Main   Memoryn

. . . .

1 … .0

Shared Memory

… .… .

Size: Double word or Cache Line

Page 6: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Switch connected UMA with circular connection

Switch

CPU

Interface

Main   Memory

. . . .

… .

Main memory is used as a home memoryInterleave is often difficult

Page 7: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Switch connected NUMA

Switching Fabrics

Symmetric Multi-Processor

Switching Fabrics sometimes become hierarchical structure→ Fat TreeDirectory based Cache coherent methods are used→ CC-NUMATypical recent high performance server: SUN’s or IBM’s

Page 8: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Switch based network

Single stage Crossbar

Multi-stage Symmetric: Multistage Interconnection Network Asymmetric: Fat-tree, base-m n-cube → Direct

interconnection network

Page 9: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Crossbar switch

Cross point: smallswitching element

The number ofcross points:nxm

Extension of the buses

Page 10: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Non-blocking property

For differentdestination,conflict free

Page 11: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Head Of Line (HOL) conflict

X

Arbiter is required for each bus

The buffer is required

The number of crosspoint is not dominant.

Page 12: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Input buffer switch

Crossbar

Input buffer

One of conflicting packets is selected.Others are stored Into the input buffer

Page 13: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Merit/demerit of Crossbars

Non-blocking property Simple structure/Control The hardware for cross-points usually do not

limit the system (Fallacy of crossbars) Extension is difficult by the pin-limitation of L

SIs If pins can be used, a large crossbar can be const

ructed → Earth simulator

Page 14: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Earth Simulator (2002,NEC)V

ect

or

Pro

cess

or

Vect

or

Pro

cess

or

Vect

or

Pro

cess

or

0 1 7

Shared Memory16GB

Vect

or

Pro

cess

or

Vect

or

Pro

cess

or

Vect

or

Pro

cess

or

0 1 7

Shared Memory16GB

Vect

or

Pro

cess

or

Vect

or

Pro

cess

or

Vect

or

Pro

cess

or

0 1 7

Shared Memory16GB

….

639 Inputs crossbar (16GB/s x 2)

Node 0 Node 1 Node 639

Peak performance40TFLOPS

Page 15: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

SUN T1Core

Core

Core

Core

Core

Core

Core

Core

CrossbarSwitch

FPU

L2Cachebank

Directory

L2Cachebank

Directory

L2Cachebank

Directory

L2Cachebank

DirectorySingle issue six-stage pipelineRISC with 16KB Instruction cache/8KB Data cache for L1

Total 3MB, 64byte Interleaved

Memory

Page 16: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Glossary 1

Symmetric: 対象型、どこからでも同じ遅延、手順でアクセス可能

Asymmetric: 非対象型、遅延、手順が異なる Non-blocking, blocking: 出力ポートが重ならなければ、衝突が起

きないのがノンブロッキング、出力ポートが重ならなくてもスイッチ内部で衝突するのがブロッキング

HOL conflict: 出線競合、出力ポートが重なることで起きる衝突 Interleave:  インタリーブ、ワード単位など細かいレベルでア

ドレスを分離して同時アクセスを増やす方法 Multi-stage/Single-stage: スイッチを多段に接続するか、単一段

で接続するかの構成の違い。 Multi-stage Interconnection Network を MIN と呼ぶ

Page 17: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

MIN ( Multistage  Interconnection  Network) Multistage connected switching elements form a large switch.

Symmetric Smaller number of cross-points, high

degree of expandability Bandwidth is often degraded Latency is stretched

Page 18: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Classification of MIN

Blocking network : Conflict may occur for destination is different :NlogN type standard MIN,πnetwork,

Re-arrangeable : Conflict free scheduling is possible : Benes network 、 Clos network( rearrangeable configuration )

Non-blocking : Conflict free without scheduling : Clos network (non-blocking configuration) 、 Batcher-Banyan network

Page 19: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Properties of MIN

Throughput for random communication Permutation capability Partition capability Fault tolerance Routing

Page 20: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Blocking Networks

Standard NlogN networks Omega network Generalized   Cube Baseline

Pass through ratio (throughput) is the same. Π network

Page 21: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Omega network

The number of switching element (2x2 , in this case ) is 1/2 N x LogN

000001

010

011

100101

110111

000001

010

011

100101

110111

Page 22: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Perfect   Shuffle

Rotate to left

000001010011100101110111

000010100110001011101111

Inverse   ShuffleRotate to right

Page 23: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Destination   Routing

000001

010

011

100101

110111

000001

010

011

100101

110111

Check the destination tag from MSBIf 0 use upper link, else use lower link.

1→ 3

5→6

1 0

Page 24: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Blocking Property

For different destination, multiple paths conflict

000001

010

011

100101

110111

000001

010

011

100101

110111

X

0→04→2

Page 25: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

For using large switching elements ( Delta network )

In the current art of technology, 8x8 (4x4) crossbars are advantageous.

00011011

20213031

00011011

20213031

0123

0123

Shuffle connection is also used.

Page 26: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Omega network

The same connection is used for all stages. Destination routing A lot of useful permutations are available. Problems on partitioning and expandability.

Page 27: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Generalized Cube

000001

010

011

100101

110111

000001

010

011

100101

110111

000100

100

110

100

101

Links labeled with 1bit distance are connectedto the same switching element.

000

010

000

001

Page 28: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Routing in Generalized Cube

000001

010

011

100101

110111

000001

010

011

100101

110111

The source label and destination label is compared (Ex-Or ):Same(0) : Straight   Different (1) : Exchange

001→011010

01 0

Page 29: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Partitioning

000001

010

011

100101

110111

000001

010

011

100101

110111

The communication in the upper half never disturbs the lower half.

Page 30: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Expandability

A size of network can be used as an element of larger size networks

Page 31: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Generalized   Cube

Destination routing cannot be applied. The routing tag is generated by exclusive or

of source label and destination label. Partitioning Expandability

Page 32: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Baseline Network

000001

010

011

100101

110111

000001

010

011

100101

110111

The area of shuffling is changed.

001

100

010

001

3bit shuffle 2bit shuffle

Page 33: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Destination   Routing in Baseline network

000001

010

011

100101

110111

000001

010

011

100101

110111

Just like Omega network

1

1

0

Page 34: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Partitioning in Baseline

000001

010

011

100101

110111

000001

010

011

100101

110111

Page 35: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Baseline network

Providing both benefits of Omega and Generalized   Cube Destination   Routing Partitioning Expandability

Used in NEC’s Cenju

Page 36: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Π network

Tandem connection of two Omega networks

000001

010

011

100101

110111

000001

010

011

100101

110111

Page 37: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Bit reversal permutation(Used in FFT)

Conflicts occur in Omega network.

000001

010

011

100101

110111

000001

010

011

100101

110111

0426

1537

0123

4567

Page 38: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Bit reversal permutation in Π network000001

010

011

100101

110111

000001

010

011

100101

110111

0426

1537

0527

1436

The first Omega : Upper input has priority.The next Omega : Destination   Routing Conflict free

Page 39: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Permutation capacity

All possible permutation is conflict free = Rearrangeable networks

Tree tandem connection of Omega network is rearrangeable.

The tandem connection of Omega and Inverse Omega (Baseline and Inverse Baseline) is rearrangeable. Benes network

Page 40: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Benes Network

Note that the center of stage is shared. The rearrangeable network with the s

mallest hardware requirement.

000001

010

011

100101

110111

000001

010

011

100101

110111

Page 41: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Non-blocking network

Clos network m>n1+n2-1 : Non-blocking m>=n2 : Rearrangeable Else: Blocking

Page 42: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Clos network

... ...

n1xm r1xr2 mxn2

r1 m r2

m=n1+n2-1 : Non-blockingm=n2 : Rearrangeablem<n2 : Blocking

The number of intermediatestage dominates the permutationcapability.

3-stage

Page 43: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Batcher network

5704

2136

5740

1263

0457

6321

0123

4567

Bitonic sorting network

Page 44: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Batcher-Banyan

5704

2136

5740

1263

0457

6321

0123

4567

Sorted input is conflict free in the banyan network

OmegaBaseline

Page 45: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Banyan networks

Only a path is provided between source and destination. The number of intermediate stages is flexible. Approach from graph theory SW-Banyan , CC-Banyan , Barrel   Shifter

Irregular structure is allowed.

Page 46: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Batcher-banyan

If there are multiple packets to the same destination, the conflict free condition is broken→ The other packets may conflict. The extension of banyan network is required.

The number of stages is large.→ Large pass through time The structure of sorting network is simple.

Page 47: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Classification of MINs

Omega

Baseline

Generalized   Cube

π

Benes

Clos

BatcherBanyan

Banyan

Blocking

Rearrageble

Nonblocking

Page 48: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Fault tolerant MINs

Multiple paths Redundant structure is required. On-the-fly fault recovery is difficult. Improving chip yield.

Page 49: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Extra   Stage   Cube  (ESC)

An extra stage + Bypass mechanism

000001

010

011

100101

110111

000001

010

011

100101

110111

If there is a fault on stages or links, another path is used.

Page 50: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

The buffer in switching element

Conflicting packets are stored into buffers.

000001

010

011

100101

110111

000001

010

011

100101

110111

Page 51: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Hot spot contention

Buffer is saturated in the figure of t ree ( Tree Saturation)

000001

010

011

100101

110111

Hot spot

Page 52: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Relaxing the hot spot contention Wormhole routing with Virtual channels →

Direct network Message   Combining

Multiple packets are combining to a packet inside a switching element (IBM RP3)

Implementation is difficult (Implemented in SNAIL)

Page 53: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Other issues in MINs

MIN with cache control mechanism Directory on MIN Cache Controller on MIN

MINs with U-turn path → Fat tree

Page 54: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Glossary 2 Rearrange-able: スケジュールすることにより、出力が重ならな

ければ内部で衝突しないようにできる構成 Perfect shuffle: シャッフルは、トランプの札を切る時に使う単語

だが、ここでは、配線のつなぎ方の方式のひとつ。 Inverse shuffle は逆シャッフルと呼ばれ、逆接続方式。

Destination routing :目的地のラベルだけで経路を決める方法 Permutation: 並び替え、順列のことだが、ここでは目的地ラベル

が重ならない経路を無衝突で生成することができる能力のこと Partitioning: ネットワークを分離して独立に使える能力のこと Fault tolerance: 耐故障性。一部が故障しても全体がダウンしない

ような性質、 Fault tolerant MIN は複数経路を持たせた MIN Expandability: 拡張性、小さなものからサイズを大きくしていく

ことのできる性質 Hot spot contention:  局所的に交信が集中して、これが全体に波

及すること。 Tree saturation:   Hot spot contention によりネットワークが木の

形で飽和していく現象。特に MIN で起きる。 Message Combining は、メッセージをくっつけてまとめることによりこれを防止する方法の一つ

Page 55: Centralized switching networks Computer Architecture AMANO, Hideharu Textbook pp. 92~13 0

Exercise

Every path between source and destination is determined with the destination routing in Omega network. Prove (or explain) the above theory in Omega network with 8-input/output.