
Parallel Computing

Architecture

Interconnection Networks

• Uses of interconnection networks
  • Connect processors to shared memory
  • Connect processors to each other
• Interconnection media types
  • Shared medium
  • Switched medium


Parallel Computers

• Vector Computers
  • Instructions include direct vector operations
  • Pipelined – data streams through vector arithmetic units (CRAY)
  • Processor array – processors execute the same instruction
• Multiple CPUs
  • Multiprocessors – multiple CPUs with shared memory
  • Multicomputers – multiple CPUs with distributed memory

Processor Array

• Only well adapted to data-parallel problems
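A data-parallel problem applies the same operation independently to every element of a data set, which is what a processor array does in lockstep. The loop below is a minimal illustrative sketch (not from the slides): an element-wise vector addition in C with an optional OpenMP simd hint; the function and array names are invented for the example.

    /* Element-wise vector addition: the same "add" is applied to every
     * element, so a processor array (or any SIMD unit) can execute all
     * iterations in lockstep.  The pragma merely asks a compiler that
     * supports OpenMP to vectorize the loop; it is ignored otherwise. */
    void vector_add(const float *a, const float *b, float *c, int n)
    {
        #pragma omp simd
        for (int i = 0; i < n; i++)
            c[i] = a[i] + b[i];
    }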

Multiprocessors

• Shared memory
• Can be built with commodity components
• Centralized
  • Extension of a uniprocessor: add CPUs to a bus
  • Same memory access time: UMA (uniform memory access)
  • Also known as SMP (symmetric multiprocessor)
• Distributed
  • Memory distributed among the processors
  • NUMA (non-uniform memory access)
  • Allows greater numbers of processors

Centralized multiprocessors

• Problem: cache coherence
• Most common solution: the write-invalidate protocol

Write Invalidate Protocol

The most common solution to cache coherence:

1. Each CPU’s cache controller monitors (snoops) the bus and identifies which cache blocks are requested by other CPUs.
2. A processor gains exclusive control of a data item before performing a write.
3. Before the write occurs, all other copies of the data item cached by other processors are invalidated.
4. When any other CPU then tries to read a memory location from an invalidated cache block, a cache miss occurs and it has to retrieve the updated data from memory.

Cache-coherence

[Diagram sequence: CPUs A and B, each with a private cache, share a bus to memory; X initially holds 7 in memory.]

• Reading from memory is not a problem: CPU A reads X and caches the value 7; CPU B then reads X and also caches 7.
• Writing to memory is a problem: if CPU B simply wrote X = 2, memory and CPU B’s cache would hold 2 while CPU A’s cache still held the stale value 7.
• With write-invalidate snooping, a cache control monitor in each CPU snoops the bus to see which cache block is being requested by other processors.
• CPU B signals its intent to write X; before the write can occur, all copies of the data at that address (here, CPU A’s) are declared invalid.
• CPU B then writes X = 2. When CPU A (or any other processor) later tries to read this location from its cache, it receives a cache miss and has to refresh the value from main memory.
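The bus-snooping sequence above can be mimicked in a few lines of C. This is a simplified, illustrative simulation for a single cache block (the three-state encoding, the write-through behaviour and all names are assumptions made for the sketch, not the slides' exact protocol): every write first invalidates the other CPU's copy, and a later read by that CPU misses and refetches from memory.

    #include <stdio.h>

    #define NCPUS 2

    /* Simplified per-block cache states (an MSI-like assumption). */
    typedef enum { INVALID, SHARED, MODIFIED } State;

    static int   memory = 7;                        /* main-memory copy of X  */
    static State state[NCPUS] = { INVALID, INVALID };
    static int   cached[NCPUS];                     /* each CPU's cached copy */

    /* Read: on a miss (INVALID block) refetch the value from memory. */
    static int cpu_read(int cpu)
    {
        if (state[cpu] == INVALID) {                /* cache miss             */
            cached[cpu] = memory;
            state[cpu]  = SHARED;
        }
        return cached[cpu];
    }

    /* Write: the "intent to write" is snooped by every other cache,
     * which invalidates its copy before the write takes place. */
    static void cpu_write(int cpu, int value)
    {
        for (int other = 0; other < NCPUS; other++)
            if (other != cpu)
                state[other] = INVALID;             /* snooped invalidation   */

        cached[cpu] = value;
        state[cpu]  = MODIFIED;
        memory      = value;                        /* write-through, as drawn */
    }

    int main(void)
    {
        printf("A reads X: %d\n", cpu_read(0));     /* 7                          */
        printf("B reads X: %d\n", cpu_read(1));     /* 7                          */
        cpu_write(1, 2);                            /* B writes 2, A invalidated  */
        printf("A reads X: %d\n", cpu_read(0));     /* miss: refetches 2          */
        return 0;
    }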

Distributed Multiprocessors

• Increase local memory bandwidth and lower the average memory access time
• All the memory forms a single address space

Cache Coherence

• Implementation is more difficult
  • No shared memory bus to “snoop”
  • A directory-based protocol is needed
• Some NUMA multiprocessors do not support it in hardware
  • Only instructions and private data are cached
  • Large memory access time variance

Directory-based Protocol

• A distributed directory contains information about cacheable memory blocks
• One directory entry for each cache block
• Each entry has (see the sketch after this list)
  • the sharing status
  • which processors have copies
• Sharing status
  • Uncached (denoted by “U”) – block not in any processor’s cache
  • Shared (denoted by “S”) – cached by one or more processors, read only
  • Exclusive (denoted by “E”) – cached by exactly one processor, with write access
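As a concrete illustration, one way to hold such an entry in memory is a status field plus a presence bit vector with one bit per processor. This is only a hypothetical sketch; the type and field names below are assumptions, not part of the slides.

    #include <stdint.h>

    /* Sharing status of a cacheable block, as listed above. */
    typedef enum {
        UNCACHED,    /* U: block not in any processor's cache          */
        SHARED,      /* S: cached by one or more processors, read only */
        EXCLUSIVE    /* E: cached by exactly one processor, writable   */
    } SharingStatus;

    /* One directory entry per cache block: the sharing status plus a
     * bit vector with one presence bit per processor (up to 32 here). */
    typedef struct {
        SharingStatus status;
        uint32_t      presence;   /* bit i set => CPU i holds a copy */
    } DirEntry;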

Directory-based Protocol

[Diagram: CPU 0, CPU 1 and CPU 2 connected by an interconnection network; each node has its own cache, local memory and a portion of the directory.]

Directory-based Protocol: Example

[Diagram sequence: CPU 0, CPU 1 and CPU 2 on an interconnection network. X lives in the home node’s memory with initial value 7; the directory entry for X holds the sharing status and a bit vector with one bit per CPU, initially “X U 0 0 0”.]

• CPU 0 reads X – the read miss goes to X’s home directory; the directory supplies the value 7, CPU 0 caches X = 7, and the entry becomes S with bit vector 1 0 0.
• CPU 2 reads X – another read miss; the directory supplies 7, CPU 2 caches X = 7, and the entry stays S with bit vector 1 0 1.
• CPU 0 writes 6 to X – the write miss reaches the directory, which invalidates CPU 2’s copy; the entry becomes E with bit vector 1 0 0, and CPU 0’s cache holds X = 6 while memory still holds 7.
• CPU 1 reads X – read miss; since CPU 0 holds the block exclusively, the directory obtains the current value from CPU 0 and updates memory to 6; the entry switches to shared (S, 1 1 0), and CPU 0 and CPU 1 both cache X = 6.
• CPU 2 writes 5 to X – write miss; the directory invalidates the copies at CPU 0 and CPU 1; the entry becomes E with bit vector 0 0 1 and CPU 2’s cache holds X = 5.
• CPU 0 writes 4 to X – write miss; the directory makes CPU 2 write its copy back (memory becomes 5) and invalidates it (momentarily U, 0 0 0), creates cache block storage for X at CPU 0, and grants CPU 0 exclusive access (E, 1 0 0); CPU 0 writes X = 4 into its cache.
• CPU 0 writes back (flushes) cache block X – the data write back updates memory to 4 and the directory entry returns to U with bit vector 0 0 0.
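The directory-side behaviour in this walkthrough can be summarised as two handlers, one for read misses and one for write misses. The sketch below is only an outline under stated assumptions: it reuses the hypothetical DirEntry and SharingStatus definitions from the earlier sketch, assumes at most 32 CPUs, and leaves the actual data transfers and invalidation messages as comments.

    /* Directory-side handling of a read miss from CPU cpu.
     * (Assumes the DirEntry and SharingStatus definitions sketched earlier;
     * data movement and acknowledgements are omitted.) */
    static void on_read_miss(DirEntry *e, int cpu)
    {
        if (e->status == EXCLUSIVE) {
            /* The current owner must supply the block and write it back
             * to memory before it can be shared again ("switch to shared"). */
        }
        e->presence |= (uint32_t)1 << cpu;   /* record the new sharer     */
        e->status    = SHARED;               /* U or E -> S               */
    }

    /* Directory-side handling of a write miss (or upgrade) from CPU cpu. */
    static void on_write_miss(DirEntry *e, int cpu)
    {
        for (int i = 0; i < 32; i++) {
            if (i != cpu && (e->presence & ((uint32_t)1 << i))) {
                /* send an invalidate (and, for an exclusive owner, a
                 * write-back request) to CPU i -- message omitted */
            }
        }
        e->presence = (uint32_t)1 << cpu;    /* only the writer remains   */
        e->status   = EXCLUSIVE;             /* writer gets write access  */
    }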

Multicomputer

• Distributed-memory multiple-CPU computer
• The same address on different processors refers to different physical memory locations
• Processors interact through message passing (see the sketch after this list)
• Flavors
  • Asymmetrical
  • Symmetrical
  • Mixed
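Because each processor has its own address space, data can only move between processors as explicit messages. The slides do not name a particular library; the following is a minimal sketch using MPI, a common message-passing interface, in which the variable x on process 0 and on process 1 refers to different physical memory.

    #include <mpi.h>
    #include <stdio.h>

    /* Process 0 sends one integer to process 1.  The same variable name
     * on the two processes denotes different physical memory, so the
     * value must be transferred explicitly.
     * Build with mpicc, run with: mpirun -np 2 ./a.out */
    int main(int argc, char **argv)
    {
        int rank, x = 0;

        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        if (rank == 0) {
            x = 42;
            MPI_Send(&x, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
        } else if (rank == 1) {
            MPI_Recv(&x, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            printf("process 1 received x = %d\n", x);
        }

        MPI_Finalize();
        return 0;
    }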


Asymmetrical Multicomputer

• Back-end dedicated to parallel operations
• A single front-end computer can limit the scalability of the system
• Every application requires development of both a front-end and a back-end program

Symmetrical Multicomputer

• Every processor executes the same program
• No simple way to balance the program development workload among processors
• More difficult to achieve high performance with several processes on each processor

Mixed Cluster Multicomputer

• Co-located computers

• Dedicated to running parallel jobs

• Identical operating system

• Identical local disk images


Flynn’s Taxonomy

• Instruction stream
• Data stream
• Single vs. multiple
• Four combinations
  • SISD
  • SIMD
  • MISD
  • MIMD

Flynn’s Taxonomy

• SISD
  • Single Instruction, Single Data
  • Single-CPU systems (note: co-processors don’t count)
  • Can execute multiple functions
  • Multiple I/O
  • Example: PCs
• SIMD
  • Single Instruction, Multiple Data
  • Two architectures fit this category
    • Pipelined vector processor
    • Processor array

Flynn’s Taxonomy

• MISD
  • Multiple Instruction, Single Data
  • Example: systolic array
• MIMD
  • Multiple Instruction, Multiple Data
  • Multiple-CPU computers
    • Multiprocessors
    • Multicomputers

Systolic Array

• Multiple interconnected processing elements
• Example: a sorting element

[Diagram: a sorting element takes 3 inputs a, b, c during an input phase (1 clock) and emits 3 outputs, min(a, b, c), med(a, b, c) and max(a, b, c), during an output phase (1 clock).]
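In software terms, each element simply sorts its three inputs. The function below is an illustrative sketch (real systolic elements are hardware, and the name sort3 is invented here); it produces the minimum, median and maximum with three compare-and-swap steps.

    /* One sorting element: given a, b, c, produce min, median and max.
     * Three compare-and-swap steps form a 3-input sorting network. */
    static void sort3(int a, int b, int c, int *min, int *med, int *max)
    {
        int lo = a, mid = b, hi = c, t;

        if (lo > mid) { t = lo;  lo  = mid; mid = t; }
        if (mid > hi) { t = mid; mid = hi;  hi  = t; }
        if (lo > mid) { t = lo;  lo  = mid; mid = t; }

        *min = lo;  *med = mid;  *max = hi;
    }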

A priority queue in a systolic array

• One insertion, two extractions

[Diagram sequence: a chain of sorting elements initially holding 4, 5 and 8, padded with ∞ (a −∞ value is fed at the input). Inserting 7 places it in order among 4, 5, 7, 8; each extraction then removes the current minimum (first 4, then 5) while the remaining values move toward the front of the array.]
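The same behaviour can be mimicked in software with an array of cells: on insertion every cell keeps the smaller of its stored value and the incoming key and passes the larger one to the right; on extraction the minimum leaves from the front and the rest shift forward. This is only an illustrative analogue of the systolic queue (the names, the capacity of 8 cells and the use of INT_MAX as "infinity" are assumptions), not a hardware description.

    #include <limits.h>
    #include <stdio.h>

    #define CELLS 8        /* queue capacity; INT_MAX plays the role of "infinity" */

    static int cell[CELLS];

    static void pq_init(void)
    {
        for (int i = 0; i < CELLS; i++)
            cell[i] = INT_MAX;
    }

    /* Insertion: the new key enters at the left; each cell keeps the
     * smaller of (incoming, stored) and passes the larger to the right. */
    static void pq_insert(int key)
    {
        for (int i = 0; i < CELLS; i++) {
            if (key < cell[i]) { int t = cell[i]; cell[i] = key; key = t; }
        }
    }

    /* Extraction: the minimum leaves from the front; the remaining
     * values shift one cell toward the front. */
    static int pq_extract(void)
    {
        int min = cell[0];
        for (int i = 0; i < CELLS - 1; i++)
            cell[i] = cell[i + 1];
        cell[CELLS - 1] = INT_MAX;
        return min;
    }

    int main(void)
    {
        pq_init();
        pq_insert(4); pq_insert(5); pq_insert(8);
        pq_insert(7);                        /* the trace's insertion   */
        int first  = pq_extract();           /* 4: first extraction     */
        int second = pq_extract();           /* 5: second extraction    */
        printf("%d %d\n", first, second);
        return 0;
    }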
