Transcript
Page 1: Trigger Upgrade Planning

1

Trigger Upgrade Planning

Greg Iles, Imperial CollegeGreg Iles, Imperial College

Page 2: Trigger Upgrade Planning

2

We’ve experimented with new technologies...

Aux Pic

Aux Card, Tom GorskiOGTI, Greg Iles

Matrix, Matt Stettler, John Jones,

Magnus Hansen, Greg Iles

Schroff !

Mini CTR2, Jeremy Mans

Page 3: Trigger Upgrade Planning

3

and discussed ideas...

No, I’m right

I’m right

No, I’m right No, I’m right

Where is the coffee ?

I’ll need a quadruple espresso after this...

Page 4: Trigger Upgrade Planning

4

Time for a plan...

Why is it urgent?– Subsystems already designing new trigger electronics.

• In particular HCAL, but others also have upgrade plans.

– If comprehensive plan not adopted now we will squander the opportunity to build a next generation trigger system

• Interface between front-end, regional and global trigger is critical• Requirements driven by Regional Trigger (RT)

=> Would be best to try and define new Trigger now before HCAL plans fixed.

Page 5: Trigger Upgrade Planning

5

Regional Trigger Part I

Red boundary

defines

single FPGA

Single processing node:– Share data in both dimensions to provide complete data for objects

that cross boundaries

Very flexible, but Jets can be large ~15% of image in both η and φ

Very large data sharing required => Very inefficient => Big system

How best to process ~5Tb/s?

The initial idea was to have a

single processing node for

all physics objects

Page 6: Trigger Upgrade Planning

6

Alternative solutions:– Split processing into 2 stages

• Fine procesing for electrons / tau jets• Coarse processing for jets

– Pre cluster physics objects• Build possible physics objects from data available then send to

neigbours for completion• More complex algorithms. Less flexible.

– Both currently applied in CMS• But believe we have a better solution...

Time Multiplexed Trigger - TMT

Regional Trigger Part II

Page 7: Trigger Upgrade Planning

7

Trig Partition 0

Time Multiplexed Trigger

Red boundary

= FPGA

Trig Partition 1

Trigger Partition 8

1 bx

9 bx

8 bx

Green boundary

= Crate

Time

Page 8: Trigger Upgrade Planning

8

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

bx = n+0bx = n+1

bx = n+2bx = n+3

bx = n+4bx = n+5

bx = n+6bx = n+7

bx = n+8

The Trigger Partition Crates

Regional

Trigger Cards

Global

Trigger CardServices: CLK, TTC, TTS & DAQ

Alternative location for Services

Page 9: Trigger Upgrade Planning

9

Regional Trigger Requirements:

– Use Time Multiplexed Trigger (TMT) as baseline design • Flexible

– All HCAL, ECAL, TK & MU data in single FPGA

• Minimal boundary constraints• Can be spilt into seprate partitions for testing

– e.g. ECAL, HCAL MU, TK can each have a partition at the same time

• Simple to understand

– Redundant• Loss of a partition results in loss of max 10% of the trigger rate• Can easily switch to back up partition (i.e. software switch)

– But what constraints does this impose on TPGs...

Page 10: Trigger Upgrade Planning

10

MCH1 providing GbE and

standard functionality

MCH2 providing services

- DAQ (data concentration)

- LHC Clock

- TTC/TTS

Services

See also:

DAQ/Timing/Control Card (DTC), Eric Hazen,

uTCA Aux Card Status - TTC+S-LINK64, Tom Gorski

Page 11: Trigger Upgrade Planning

11

Tongue 1:

AMC port 1

DAQ

GbE

Tongue 2:

AMC port 3

TTC / TTS

Tongue 2:

AMC Clk3

LHC Clock

VadaTech CEO / CTO at

CERN Nov 19th

Page 12: Trigger Upgrade Planning

12

SFP+ (10GbE)

Local DAQ

SFP+ (10GbE)

Global DAQ

V6

XC6VLX240T

24 links

TTC MagJack

12

CLK40 to

AMC CLK3

MCH2-T2

12

TTC to

AMC Port 3 Rx12

TTS from

AMC Port 3 Tx12

CLK2

(Unused)12

Head

er – 80 way

MCH2-T1

Head

er – 80 way

DAQ from AMC Port 1

Advantages:

- All services on just one card

- High Speed Serial on just Toungue 1

- Dual DAQ outputs

- Tounges 3/4 (fat, ext-fat pipes)

spare for other apps

CMS MCH: Service

Needs to be designed....

Page 13: Trigger Upgrade Planning

13

– Ideally wish to install, commission and test new trigger system in parallel with existing trigger

– Following is an example of how this might happen

• HCAL Upgrade + Test Time Multiplexed Trigger • ECAL Upgrade• Full Time Multiplexed Trigger System• Incorporate TK + MU Trigger

Installation scenario

Page 14: Trigger Upgrade Planning

14

Current System

ECAL

TPG

ECAL HCAL

RCT

4x 1.2Gb/s 4x 1.2Gb/s

HCAL

TPG

To GCT and GT

Copper

Page 15: Trigger Upgrade Planning

15

Stage 1:

HCAL Upgrade

ECAL

TPG

ECAL HCAL

RCT

4x 1.2Gb/s

HCAL

TPG

Tech Trigger to existing GT

Many 4.8Gb/s

Time multiplexed data

1/10 to 1/20 of events

Fibre

New RT+GT

Optical interface to RCT

Page 16: Trigger Upgrade Planning

16

Stage 2:

ECAL Upgrade

ECAL

TPG

ECAL HCAL

RCT

4.8 Gb/s

HCAL

TPG

New RT+GT

Tech Trigger to existing GT

Many 4.8Gb/s

Time multiplexed data

1/10 to 1/20 of events

ECAL TM

Other

ECAL

TPGs

Duplicate

Time Multiplexer Rack

Page 17: Trigger Upgrade Planning

17

Stage 3:Upgrade to New Trig

ECAL

TPG

ECAL HCAL

HCAL

TPG

New RT+GT

ECAL TM

Other

ECAL

TPGs

Duplicate

New RT+GT

New RT+GT

Page 18: Trigger Upgrade Planning

18

Stage 4:

Add Tk + Mu Trig

ECAL

TPG

ECAL HCAL

HCAL

TPG

New RT+GT

ECAL TM

Other

ECAL

TPGs

Duplicate

New RT+GT

New RT+GT

MU

TPG

TK

TPG

Page 19: Trigger Upgrade Planning

19

A single Virtex-6 FPGA CLB comprises two slices, with each containing four 6-input LUTs and eight Flip-Flops (twice the number found in a Virtex-4 FPGA slice), for a total of eight 6-LUTs and 16 Flip-Flops per CLB.

XC5VTX150T XC5VTX240T XC6VLX550T

Links 40 @ 5.0Gb/s 48 @ 5.0Gb/s 36 @ 6.5Gb/s

Slices (k) 23 37 86

Logic Cells (k) 148 240 550

CLB FlipFlops (k) 92 150 687

Distributed RAM (kbits)

1,500 2,400 6,200

BRAM (36kbits) 228 324 632

Cost N/A 4.7k 4.5k

A single Virtex-5 CLB comprises two slices, with each containing four 6-input LUTs and four Flip-Flops (twice the number found in a Virtex-4 slice), for a total of eight 6-LUTs and eight Flip-Flops per CLB

Technology choice.. Use V6 not V5

Page 20: Trigger Upgrade Planning

20

Plan for the nexy 5 years:

– Hardware development• Prototypes

– Procesing Card– Service Card– Optical SLBs– Custom Crate

• Cooling– uTCA crates have

front to back airflow

– Firmware• Serial Protocol• Slow Control• Algorithms

– Software• Low level

drivers• Communication• Interfaces to

legacy systems

– Technical Proposal• Review of Proposal

– Resources Required• Financial / Manpower / Time

System

Page 21: Trigger Upgrade Planning

21

Road MapNo Gant Chart yet...

But you wouldn’t be able to read it...

4/2010

4/2012

Technical Trigger

Proposal

Delivery of uTCA

Regional Trigger Crate

and HCAL HW

Delivery of Matrix II

and OSLBs

Integration of

OSLBs in USC55

Demo system with

HCAL HW &

Matrix I cards

904 system with

Matrix II cards

4/2011

USC55 trigger

partition

Firmware &

Software

DevelopmentHardware

Development

Page 22: Trigger Upgrade Planning

22

End of Part I

=> Trigger Upgrade Proposal by April 2010

VadaTech CEO / CTO at

CERN Nov 19th

The next bit is

confusing so

before I lose you...

Page 23: Trigger Upgrade Planning

23

Implications for on TPGs

Can we put time multiplexing inside TPG FPGA?

– Maximum number of trigger partitions limited by latency.• Assume between 9 and 18 partitions

– Hence each TPG requires a minimum of 9 outputs• Each fibre carrying 8 towers/bx x 9 bx = 72 towers • Transmist 4 towers in η over ¼ of φ

– Assumed time multiplex in φ, but η also possible

– But HCAL TPG designed to generate 36-40 towers• This is very impresive, but still need factor of 2• Revist TPG idea...

Page 24: Trigger Upgrade Planning

24

All TPGs

792 links

~3000 links

TPG

18

~72

TPG

9

36

Approx

4:1 input to output ratio

Too much input data?

No, too many serial links.

But only just....

Close to FPGA data BW limit

Assumes input is 3.36 Gb/s

effective GBT

Trigger partitions reduced to 9.

Requires 5.0Gb/s links.

Not possible with GBT protocol

(insufficient FPGA resources)

TPG design Assumed TPG is single FPGA

Why: simplicity, board space, power,

Page 25: Trigger Upgrade Planning

25

TPG

9

36

Possible, but not with

GBT protocol

TPGs with GBT protocol...

TPG

360 LVDS pairs

@ 320Mb/s

GBT

36x GBTs

power ~36W to 72W

real estate = 92cm2

cost = 4320 SFr

TPG

180 LVDS pairs

@ 640Mb/s

Parallel

GBT

Can select output clk

i.e. doesn’t have to be LHC

Will future FPGAs have std I/O?

XC6VLX240T IO = 720 (24,0)

36

36

99

?

Page 26: Trigger Upgrade Planning

26

TPG

3

12

The alternative:

Time Multiplexing Rack

Time Multiplex

– Use FPGA to decode GBT data• Only use ½ of available links to

decode GBT otherwise no logic left for other processing

– Use different GTX blocks for GBT-Rx and RT-Tx inside TPG

• Allows trigger to decouple from LHC clock if required.

– CMS arranged in blocks of constant φ strips

• Would allow mapping to constant η strips

– Consequence• At least x3 to x5 more HCAL

hardware• Time muliplex rack• Larger latency

3

12

TPG

2x18

... x12 TPGs

3x12 4x9or or

Page 27: Trigger Upgrade Planning

27

Fibre Patch

Time multiplex rack – Assumes tower resolution from HF

• 18x22 regions

• 16 towers per region

• 792 fibres

• 12bit data, 8B/10B, 5Gb/s

– Time multiplex card may have

• 36 in/out (2 crates x11 cards)

• 18 in/out (4 crates x 11 cards)

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

uTCA Crate – 11x TMs

uTCA Crate – 11x TMs

36U

or

48U

12 way ribbon = ¼ η, 1 region in φ 9 ribbons can cover ½ φ Hence patch-panel = ¼ η, ½ φNot n

eeded

for H

CAL if w

e get

x2

BW a

t TPG

Page 28: Trigger Upgrade Planning

28

Are there are alternatives to Time Multiplex Racks if HCAL TPG stays the same...

– Yes: • e.g. return to concept of fine/coarse processing for

electrons/jets processing• i.e. jets at 2x2 tower resolution

– Requires less sharing (e.g. just 2, rather than 4 tower overlap for fine processing)

Alllows a more parallel architectureTPG would have to provide 2 towers in η and ¼ of φ

But... Is it wise to separate fine & coarse procesing ?

Still requires some thought

& discussion with TPG experts

Page 29: Trigger Upgrade Planning

29VadaTech CEO / C

TO at CERN N

ov 19th End...

Vadatech VT891

Page 30: Trigger Upgrade Planning

30

Extra

Page 31: Trigger Upgrade Planning

31

½ region in η, all φ regions OR 1 region in η, ½ of all φ regions

Mux: Put all data from 1bx into single FIFO

Single fibre to each processing blade

Cavern Data into TPG

n+0 n+1 n+2 n+3 n+4 n+5 n+6 n+7 n+8

Page 32: Trigger Upgrade Planning

32

Decode GBT– Only instrument 20/24 links of XC5VFX200T, 96% logic cell usage

• F. Marin, TWEPP-09. Altera results slightly better

– Cannot use GBT links to drive data into processing FPGA– Options

• (a) Use powerful FPGAs to simply to decode GBT protocol– Seems wasteful. – SerDes Tx/Rx may have to use same clk

• (b) Use mulltiple GBTs– Diff pairs = 200 @ 320Mb/s lanes. OK if diff pairs remain on FPGAs!– Power = 20W (assume less power because less I/O)– Components = 20 (16 mm x 16 mm, 0.8 mm pitch)– Cost = 120x20 = 2400 SFr !

• (c) Use Xilinx SIRF both on & off detector – If they let us...


Top Related