trigger upgrade planning

32
1 Trigger Upgrade Planning Greg Iles, Imperial College Greg Iles, Imperial College

Upload: teal

Post on 11-Jan-2016

42 views

Category:

Documents


2 download

DESCRIPTION

Trigger Upgrade Planning. Greg Iles, Imperial College. We’ve experimented with new technologies. Mini CTR2, Jeremy Mans. Aux Pic. Schroff !. Matrix, Matt Stettler, John Jones, Magnus Hansen, Greg Iles. Aux Card, Tom Gorski. OGTI, Greg Iles. and discussed ideas. I’m right. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Trigger Upgrade Planning

1

Trigger Upgrade Planning

Greg Iles, Imperial CollegeGreg Iles, Imperial College

Page 2: Trigger Upgrade Planning

2

We’ve experimented with new technologies...

Aux Pic

Aux Card, Tom GorskiOGTI, Greg Iles

Matrix, Matt Stettler, John Jones,

Magnus Hansen, Greg Iles

Schroff !

Mini CTR2, Jeremy Mans

Page 3: Trigger Upgrade Planning

3

and discussed ideas...

No, I’m right

I’m right

No, I’m right No, I’m right

Where is the coffee ?

I’ll need a quadruple espresso after this...

Page 4: Trigger Upgrade Planning

4

Time for a plan...

Why is it urgent?– Subsystems already designing new trigger electronics.

• In particular HCAL, but others also have upgrade plans.

– If comprehensive plan not adopted now we will squander the opportunity to build a next generation trigger system

• Interface between front-end, regional and global trigger is critical• Requirements driven by Regional Trigger (RT)

=> Would be best to try and define new Trigger now before HCAL plans fixed.

Page 5: Trigger Upgrade Planning

5

Regional Trigger Part I

Red boundary

defines

single FPGA

Single processing node:– Share data in both dimensions to provide complete data for objects

that cross boundaries

Very flexible, but Jets can be large ~15% of image in both η and φ

Very large data sharing required => Very inefficient => Big system

How best to process ~5Tb/s?

The initial idea was to have a

single processing node for

all physics objects

Page 6: Trigger Upgrade Planning

6

Alternative solutions:– Split processing into 2 stages

• Fine procesing for electrons / tau jets• Coarse processing for jets

– Pre cluster physics objects• Build possible physics objects from data available then send to

neigbours for completion• More complex algorithms. Less flexible.

– Both currently applied in CMS• But believe we have a better solution...

Time Multiplexed Trigger - TMT

Regional Trigger Part II

Page 7: Trigger Upgrade Planning

7

Trig Partition 0

Time Multiplexed Trigger

Red boundary

= FPGA

Trig Partition 1

Trigger Partition 8

1 bx

9 bx

8 bx

Green boundary

= Crate

Time

Page 8: Trigger Upgrade Planning

8

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

PW

R2

CM

S A

UX

/DA

Q

PW

R1

MC

H2

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MA

TR

IX

MIN

I-T

MC

H1

MA

TR

IX

MA

TR

IX

12 8 8 8 8 8 8 8 8 12

bx = n+0bx = n+1

bx = n+2bx = n+3

bx = n+4bx = n+5

bx = n+6bx = n+7

bx = n+8

The Trigger Partition Crates

Regional

Trigger Cards

Global

Trigger CardServices: CLK, TTC, TTS & DAQ

Alternative location for Services

Page 9: Trigger Upgrade Planning

9

Regional Trigger Requirements:

– Use Time Multiplexed Trigger (TMT) as baseline design • Flexible

– All HCAL, ECAL, TK & MU data in single FPGA

• Minimal boundary constraints• Can be spilt into seprate partitions for testing

– e.g. ECAL, HCAL MU, TK can each have a partition at the same time

• Simple to understand

– Redundant• Loss of a partition results in loss of max 10% of the trigger rate• Can easily switch to back up partition (i.e. software switch)

– But what constraints does this impose on TPGs...

Page 10: Trigger Upgrade Planning

10

MCH1 providing GbE and

standard functionality

MCH2 providing services

- DAQ (data concentration)

- LHC Clock

- TTC/TTS

Services

See also:

DAQ/Timing/Control Card (DTC), Eric Hazen,

uTCA Aux Card Status - TTC+S-LINK64, Tom Gorski

Page 11: Trigger Upgrade Planning

11

Tongue 1:

AMC port 1

DAQ

GbE

Tongue 2:

AMC port 3

TTC / TTS

Tongue 2:

AMC Clk3

LHC Clock

VadaTech CEO / CTO at

CERN Nov 19th

Page 12: Trigger Upgrade Planning

12

SFP+ (10GbE)

Local DAQ

SFP+ (10GbE)

Global DAQ

V6

XC6VLX240T

24 links

TTC MagJack

12

CLK40 to

AMC CLK3

MCH2-T2

12

TTC to

AMC Port 3 Rx12

TTS from

AMC Port 3 Tx12

CLK2

(Unused)12

Head

er – 80 way

MCH2-T1

Head

er – 80 way

DAQ from AMC Port 1

Advantages:

- All services on just one card

- High Speed Serial on just Toungue 1

- Dual DAQ outputs

- Tounges 3/4 (fat, ext-fat pipes)

spare for other apps

CMS MCH: Service

Needs to be designed....

Page 13: Trigger Upgrade Planning

13

– Ideally wish to install, commission and test new trigger system in parallel with existing trigger

– Following is an example of how this might happen

• HCAL Upgrade + Test Time Multiplexed Trigger • ECAL Upgrade• Full Time Multiplexed Trigger System• Incorporate TK + MU Trigger

Installation scenario

Page 14: Trigger Upgrade Planning

14

Current System

ECAL

TPG

ECAL HCAL

RCT

4x 1.2Gb/s 4x 1.2Gb/s

HCAL

TPG

To GCT and GT

Copper

Page 15: Trigger Upgrade Planning

15

Stage 1:

HCAL Upgrade

ECAL

TPG

ECAL HCAL

RCT

4x 1.2Gb/s

HCAL

TPG

Tech Trigger to existing GT

Many 4.8Gb/s

Time multiplexed data

1/10 to 1/20 of events

Fibre

New RT+GT

Optical interface to RCT

Page 16: Trigger Upgrade Planning

16

Stage 2:

ECAL Upgrade

ECAL

TPG

ECAL HCAL

RCT

4.8 Gb/s

HCAL

TPG

New RT+GT

Tech Trigger to existing GT

Many 4.8Gb/s

Time multiplexed data

1/10 to 1/20 of events

ECAL TM

Other

ECAL

TPGs

Duplicate

Time Multiplexer Rack

Page 17: Trigger Upgrade Planning

17

Stage 3:Upgrade to New Trig

ECAL

TPG

ECAL HCAL

HCAL

TPG

New RT+GT

ECAL TM

Other

ECAL

TPGs

Duplicate

New RT+GT

New RT+GT

Page 18: Trigger Upgrade Planning

18

Stage 4:

Add Tk + Mu Trig

ECAL

TPG

ECAL HCAL

HCAL

TPG

New RT+GT

ECAL TM

Other

ECAL

TPGs

Duplicate

New RT+GT

New RT+GT

MU

TPG

TK

TPG

Page 19: Trigger Upgrade Planning

19

A single Virtex-6 FPGA CLB comprises two slices, with each containing four 6-input LUTs and eight Flip-Flops (twice the number found in a Virtex-4 FPGA slice), for a total of eight 6-LUTs and 16 Flip-Flops per CLB.

XC5VTX150T XC5VTX240T XC6VLX550T

Links 40 @ 5.0Gb/s 48 @ 5.0Gb/s 36 @ 6.5Gb/s

Slices (k) 23 37 86

Logic Cells (k) 148 240 550

CLB FlipFlops (k) 92 150 687

Distributed RAM (kbits)

1,500 2,400 6,200

BRAM (36kbits) 228 324 632

Cost N/A 4.7k 4.5k

A single Virtex-5 CLB comprises two slices, with each containing four 6-input LUTs and four Flip-Flops (twice the number found in a Virtex-4 slice), for a total of eight 6-LUTs and eight Flip-Flops per CLB

Technology choice.. Use V6 not V5

Page 20: Trigger Upgrade Planning

20

Plan for the nexy 5 years:

– Hardware development• Prototypes

– Procesing Card– Service Card– Optical SLBs– Custom Crate

• Cooling– uTCA crates have

front to back airflow

– Firmware• Serial Protocol• Slow Control• Algorithms

– Software• Low level

drivers• Communication• Interfaces to

legacy systems

– Technical Proposal• Review of Proposal

– Resources Required• Financial / Manpower / Time

System

Page 21: Trigger Upgrade Planning

21

Road MapNo Gant Chart yet...

But you wouldn’t be able to read it...

4/2010

4/2012

Technical Trigger

Proposal

Delivery of uTCA

Regional Trigger Crate

and HCAL HW

Delivery of Matrix II

and OSLBs

Integration of

OSLBs in USC55

Demo system with

HCAL HW &

Matrix I cards

904 system with

Matrix II cards

4/2011

USC55 trigger

partition

Firmware &

Software

DevelopmentHardware

Development

Page 22: Trigger Upgrade Planning

22

End of Part I

=> Trigger Upgrade Proposal by April 2010

VadaTech CEO / CTO at

CERN Nov 19th

The next bit is

confusing so

before I lose you...

Page 23: Trigger Upgrade Planning

23

Implications for on TPGs

Can we put time multiplexing inside TPG FPGA?

– Maximum number of trigger partitions limited by latency.• Assume between 9 and 18 partitions

– Hence each TPG requires a minimum of 9 outputs• Each fibre carrying 8 towers/bx x 9 bx = 72 towers • Transmist 4 towers in η over ¼ of φ

– Assumed time multiplex in φ, but η also possible

– But HCAL TPG designed to generate 36-40 towers• This is very impresive, but still need factor of 2• Revist TPG idea...

Page 24: Trigger Upgrade Planning

24

All TPGs

792 links

~3000 links

TPG

18

~72

TPG

9

36

Approx

4:1 input to output ratio

Too much input data?

No, too many serial links.

But only just....

Close to FPGA data BW limit

Assumes input is 3.36 Gb/s

effective GBT

Trigger partitions reduced to 9.

Requires 5.0Gb/s links.

Not possible with GBT protocol

(insufficient FPGA resources)

TPG design Assumed TPG is single FPGA

Why: simplicity, board space, power,

Page 25: Trigger Upgrade Planning

25

TPG

9

36

Possible, but not with

GBT protocol

TPGs with GBT protocol...

TPG

360 LVDS pairs

@ 320Mb/s

GBT

36x GBTs

power ~36W to 72W

real estate = 92cm2

cost = 4320 SFr

TPG

180 LVDS pairs

@ 640Mb/s

Parallel

GBT

Can select output clk

i.e. doesn’t have to be LHC

Will future FPGAs have std I/O?

XC6VLX240T IO = 720 (24,0)

36

36

99

?

Page 26: Trigger Upgrade Planning

26

TPG

3

12

The alternative:

Time Multiplexing Rack

Time Multiplex

– Use FPGA to decode GBT data• Only use ½ of available links to

decode GBT otherwise no logic left for other processing

– Use different GTX blocks for GBT-Rx and RT-Tx inside TPG

• Allows trigger to decouple from LHC clock if required.

– CMS arranged in blocks of constant φ strips

• Would allow mapping to constant η strips

– Consequence• At least x3 to x5 more HCAL

hardware• Time muliplex rack• Larger latency

3

12

TPG

2x18

... x12 TPGs

3x12 4x9or or

Page 27: Trigger Upgrade Planning

27

Fibre Patch

Time multiplex rack – Assumes tower resolution from HF

• 18x22 regions

• 16 towers per region

• 792 fibres

• 12bit data, 8B/10B, 5Gb/s

– Time multiplex card may have

• 36 in/out (2 crates x11 cards)

• 18 in/out (4 crates x 11 cards)

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

Fibre Patch

uTCA Crate – 11x TMs

uTCA Crate – 11x TMs

36U

or

48U

12 way ribbon = ¼ η, 1 region in φ 9 ribbons can cover ½ φ Hence patch-panel = ¼ η, ½ φNot n

eeded

for H

CAL if w

e get

x2

BW a

t TPG

Page 28: Trigger Upgrade Planning

28

Are there are alternatives to Time Multiplex Racks if HCAL TPG stays the same...

– Yes: • e.g. return to concept of fine/coarse processing for

electrons/jets processing• i.e. jets at 2x2 tower resolution

– Requires less sharing (e.g. just 2, rather than 4 tower overlap for fine processing)

Alllows a more parallel architectureTPG would have to provide 2 towers in η and ¼ of φ

But... Is it wise to separate fine & coarse procesing ?

Still requires some thought

& discussion with TPG experts

Page 29: Trigger Upgrade Planning

29VadaTech CEO / C

TO at CERN N

ov 19th End...

Vadatech VT891

Page 30: Trigger Upgrade Planning

30

Extra

Page 31: Trigger Upgrade Planning

31

½ region in η, all φ regions OR 1 region in η, ½ of all φ regions

Mux: Put all data from 1bx into single FIFO

Single fibre to each processing blade

Cavern Data into TPG

n+0 n+1 n+2 n+3 n+4 n+5 n+6 n+7 n+8

Page 32: Trigger Upgrade Planning

32

Decode GBT– Only instrument 20/24 links of XC5VFX200T, 96% logic cell usage

• F. Marin, TWEPP-09. Altera results slightly better

– Cannot use GBT links to drive data into processing FPGA– Options

• (a) Use powerful FPGAs to simply to decode GBT protocol– Seems wasteful. – SerDes Tx/Rx may have to use same clk

• (b) Use mulltiple GBTs– Diff pairs = 200 @ 320Mb/s lanes. OK if diff pairs remain on FPGAs!– Power = 20W (assume less power because less I/O)– Components = 20 (16 mm x 16 mm, 0.8 mm pitch)– Cost = 120x20 = 2400 SFr !

• (c) Use Xilinx SIRF both on & off detector – If they let us...