Innovative scalable HPC. Ulrich Bruening/Holger Fröning June 2011

Relevanta dokument
Introduction. Innovative scalable HPC. Dr. Holger Fröning October 2010

Datasäkerhet och integritet

Biblioteket.se. A library project, not a web project. Daniel Andersson. Biblioteket.se. New Communication Channels in Libraries Budapest Nov 19, 2007

Dagens OS. Unix, Linux och Windows. Unix. Unix. En översikt av dagens OS Titt på hur de gör. Många varianter Mycket gemensamt. En del som skiljer

802.11b Wireless router w. 4 port switch. StarTech ID: BR411BWDC

Grundläggande rou-ngteknik. F2: Kapitel 2 och 3

3.5in Black USB 3.0 External SATA III Hard Drive Enclosure with UASP for SATA 6 Gbps Portable External HDD

Effektivisering av virtuell produktutveckling med CFD och High Performance Computing (HPC)

IPv6 i Mobilnät. Mattias Karlsson. mattias.karlsson@telenor.com

PRODUKT- OCH PRISLISTA

OFTP2: Secure transfer over the Internet

Beacon BluFi Bluzone. Givarna har mycket hög känslighet och kan mäta mycket små förändringar.

ETS052 Internet Routing. Jens A Andersson

Windows 8 och det nya arbetslivet. Magnus Holmér IT strategisk rådgivare

PFC and EMI filtering

InstalationGuide. English. MODEL:150NHighGain/30NMiniUSBAdapter

Michael Q. Jones & Matt B. Pedersen University of Nevada Las Vegas

Affärsmodellernas förändring inom handeln

SOA One Year Later and With a Business Perspective. BEA Education VNUG 2006

PRODUKT- OCH PRISLISTA

.SE (Stiftelsen för Internetinfrastruktur) Presentation November 2009

Anna Brunström. Hur kan man minska fördröjningarna över Internet? Karlstad University Computer Science

Grundkurs i 5G 5G och sedan då?

What Is Hyper-Threading and How Does It Improve Performance

V-Met IaaS VM [3] [6] IDS [2] IDS IDS [8] [7] [15] [12] [13] IDS. V-Met Xen 4.4. IaaS VM VM IDS IDS IDS IDS IDS IDS IDS IDS

BIG DATA FORSKNINGSCENTER

Alias 1.0 Rollbaserad inloggning

Pulsen IAM: Del 2 Trender och teknik för morgondagens utmaningar. Tobias Ljunggren, PULSEN

Lean & Green Volvo cars. Sofia Boyagi, Operational Development Environment

F1 SBS EC Utbildning AB

A QUEST FOR MISSING PULSARS

Support for Artist Residencies

Cluster VMCLUSTERcpuTotalmhzAverage CPU Mhz

ETSF05 Internetprotokoll. Jens Andersson

Vad är en dator? Introduktion till datorer och nätverk. Pontus Haglund Institutionen för datavetenskap (IDA) 21 augusti 2018

Nationellt stöd för finansiering av mjukvaruberoende innovation ANDREAS ALLSTRÖM

Lär dig sälja framtidens trådlösa. idag

Alternativet är iwindows registret som ni hittar under regedit och Windows XP 32 bit.

Olika OS. Unix, Linux och Windows. Unix. Unix. En översikt av ett par OS. Titt på hur de gör. Många varianter. Mycket gemensamt. En del som skiljer

CS 475: Parallel Programming Introduction

Real world SharePoint 2013 architecture decisions. Wictor Wilén

Grundläggande rou-ngteknik

Vågkraft. Verification of Numerical Field Model for Permanent Magnet Two Pole Motor. Centrum för förnybar elenergiomvandling

SAS VIYA JOHAN ELFMAN ROLAND BALI

P O W E R E D B Y. ServSwitch Agility TM B L A C K B O X K V M o I P E X T E N D E R

Innovation Enabled by ICT A proposal for a Vinnova national Strategic innovation Program

D-RAIL AB. All Rights Reserved.

Datorteknik och datornät. Case Study Topics

DYNAMISK INFRASTRUKTUR

Schenker Privpak AB Telefon VAT Nr. SE Schenker ABs ansvarsbestämmelser, identiska med Box 905 Faxnr Säte: Borås

Gigabit och 10 Gigabit Ethernet. Lösningar för Stadsnätet. Björn Rudberg Vice President Electronic Design

Tillgång till alla globala delar i systemet styrs av denna profil, som i sin tur kopplas till respektive användare.

Overhead of the spin-lock in UltraSPARC T2

Health café. Self help groups. Learning café. Focus on support to people with chronic diseases and their families

Om oss DET PERFEKTA KOMPLEMENTET THE PERFECT COMPLETION 04 EN BINZ ÄR PRECIS SÅ BRA SOM DU FÖRVÄNTAR DIG A BINZ IS JUST AS GOOD AS YOU THINK 05

PRODUKT- OCH PRISLISTA

Förändrade förväntningar

Föreläsning 4 IS1300 Inbyggda system

Diskprestanda Tester

icore Solutions. All Rights Reserved.

Rosetta. Ido Peled. A Digital Preservation System. December Rosetta Product Manager

Medical Informatics Period 2, 2009

Övning 5 ETS052 Datorkommuniktion Routing och Networking

Där hav möter land i Göteborg

Sara Skärhem Martin Jansson Dalarna Science Park

NORDIC GRID DISTURBANCE STATISTICS 2012

ETS052 Internet Routing. Jens A Andersson

5G SOM DIGITAL INFRASTRUKTUR FÖR TRANSPORTSEKTORN

Övning 1: Skapa virtuell maskin för utveckling.

Displaysystem. Hans Brandtberg Saab Avitronics SAAB AVITRONICS

INSTALLATION INSTRUCTIONS

Mobile IT systems and home care applications

Den Danske Bank in Sweden

Hållbar utveckling i kurser lå 16-17

Resultat av den utökade första planeringsövningen inför RRC september 2005

Lågtemperaturfjärrvärme i nya bostadsområden P i samverkan med Växjö kommun, Växjö Energi AB och Växjö-bostäder AB

Lagringssystem. server. arbetsstationer. Katalog Öppettider: 09:00-17:00 alla vardagar.

Smarter Analytics med rätt infrastruktur

balans Serie 7 - The best working position is to be balanced - in the centre of your own gravity! balans 7,45

PowerCell Sweden AB. Ren och effektiv energi överallt där den behövs

The Arctic boundary layer

The road to Recovery in a difficult Environment

Vässa kraven och förbättra samarbetet med hjälp av Behaviour Driven Development Anna Fallqvist Eriksson

USB 3.0 till VGA externt videokort, multiskärmsadapter med USB-hubb med 1 port 1920x1200

Why Steam Engine again??

Transnational spatial monitoring in the Nordic Region

Projekt Laxförvaltning för framtiden & Älvspecifik laxförvaltning Salmon Management for the Future / River Specific Management

Jan Markendahl Docent Kommunikationssystem. Associate Professor Wireless Infrastructure Deployment and Economics Tele-ekonomi

Samarbete i större infrastrukturprojekt med informationsmodeller som bas genom hela BIM baserade processen

Nya möjligheter med M3 Technology. Björn Svensson, Björn Torold

Taking Flight! Migrating to SAS 9.2!

AGIL KRAVHANTERING. Hitta behoven bakom kraven!! Thomas Nilsson! Agile Coach & Mentor! CTO, Responsive

Rekommendation på Router: Budget. Namn: Zyxel NBG418N. Pris 199 kr

Arrowhead - Process- och energisystem- automation

Ingenjörsfirman Stéen ATM Sida 1 av 1

Läkemedelsverkets Farmakovigilansdag

1. CEPT ECC/ERC-beslut

ENERGIEFFEKTIVA BYGGNADER EFTER NÄRA NOLL?

Remote control Fjärrströmbrytare / Fjernstrømbryter

Transkript:

Introduction Innovative scalable HPC Ulrich Bruening/Holger Fröning June 2011

History Design of complex HW/SW systems, Computer Architecture Group, Prof. Dr. U. Brüning, University of Heidelberg Computer architecture Interconnection networks ATOLL cluster EXTOLL project started in 2005 FPGA (Xilinx Virtex4 based) prototype since 2008 Start-up company as a spin-off 2 ATOLL-Die (bonded)

Introduction I Interconnection networks are the key component of parallel systems Prof. Patterson stated: Latency lags Bandwidth Need to significantly lower communication i latency and improve communication in parallel systems Finer grain parallelism PGAS systems Improve scaling EXTOLL project at the CAG Vision: more performance, lower cost for HPC! 3

Introduction II Lowest latency Maximum message rate / s Optimized for multi-core Optimized CPU-Device interface Direct topology Efficiency Innovative architecture Optimized scalability EXTOLL Late ency Conventional network EXTOLL Perform mance Conventional network Message size Network size More Performance for HPC customer 4

Introduction III No external switches Complete own IP Lower cost Lower energy consumption EXTOLL Conventional HPC network 0% 20% 40% 60% 80% 100% 120% Relative price per G/port incl. NIC, cabling and switches for a 324 node cluster lower cost 5

Architecture - Ideas Lean network interface Ultra low latency message exchange Extremely high hardware message rate Small memory footprint Switchless-Design - 3D Torus Direct Network Reliable network High Scalability Extremely efficient, pipelined hardware architecture 6

7 Architecture - Overview

NIC Features VELO: Very fast two sided messaging RMA: Optimized i access to remote memory Local and remote completion notifications for all RMA operations Fully virtualized user space and/or different virtual machines secure Hardware address translation unit (ATU) including TLB 8

Network Features I 64k nodes hundreds of endpoints per node Efficient network protocol low overhead even for very small packets Support for arbitrary direct topologies Choice for implementation: 6 links Natural a topology ogy 3-d dtous torus 9

Network Features II Linkport Net- work- port Linkport Network- port NETWORK EXTOLL Network Switch Linkport Linkport Adaptive and deterministic routing Packets routed deterministically are delivered in-order Three virtual channels (VCs) Four independent traffic classes (TCs) Networkport Linkport Linkport Support for remote configuration and monitoring of EXTOLL nodes without host interaction 10

Reliability & Security Features Reliable message transport Link level retransmission protocol for reliable data transmission All network protocol elements are secured by strong CRCs All internal memory protected by ECC for high reliability Isolation of communication groups in the network Process isolation on the node 11

Additional Features Scalable hardware barriers implemented completely in hardware Global interrupts with low skew Hardware support for multicast operations Non-coherent shared memory features (for PGAS) 12

Architecture -Software Optimized for HPC-Users OS bypass Linux kernel drivers manage resources Low-level API libraries MPI support OpenMPI PGAS support GASNet (prototype) Management software 13

Why do I actually need support y y pp for small messages?

[ [ Use of Small Messages in HPC HPCC RandomAccess - Message Size Histogram WRF - Message Size Histogram 64P8N 32P8N 16P8N 8P8N 4P4N 64P8N 32P8N 16P8N 8P8N 4P4N Distribution 90,00% 80,00% 70,00% 60,00% 50,00% 40,00% 30,00% 20,00% 00% 10,00% 0,00% [0:8[ 16:32[ :128[ [64 [256:5 5 12[ Message Size 2k[ k[ [1k : [4k :8 [16k:32k [64k:128k[ [256k:512k[ 4P4N 8P8N 16P8N 32P8N 64P8N Shift towards smaller messages with increasing process count 15 Distribution 100,00% 90,00% 80,00% 70,00% 60,00% 50,00% 40,00% 30,00% 4P4N 20,00% 00% 8P8N 10,00% 16P8N 0,00% 32P8N [0:8[ [ 1 6:32[ :128[ 512[ 2k[ [64 [256: [1k: Message Size [4k:8 k[ [16k:32k k [ [64k:128k[ [ [ 256k:512k[ 64P8N About 30% of all messages have a size below 512B

A Small Test Drive of our Prototype

Prototype Performance Results EXTOLL FPGA Prototyp Xilinx Virtex 4 based HT400 16-bit 150 MHz core frequency 6 optical links 6 Gbit/s link bandwidth ~ 1 μs end-to-end latency FPGA 90% filled Test machines 2 node Quad Socket Quad-Core Opteron 8354 Barcelona 2.2 GHz 16GB, Supermicro 1041MT2B/H8QME-2+, 2 Open MPI133 1.3.3 Mellanox ConnectX IB SDR and SDR IB Switch, Open MPI 1.3.2 16 node dual-socket Opteron 8380 Shanghai 2.5 GHz 17

EXTOLL Virtex4: WRF V3 90 80 70 60 GFlop/ /s 50 40 30 20 10 0 32 64 96 128 Cores Opteron Cluster with IB DDR Opteron Cluster with EXTOLL R1 18 IB Cluster: Opteron Shanghai 2358 2.6GHz EXTOLL Cluster: Opteron Shanghai 2380 2.5GHz

What s next?

Hardware Roadmap 2011 2012 2013 2014 ASIC 65nm HT2600 (10.4 GB/s) & PCIe Gen3 6 Links: 10 Gbps x12 (>10 GB/s) Zoncolan small volumes Galibier small volumes Ventoux small volumes Tourmalet large quantities Altera Stratix4 FPGA PCIe G2 x8 (4.0 GB/s) 6 Links: 2.5 Gbps x4 (1.0 GB/s) Xilinx Virtex6 FPGA PCIe G2 x4 (2.0 GB/s) 4 Links: 6 Gbps x4 (1.6 GB/s) Xilinx Virtex6 FPGA HT600 (2.4 GB/s) 6 Links: 6 Gbps x4 (1.6 GB/s) 20

Performance Metric EXTOLL Ventoux EXTOLL Tourmalet IB Gigabit Ethernet 10G Ethernet Start-up latency Hop latency <1.0us <0.5us* 1.2us 30us >10us <150ns <50ns* 100-450ns >1000ns >500ns Peak BW 16GB/ 1.6GB/s 10GB/s* 4-8GB/s 125MB/s 1.25GB/s Message Rate >25M/s ~100M/s 7-23.7M/s <2.5M/s (*) values from simulation 21

Next Generation: Ventoux Ultra-low Latency! O t t di Message Outstanding M Rate! R t! Presented at ISC 2011 in Hamburg 22

Thank You! Prototype cluster, Mannheim Prototype cluster, Valencia, Detail Prototype cluster, Valencia Computer Architecture Group, University of Heidelberg 23