SKA Phase 1: Costs of Computation. Duncan Hall CALIM 2010

Similar documents
Memo 65 SKA Signal processing costs

Overview of the SKA. P. Dewdney International SKA Project Engineer Nov 9, 2009

Smart Antennas in Radio Astronomy

March Phased Array Technology. Andrew Faulkner

Multi-octave radio frequency systems: Developments of antenna technology in radio astronomy and imaging systems

May AA Communications. Portugal

Dense Aperture Array for SKA

Recent progress in EVLA-specific algorithms. EVLA Advisory Committee Meeting, March 19-20, S. Bhatnagar and U. Rau

Towards SKA Multi-beam concepts and technology

Memo 130. SKA Phase 1: Preliminary System Description

ASKAP Industry technical briefing. Tim Cornwell, ASKAP Computing Project Lead Australian Square Kilometre Array Pathfinder

Wide-Band Imaging. Outline : CASS Radio Astronomy School Sept 2012 Narrabri, NSW, Australia. - What is wideband imaging?

November SKA Low Frequency Aperture Array. Andrew Faulkner

GPU based imager for radio astronomy

The SKA LOW correlator design challenges

Focal Plane Arrays & SKA

Plan for Imaging Algorithm Research and Development

ngvla Technical Overview

Correlator Development at Haystack. Roger Cappallo Haystack-NRAO Technical Mtg

All-Digital Wideband Space-Frequency Beamforming for the SKA Aperture Array

Focal Plane Array Beamformer for the Expanded GMRT: Initial

How small can you get? reducing data volume, retaining good imaging

Phased Array Feeds A new technology for wide-field radio astronomy

SOFTWARE CORRELATOR CONCEPT DESCRIPTION

Practical Aspects of Focal Plane Array Testing

A model for the SKA. Melvyn Wright. Radio Astronomy laboratory, University of California, Berkeley, CA, ABSTRACT

ATCA Antenna Beam Patterns and Aperture Illumination

SKA1 low Baseline Design: Lowest Frequency Aspects & EoR Science

SKA station cost comparison

Technology Drivers, SKA Pathfinders P. Dewdney

images with ASKAP Max Voronkov ASKAP So(ware scien1st 20 November 2012 Astronomy and Space Science

SOFTWARE AND COMPUTING CONCEPT DESIGN REVIEW PLAN

Roshene McCool Domain Specialist in Signal Transport and Networks SKA Program Development Office

The US Technology Development Project for the SKA. TDP Progress Report. SKA 2010, Manchester

EVLA Memo 146 RFI Mitigation in AIPS. The New Task UVRFI

The SKA New Instrumentation: Aperture Arrays

Memo 111. SKADS Benchmark Scenario Design and Costing 2 (The SKA Phase 2 AA Scenario)

LOFAR: From raw visibilities to calibrated data

ASIC BASED PROCESSING FOR MINIMUM POWER CONSUMPTION CONCEPT DESCRIPTION FOR PHASE 1

NRC Herzberg Astronomy & Astrophysics

The Australian SKA Pathfinder Project. ASKAP Digital Signal Processing Systems System Description & Overview of Industry Opportunities

Components of Imaging at Low Frequencies: Status & Challenges

LOFAR Data Products. First LOFAR Data Processing School 10 February Michael Wise

SKA-low and the Aperture Array Verification System

Wide-field, wide-band and multi-scale imaging - II

ASKAP Phased Array Feed Digital Beamformer Design Overview and Performance Characteristics

The North America Array

Phased Array Feeds A new technology for multi-beam radio astronomy

Parallel Programming I! (Fall 2016, Prof.dr. H. Wijshoff)

Sideband Smear: Sideband Separation with the ALMA 2SB and DSB Total Power Receivers

Array noise temperature measurements at the Parkes PAF Test-bed Facility

Image-Domain Gridding on Accelerators

White Rabbit in Radio Astronomy

EVLA and LWA Imaging Challenges

Calibration. (in Radio Astronomy) Ishwara Chandra CH NCRA-TIFR. Acknowledgments:

Error Recognition Emil Lenc (and Arin)

A Multi-Fielding SKA Covering the Range 100 MHz 22 GHz. Peter Hall and Aaron Chippendale, CSIRO ATNF 24 November 2003

Radio Astronomy: SKA-Era Interferometry and Other Challenges. Dr Jasper Horrell, SKA SA (and Dr Oleg Smirnov, Rhodes and SKA SA)

Phased Array Feeds & Primary Beams

Planning (VLA) observations

Software Correlators for Dish and Sparse Aperture Arrays of the SKA Phase I

Calibratability and its impact on configuration design for the LOFAR and SKA phased array radio telescopes

Phased Array Feeds for the SKA. WP2.2.3 PAFSKA Consortium CSIRO ASTRON DRAO NRAO BYU OdP Nancay Cornell U Manchester

An Accurate phase calibration Technique for digital beamforming in the multi-transceiver TIGER-3 HF radar system

Software Spectrometer for an ASTE Multi-beam Receiver. Jongsoo Kim Korea Astronomy and Space Science Institute

High Fidelity Imaging of Extended Sources. Rick Perley NRAO Socorro, NM

Imaging and Calibration Algorithms for EVLA, e-merlin and ALMA. Robert Laing ESO

Specifications for the GBT spectrometer

Space-Time Adaptive Processing Using Sparse Arrays

Wide Bandwidth Imaging

Wide-band Wide-field Imaging

EVLA Memo 105. Phase coherence of the EVLA radio telescope

LOFAR: Special Issues

ARRAY DESIGN AND SIMULATIONS

Signal Processing on GPUs for Radio Telescopes

LOFAR Calibration of the Ionosphere and Other Fun Things

Inter-Cell Interference Mitigation in Cellular Networks Applying Grids of Beams

Imaging Simulations with CARMA-23

Radio frequency interference mitigation with phase-only adaptive beam forming

Recent imaging results with wide-band EVLA data, and lessons learnt so far

Aperture Arrays for the SKA: the SKADS White Paper

Data processing with the RTS A GPU-accelerated calibration & imaging stream processor

Next Generation Very Large Array Memo No. 16 More on Synthesized Beams and Sensitivity. C.L. Carilli, NRAO, PO Box O, Socorro, NM

Recent Developments in Measuring Signal and Noise in Phased Array Feeds at CSIRO

High Performance Computing and Modern Science Prof. Dr. Thomas Ludwig

Cross Correlators. Jayce Dowell/Greg Taylor. University of New Mexico Spring Astronomy 423 at UNM Radio Astronomy

"#$!%&'()$!*+,-.$/)$!0))(1!

Chalmers Publication Library

SPDO. Phase 1 System Requirements Specification (SyRS) Tim Stevenson SPDO System Engineer

Very Long Baseline Interferometry

New Zealand evlbi. Tim Natusch,Sergei Gulyaev, Stuart Weston, Hiroshi Takiguchi

Random Phase Antenna Combining for SETI SETICon03

Beamforming for IPS and Pulsar Observations

An FPGA-Based Back End for Real Time, Multi-Beam Transient Searches Over a Wide Dispersion Measure Range

Integrated receivers for mid-band SKA. Suzy Jackson Engineer, Australia Telescope National Facility

Instrument Requirements and Options for Meeting the Science Opportunities MHz P. Dewdney A. Gray, B. Veidt

CS Computer Architecture Spring Lecture 04: Understanding Performance

Modelling and Simulation of Conical Spiral Antennas

Status of LOFAR. Ronald Nijboer (ASTRON) On behalf of the LOFAR team

Some Notes on Beamforming.

Transcription:

SKA Phase 1: Costs of Computation Duncan Hall CALIM 2010 2010 August 24, 27

Outline Motivation Phase 1 in a nutshell Benchmark from 2001 [EVLA Memo 24] Some questions

Amdahl s law overrides Moore s law! Let T S be the time spent on all operations and moves in serial Let p be the number of processors operating in parallel Let f be the fraction of operations performed in parallel Then the time for processing in parallel, T P, is given by: T P T S x [ (1-f )+ f / p ]

How much is an Exaflop? 3 x 10 11 stars 1 Exaflop = 10 18 32-bit floating point operations per second 10 18 number of stars in 3 million milky way galaxies

Pushing the Flops envelope: ~100s PFlops Performance [TFlops] = 0.055e 0.622(year-1993) SKAΦ1 Cornwell and van Diepen Scaling Mount Exaflop: from the pathfinders to the Square Kilometre Array http://www.atnf.csiro.au/people/tim.cornwell/mountexaflop.pdf

1,000,000,000 (Exaflop) 100,000,000 Gigaflops Notes: [1] MACs FLOPs; [2] Lines for ASIC and FPGA are for devices only 1,000,000 Mflops/Watt 2010 22nm ASIC: ~400 GMACs/Watt 100+ fold efficiency increase required Desired and Forecast Mflops/Watt and Green500 500 Mflops/Watt speed incr rease 10,000,000 2010 FPGA: 25 GMACs/Watt 1,000,000 (Petaflop) 2010 World's most powerful computers ~ 1,000 fold 100,000 10,000 10 Mflops/Watt 2010Jun 2009Nov 2009Jun 2008Nov 2008Jun Sources: http://www.green500.org/ http://www.top500.org/lists/2010/06 accessed 2010Jun1 kwatts 1,000 (Teraflop) 10 100 1,000 10,000 100,000

CPU cabinets for ~1 petaflop Cray Jaguar occupy 560+ square metres

Satellite view of data centre building: chillers on roof; ~1,000 square metres per floor B ildi t f d t t 10 000 t Building cost for data centres: ~ 10,000 per square metre Include power, (-H)VAC, data storage, telecommunications, security...

Outline Motivation Phase 1 in a nutshell: 3,000,000, : 1 dynamic range in ~2018 Benchmark from 2001 [EVLA Memo 24] Some questions

Dynamic range: historical progress and target for SKA Phase 1: 10,000,000 1,000,000 100,000 10,000 Kemball: Array Calibration SA SKA 2009 Smirnov: Luxury Problems of High Dynamic Range Imaging SKA 2010 1,000 1980 1985 1990 1995 2000 2005 2010 2015 2020

Outline Motivation Phase 1 in a nutshell Benchmark from 2001 [EVLA Memo 24] Some questions

T. Cornwell EVLA Memo 24: Computing for EVLA Calibration and Imaging, 2001 January12 2001 algorithm performance:

Outline Motivation Phase 1 in a nutshell Benchmark from 2001 [EVLA Memo 24] Some questions

At first order, only a few key parameters define Phase 1 computing: Description Assumption or Derivation Reference Units Dishes D+WBSPFs Sparse Aas Sum Maximum baseline length 2 x maximum radius of 100 km SKA_phase1_definition_v0 1 metres 200.0E+3 200.0E+3 Dish or station diameter SKA_phase1_definition_v0 1 metres 15 180 Number of dishes or stations n SKA_phase1_definition_v0 1 250 50 Number of unique baselines Calculated: n (n 1)/2 31,125 1,225 Maximum frequency of operation SKA_phase1_definition_v0 2 Hertz 2.0E+9 450.0E+6 Minimum frequency of operation Only one Feed available at a time SKA_phase1_definition_v0 2 Hertz 1.0E+9 70.0E+6 Fractional bandwidth Astro2010; DRM 1.0 1.0 Instantaneous bandwidth (Max freq - Min freq) x Fractional bandwidth SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+9 380.0E+6 Frequency resolution SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+3 1.0E+3 Number of frequency channels SKA_phase1_definition_v0 2 67.0E+3 67.0E+3 Number of beams formed per dish or station SKA_phase1_definition_v0 1 1 480 Number of polarisation products 4 4 Number of floats per complex float 2 2 Calculated parameter for use in Smearing (Maximum baseline length) / (Dish or station diameter) 13.3E+3 1.1E+3 SKA_phase1_definition_v0 2 Hertz 5.0E+0 250.0E-3

Estimated hardware for Phase 1 ranges into hundreds of petaflops Description Assumption or Derivation Reference Units Dishes D+WBSPFs Sparse Aas Sum Maximum baseline length 2 x maximum radius of 100 km SKA_phase1_definition_v0 1 metres 200.0E+3 200.0E+3 Dish or station diameter SKA_phase1_definition_v0 1 metres 15 180 Number of dishes or stations n SKA_phase1_definition_v0 1 250 50 Number of unique baselines Calculated: n (n 1)/2 31,125 1,225 Maximum frequency of operation SKA_phase1_definition_v0 2 Hertz 2.0E+9 450.0E+6 Minimum frequency of operation Only one Feed available at a time SKA_phase1_definition_v0 2 Hertz 1.0E+9 70.0E+6 Fractional bandwidth Astro2010; DRM 1.0 1.0 Instantaneous bandwidth (Max freq - Min freq) x Fractional bandwidth SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+9 380.0E+6 Frequency resolution SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+3 1.0E+3 Number of frequency channels SKA_phase1_definition_v0 2 67.0E+3 67.0E+3 Number of beams formed per dish or station SKA_phase1_definition_v0 1 1 480 Number of polarisation products 4 4 Number of floats per complex float 2 2 Calculated parameter for use in Smearing (Maximum baseline length) / (Dish or station diameter) 13.3E+3 1.1E+3 SKA_phase1_definition_v0 2 Hertz 5.0E+0 250.0E-3 Assume pipeline processing in near realtime Dump rate in floating point numbers All visibilities have the same limiting dump rate floats/sec 83.4E+9 78.8E+98E+9 162.2E+92E+9 Required flops per float - optimistic Assume can achieve 10 7 dynamic range (?) Advice from ASTRON, CSIRO, TDP-CPG 100,000 100,000 Required flops per float - pessimistic Assume can achieve 10 7 dynamic range (?) Advice from ASTRON, CSIRO, TDP-CPG 400,000 400,000 Required flops - optimistic 8.3E+15 7.9E+15 16.2E+15 Required flops - pessimistic 33.4E+15 31.5E+15 64.9E+15 Estimated HPC efficiency - optimistic Refer to [A] at bottom of this colmn 20091116 news release from Cray 50% 50% Estimated HPC efficiency - realistic Refer [B] at bottom of this column Hoisie et al; DOI: 10.1177/109434200001400405 10% 10% Required HPC flops - optimistic Calculated 16.7E+15 15.8E+15 32.4E+15 Required HPC flops - pessimistic Calculated 333.7E+15 315.2E+15 648.8E+15

CPG Memo 3 (2009-11-6) confirms requirements for extreme scale computing:

One driver: smearing <2% Description Assumption or Derivation Reference Units Dishes D+WBSPFs Sparse Aas Sum Maximum baseline length 2 x maximum radius of 100 km SKA_phase1_definition_v0 1 metres 200.0E+3 200.0E+3 Dish or station diameter SKA_phase1_definition_v0 1 metres 15 180 Number of dishes or stations n SKA_phase1_definition_v0 1 250 50 Number of unique baselines Calculated: n (n 1)/2 31,125 1,225 Maximum frequency of operation SKA_phase1_definition_v0 2 Hertz 2.0E+9 450.0E+6 Minimum frequency of operation Only one Feed available at a time SKA_phase1_definition_v0 2 Hertz 1.0E+9 70.0E+6 Fractional bandwidth Astro2010; DRM 1.0 1.0 Instantaneous bandwidth (Max freq - Min freq) x Fractional bandwidth SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+9 380.0E+6 Frequency resolution SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+3 1.0E+3 Number of frequency channels SKA_phase1_definition_v0 2 67.0E+3 67.0E+3 Number of beams formed per dish or station SKA_phase1_definition_v0 1 1 480 Number of polarisation products 4 4 Number of floats per complex float 2 2 Calculated parameter for use in Smearing (Maximum baseline length) / (Dish or station diameter) 13.3E+3 1.1E+3 SKA_phase1_definition_v0 2 Hertz 5.0E+0 250.0E-3 Assume pipeline processing in near realtime Dump rate in floating point numbers All visibilities have the same limiting dump rate floats/sec 83.4E+9 78.8E+98E+9 162.2E+92E+9 Required flops per float - optimistic Assume can achieve 10 7 dynamic range (?) Advice from ASTRON, CSIRO, TDP-CPG 100,000 100,000 Required flops per float - pessimistic Assume can achieve 10 7 dynamic range (?) Advice from ASTRON, CSIRO, TDP-CPG 400,000 400,000 Required flops - optimistic 8.3E+15 7.9E+15 16.2E+15 Required flops - pessimistic 33.4E+15 31.5E+15 64.9E+15 Estimated HPC efficiency - optimistic Refer to [A] at bottom of this colmn 20091116 news release from Cray 50% 50% Estimated HPC efficiency - realistic Refer [B] at bottom of this column Hoisie et al; DOI: 10.1177/109434200001400405 10% 10% Required HPC flops - optimistic Calculated 16.7E+15 15.8E+15 32.4E+15 Required HPC flops - pessimistic Calculated 333.7E+15 315.2E+15 648.8E+15

SKA DRM v. 1.0 2010 March 16 Where does the smearing <2% come from?

SKA DRM v. 1.0 2010 March 16 The DRM asserts that smearing shall be <2%

Example SKA Phase 1 dish configurations: 0.3 ~ 3 dumps/s? Tangential u v Smearing as a function of Dump Rate and (Receptor Beamwidth/Array Resolution) Smearing: 1 Relative Amplitude 1% 15 km / 15 m dish 200 km / 15 m dish 2% smearing criterion 1,000 13,333 30,000 10% 100,000 200,000 300,000 500,000 1,000,000 (Beamwidth/ Arrray Resolution) = (Baseline Length / Receptor Diameter) Correlator Dumps per Second 100% 1E 1 0.25 Dumps s 1 1E+0 3.3 Dumps s 1 1E+1 1E+2

Bridle and Schwab s approximations: Bridle and Schwab 1999: Bandwidth and Time Average Smearing ; Synthesis Imaging in Radio Astronomy II, pp. 380-381

http://astronomy.swin.edu.au/~elenc/calculators/wfcalc.php Emil Lenc s online calculator:

SKA DRM v. 1.0 2010 March 16 But is <2% smearing sufficient for DR = 65dB for SKA Phase 1?

Outline Motivation Benchmark from 2001 [EVLA Memo 24] Phase 1 in a nutshell Some more questions

The flops per uv float question: Description Assumption or Derivation Reference Units Dishes D+WBSPFs Sparse Aas Sum Maximum baseline length 2 x maximum radius of 100 km SKA_phase1_definition_v0 1 metres 200.0E+3 200.0E+3 Dish or station diameter SKA_phase1_definition_v0 1 metres 15 180 Number of dishes or stations n SKA_phase1_definition_v0 1 250 50 Number of unique baselines Calculated: n (n 1)/2 31,125 1,225 Maximum frequency of operation SKA_phase1_definition_v0 2 Hertz 2.0E+9 450.0E+6 Minimum frequency of operation Only one Feed available at a time SKA_phase1_definition_v0 2 Hertz 1.0E+9 70.0E+6 Fractional bandwidth Astro2010; DRM 1.0 1.0 Instantaneous bandwidth (Max freq - Min freq) x Fractional bandwidth SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+9 380.0E+6 Frequency resolution SKA1_Concept_Definition_SSEC_draft.pdf Hertz 1.0E+3 1.0E+3 Number of frequency channels SKA_phase1_definition_v0 2 67.0E+3 67.0E+3 Number of beams formed per dish or station SKA_phase1_definition_v0 1 1 480 Number of polarisation products 4 4 Number of floats per complex float 2 2 Calculated parameter for use in Smearing (Maximum baseline length) / (Dish or station diameter) 13.3E+3 1.1E+3 SKA_phase1_definition_v0 2 Hertz 5.0E+0 250.0E-3 Assume pipeline processing in near realtime Dump rate in floating point numbers All visibilities have the same limiting dump rate floats/sec 83.4E+9 78.8E+98E+9 162.2E+92E+9 Required flops per float - optimistic Assume can achieve 10 7 dynamic range (?) Advice from ASTRON, CSIRO, TDP-CPG 100,000 100,000 Required flops per float - pessimistic Assume can achieve 10 7 dynamic range (?) Advice from ASTRON, CSIRO, TDP-CPG 400,000 400,000 Required flops - optimistic 8.3E+15 7.9E+15 16.2E+15 Required flops - pessimistic 33.4E+15 31.5E+15 64.9E+15 Estimated HPC efficiency - optimistic Refer to [A] at bottom of this colmn 20091116 news release from Cray 50% 50% Estimated HPC efficiency - realistic Refer [B] at bottom of this column Hoisie et al; DOI: 10.1177/109434200001400405 10% 10% Required HPC flops - optimistic Calculated 16.7E+15 15.8E+15 32.4E+15 Required HPC flops - pessimistic Calculated 333.7E+15 315.2E+15 648.8E+15

How big should m x m be?

More questions about the 65 db challenge: How much over sampling is required? How many major cycles are required, worst case? Alternative algorithms for gridding irregularly spaced samples? Empirical work for asymmetric side lobes? Faint sources that may be indistinguishable from imaging artefacts? Automatic flagging and removal of RFI etc.? Other questions: Amdahl s law... I/O data rate e.g. memory bandwidth? Data cache memory requirements? Energy efficiencies of computation and data movement?...?

Even more questions...