Signal Processing on GPUs for Radio Telescopes
|
|
- Cameron Haynes
- 6 years ago
- Views:
Transcription
1 Signal Processing on GPUs for Radio Telescopes John W. Romein Netherlands Institute for Radio Astronomy (ASTRON) Dwingeloo, the Netherlands 1
2 Overview radio telescopes motivation processing pipelines signal-processing algorithms filter, correlator, beam forming, etc. performance 2
3 The LOFAR Radio Telescope 3
4 The LOFAR Radio Telescope largest low-frequency telescope no dishes distributed sensor network ~ receivers 4
5 LOFAR: A Software Telescope all-sky view antennas new science opportunities different observation modes require flexibility digitally steered concurrent observations supercomputer real time standard imaging pulsar survey known pulsar epoch of re-ionization transients ultra-high energy particles... 5
6 LOFAR SuperTerp 6
7 What's Next? The Square Kilometre Array world-wide effort unprecedented size 3,000 dishes + aperture array South Africa + Australia : 10% SKA : 100% SKA exascale computing; petascale I/O O(104) O(105) > LOFAR many challenges! 7
8 Our Efforts build/maintain/operate current telescopes (LOFAR, WSRT) research for SKA 8
9 Motivation 1) bring GPU technology to LOFAR 2) do accelerator research for the SKA 9
10 ASTRON/IBM Dome research technologies to develop the SKA green computing nano-photonics data & streaming P1 algorithms & machines P2 nano-photonics P3 access patterns P4 microservers P5 accelerators P6 compressed sampling P7 real-time communication 10
11 Accelerator Research accelerators for (radio-astronomical) signal processing GPUs, Xeon Phi, BG/Q, FPGAs,... fundamental understanding of accelerators which properties make architecture (in)efficient? I/O-compute balance? energy efficiency? programmability? architecture-(in)dependent optimizations? devise new algorithms? generic approach to program accelerators, or ad-hoc? applications: 1) LOFAR 2) SKA 11
12 LOFAR Data Processing developing new GPU-based system 12
13 Correlator Processing Overview 13
14 More Detail complex piece of software! several pipelines 14
15 Implement GPU Kernels killing two birds 1) develop new LOFAR correlator 2) code base for accelerator research 15
16 Why We Use OpenCL OpenCL advantages disadvantages vendor independent poor library support (e.g., FFT) GPU: runtime compilation cannot use all GPU features (e.g., GPUdirect) GPU: float8, float16, swizzling CPU: nice C++ interface (exceptions) float2 input_samples[nr_stations][nr_channels][nr_times][nr_polarizations]; float16 sums, weights; sums += weights.s abcdef012 * sample; 16
17 Why We Are Disappointed s l OpenCL advantages disadvantages o o t t vendor independent poor library support (e.g., FFT)en m p GPU: runtime compilation cannot use all GPU features o (e.g., GPUdirect) l e ev d GPU: float8, float16, swizzling n i t CPU: nice C++ interface (exceptions) por p u s L C n e p O d e p p float2 input_samples[nr_stations][nr_channels][nr_times][nr_polarizations]; r o r e l d i f A o I r D p I l V a float16 weights; N sums, u s i v sums += weights.s abcdef012 * sample; 17
18 GPUs FirePro S10000: driver/vbios/hardware (?) issues cannot always overlap compute PCIe I/O Xeon Phi: immature results NVIDIA Tesla K10 AMD FirePro S10000 Intel Xeon Phi (ECC off) FPUs SP TFLOPS GB/s W
19 From Feasibility Study to Production Software concentrated mostly on GPU kernels CPU parts: now/later 19
20 Implemented Kernels most important: filter correlator beam former 20
21 Processing Pipelines imaging mode sky images pulsar modes in development UHEP mode detect Ultra-High Energy Particles experimental 21
22 Processing Pipelines imaging mode sky images pulsar modes in development UHEP mode detect Ultra-High Energy Particles experimental 22
23 Processing Pipelines imaging mode sky images pulsar modes in development UHEP mode detect Ultra-High Energy Particles experimental 23
24 Imaging (Correlator) Pipeline 24
25 Imaging (Correlator) Pipeline four GPU kernels 25
26 Poly-Phase Filter (PPF) Bank splits frequency band into channels like prism trades time for frequency resolution 26
27 Poly-Phase Filter (PPF) Bank FIR filter + FFT 27
28 1) Finite Impulse Response (FIR) Filter history & weights (in registers) no physical shift reorder output for next kernel uncoalesced writes many FMAs operational intensity = 6.4 FLOPs / byte 28
29 FIR Filter Performance FirePro S10000 Tesla K GB/s TFLOPS 1.5 FirePro S10000 Tesla K #stations #stations 60 K10: stations small #threads 29 80
30 2) FFT 1D complex complex typically: tweaked Apple FFT library 30
31 2) FFT Performance FirePro S10000 Tesla K GB/s TFLOPS 1.5 FirePro S10000 Tesla K #stations #stations 60 memory I/O bound 31 80
32 3) Phase & BandPass Correction combined kernel: clock correction delay compensation bandpass correction transpose reduces #device memory accesses 32
33 3a) Clock Correction stations: shared clock corrects cable length errors merge with next step (phase delay) 33
34 3b) Delay Compensation (a.k.a. Tracking) track observed source delay telescope data delay varies due to earth rotation shift samples remainder: rotate phase (= cmul) 0.75 FLOPs / byte 34
35 3c) BandPass Correction powers in channels unequal artifact from station processing multiply by channel-dependent weight FLOPs / byte power frequency channel
36 Phase & BandPass Correction FirePro S10000 Tesla K GB/s TFLOPS 1.5 FirePro S10000 Tesla K #stations FLOPs / byte #stations 60 memory I/O bound 36 80
37 4) Correlator Kernel multiply samples from each station pair integrate ~1s 37
38 4) Correlation Triangle multiply samples from each station pair integrate ~1s 38
39 4) Correlator Implementation global memory local memory 1 thread per station pair (dual pol) 1 FLOP / byte (local mem) 39
40 4) Optimized Correlator Implementation increase register reuse 1 thread: 2x2 stations (dual pol) 2 FLOPs / byte (local mem) also: 3x3, 4x4 40
41 Correlator Register Usage #accumulator registers 1x1 8 2x2 32 3x3 72 max. registers/thread Tesla K10 FirePro S x3 on K10: excessive spilling 41
42 Correlator #Threads x1 2x2 3x3 #threads 768 max #threads Tesla K10 FirePro S , #stations cannot use all threads in last warp too few threads low occupancy too many threads multiple passes multiple thread blocks 42
43 1x1 vs. 2x2 and 3x3 (FirePro S10000) 2 1x1 2x2 3x3 300 GB/s TFLOPS 1.5 1x1 2x2 3x #stations #stations 60 use whichever performs best 43 80
44 Correlator Performance FirePro S10000 Tesla K GB/s TFLOPS 1.5 FirePro S10000 Tesla K compute bound 40 #stations #stations 60 S10000: multiple passes 44 80
45 Combined Pipeline #pragma omp parallel num_threads(2 * nrgpus) { cl::commandqueue queue(...); cl::buffer input(...), output(...),...; 2 queues/gpu: overlap I/O & computations 1 host thread/queue: easy model #pragma omp for schedule(dynamic) for (int subband = 0; subband < nrsubbands; subband ++) { receive(input, subband); queue.enqueuewritebuffer(input,...); queue.enqueuendrangekernel(fir_filter,...); queue.enqueuendrangekernel(fft,...); queue.enqueuendrangekernel(delay_bandpass,...); queue.enqueuendrangekernel(correlator,...); queue.enqueuereadbuffer(output,...); send(output, subband); } } 45
46 Correlator Pipeline Tesla K10 Performance Breakdown most challenging observation mode 8-bit samples 488 subbands (95.3 MHz) correlations most expensive need ~11 Tesla K10s? Correlate Phase & BandPass Correction FFT FIR filter #stations #GPUs 10
47 Large #Stations LOFAR SKA #stations #correlations ~80 ~12,960 3,000 #threads #thread blocks (K10) ~ ,000,000 1,125,000 1,000 different approach LOFAR SKA 47
48 Large #Stations: Another Strategy 192x192 stations (squares + rectangles) 2x2 stations (dual pol) 1 thread read samples stations 32x32 stations (16x16 threads) 32 FLOPs / byte 48
49 Large #Stations: Correlator Performance FirePro S10000 Tesla K GB/s TFLOPS #stations FirePro S10000 Tesla K #stations 640 to do: Tesla K20X possibly faster
50 Pulsar Pipelines 50
51 Pulsar Pipelines 1) search unknown pulsars 2) observe known pulsar many narrow beams credit: Jason Hessels 51
52 Pulsar Pipelines in development 52
53 Coherent Beam Forming add stations > sensitivity weighed compensate Δt change phase different Δt different beam many beams from same input CV beam = W beam, stat S stat stat 53
54 Coherent Beam Forming Implementation global memory local memory each thread: 1 beam, 1 polarization station-dependent weights in registers 2 passes of 24 stations 48 stations, 128 beams 14.2 FLOPs / byte CV beam = W beam, stat S stat stat 54
55 Coherent Beam Forming Performance 2.5 FirePro S10000 Tesla K10 FirePro S10000 Tesla K GB/s TFLOPS #beams stations #beams 96 sawtooth caused by unused threads
56 Ultra-High Energy Particle (UHEP) Pipeline 56
57 UHEP Pipeline store raw antenna voltages in 1.3s circular buffer create ~50 beams on moon detects ev particle collisions trigger in one/few adjacent beams: freeze/dump captured antenna voltages within 1.3s??? highly experimental pipeline Image Courtesy L. Viatour 57
58 UHEP Pipeline create ~50 beams on moon inverse filter for high time resolution trigger peak detection anti-coincidence check freeze raw antenna data buffer max 1.3s latency! 58
59 UHEP Pipeline create ~50 beams on moon inverse filter for high time resolution trigger peak detection anti-coincidence check freeze raw antenna data buffer max 1.3s latency! 59
60 UHEP Pipeline create ~50 beams on moon inverse filter for high time resolution trigger peak detection anti-coincidence check freeze raw antenna data buffer max 1.3s latency! 60
61 UHEP Pipeline create ~50 beams on moon inverse filter for high time resolution trigger peak detection anti-coincidence check freeze raw antenna data buffer max 1.3s latency! 61
62 UHEP Performance Breakdown Tesla K10 FirePro S Tesla K10 FirePro S GB/s TFLOP/s Beam Forming Transpose Inverse FIR Inverse FFT Trigger 0 Beam Forming Transpose Inverse FIR Inverse FFT 48 stations, 64 beams 62 Trigger
63 UHEP Performance Breakdown Input (Beam Former Weights) Input (Samples)? Trigger Inv. FFT Inv. FIR Transpose Beam Forming 60 ms ms data, 48 stations, 488 subbands most time spent in beam former I/O overlap latency 100 ms 0 Tesla K10 FirePro S
64 A Wild Idea 64
65 OpenCL on Top of CUDA Driver API fool CUDA RTS CPU: implement our own platform (ICD) OpenCL library calls CUDA Driver API Calls limited subset (proof of concept) GPU: use OpenCL PTX compiler (clc/clang/llvm) efficient does not support full language can use: visual profiler cufft, GPUdirect 65
66 OpenCL on Top of CUDA Driver API 66
67 Future Work 67
68 To Do (LOFAR Correlator) GPU kernels: dedispersion, flagging pulsar pipelines CPU code: work distribution network reordering (FDR IB) >240 Gb/s optimizations monitoring & control... 68
69 To Do (SKA Research) Xeon Phi OpenCL on FPGA energy efficiency fully understand all results 69
70 Conclusions many signal-processing GPU kernels new LOFAR correlator research for SKA OpenCL: vendor independent, elegant, but poor support NVIDIA FirePro S10000 faster then Tesla K10, but immature driver high efficiency on most important kernels 70
71 Thanks support: Intel, NVIDIA grants from Dutch national & province governments LOFAR GPU Correlator team: Alexander van Amesfoort, Wouter Klijn, Marcel Loose, Jan David Mol 71
Processing Real-Time LOFAR Telescope Data on a Blue Gene/P
Processing Real-Time LOFAR Telescope Data on a Blue Gene/P John W. Romein Stichting ASTRON (Netherlands Institute for Radio Astronomy) Dwingeloo, the Netherlands 1 LOw Frequency ARray radio telescope 10
More informationescience: Pulsar searching on GPUs
escience: Pulsar searching on GPUs Alessio Sclocco Ana Lucia Varbanescu Karel van der Veldt John Romein Joeri van Leeuwen Jason Hessels Rob van Nieuwpoort And many others! Netherlands escience center Science
More informationImage-Domain Gridding on Accelerators
Netherlands Institute for Radio Astronomy Image-Domain Gridding on Accelerators Bram Veenboer Monday 26th March, 2018, GPU Technology Conference 2018, San Jose, USA ASTRON is part of the Netherlands Organisation
More informationGPU based imager for radio astronomy
GPU based imager for radio astronomy GTC2014, San Jose, March 27th 2014 S. Bhatnagar, P. K. Gupta, M. Clark, National Radio Astronomy Observatory, NM, USA NVIDIA-India, Pune NVIDIA-US, CA Introduction
More informationLOFAR Data Products. First LOFAR Data Processing School 10 February Michael Wise
LOFAR Data Products First LOFAR Data Processing School 10 February 2009 Michael Wise MAC and Input section Aux. processing section system processing Input section Aux. processing section system processing
More informationA Polyphase Filter for GPUs and Multi-Core Processors
A Polyphase Filter for GPUs and Multi-Core Processors Karel van der Veldt Universiteit van Amsterdam The Netherlands karel.vd.veldt@uva.nl Ana Lucia Varbanescu Technische Universiteit Delft The Netherlands
More informationAn FPGA-Based Back End for Real Time, Multi-Beam Transient Searches Over a Wide Dispersion Measure Range
An FPGA-Based Back End for Real Time, Multi-Beam Transient Searches Over a Wide Dispersion Measure Range Larry D'Addario 1, Nathan Clarke 2, Robert Navarro 1, and Joseph Trinh 1 1 Jet Propulsion Laboratory,
More informationarxiv: v1 [astro-ph.im] 1 Sep 2015
Experimental Astronomy manuscript No. (will be inserted by the editor) A Real-time Coherent Dedispersion Pipeline for the Giant Metrewave Radio Telescope Kishalay De Yashwant Gupta arxiv:1509.00186v1 [astro-ph.im]
More informationarxiv: v1 [astro-ph.im] 7 Dec 2010
arxiv:1012.1583v1 [astro-ph.im] 7 Dec 2010 University of Amsterdam (UvA), Amsterdam, The Netherlands E-mail: a.alexov@uva.nl Jason W. T. Hessels Netherlands Institute for Radio Astronomy (ASTRON), Dwingeloo,
More informationSoftware Correlators for Dish and Sparse Aperture Arrays of the SKA Phase I
Software Correlators for Dsh and Sparse Aperture Arrays of the SKA Phase I Jongsoo Km Korea Astronomy and Space Scence Insttute Collaborators: Paul Alexander (Unv. of Cambrdge) Andrew Faulkner (Unv. of
More informationA Scalable Computer Architecture for
A Scalable Computer Architecture for On-line Pulsar Search on the SKA - Draft Version - G. Knittel, A. Horneffer MPI for Radio Astronomy Bonn with help from: M. Kramer, B. Klein, R. Eatough GPU-Based Pulsar
More informationCUDA-Accelerated Satellite Communication Demodulation
CUDA-Accelerated Satellite Communication Demodulation Renliang Zhao, Ying Liu, Liheng Jian, Zhongya Wang School of Computer and Control University of Chinese Academy of Sciences Outline Motivation Related
More informationGPU-based data analysis for Synthetic Aperture Microwave Imaging
GPU-based data analysis for Synthetic Aperture Microwave Imaging 1 st IAEA Technical Meeting on Fusion Data Processing, Validation and Analysis 1 st -3 rd June 2015 J.C. Chorley 1, K.J. Brunner 1, N.A.
More informationThe LOFAR Known Pulsar Data Pipeline Alexov, A.; Hessels, J.W.T.; Mol, J.D.; Stappers, B.W.; van Leeuwen, A.G.J.
UvA-DARE (Digital Academic Repository) The LOFAR Known Pulsar Data Pipeline Alexov, A.; Hessels, J.W.T.; Mol, J.D.; Stappers, B.W.; van Leeuwen, A.G.J. Published in: Proceedings of science Link to publication
More informationSKA NON IMAGING PROCESSING CONCEPT DESCRIPTION: GPU PROCESSING FOR REAL TIME ISOLATED RADIO PULSE DETECTION
SKA NON IMAGING PROCESSING CONCEPT DESCRIPTION: GPU PROCESSING FOR REAL TIME ISOLATED RADIO PULSE DETECTION Document number... WP2 040.130.010 TD 001 Revision... 1 Author... Aris Karastergiou Date... 2011
More informationFocal Plane Array Beamformer for the Expanded GMRT: Initial
Focal Plane Array Beamformer for the Expanded GMRT: Initial Implementation on ROACH Kaushal D. Buch Digital Backend Group, Giant Metrewave Radio Telescope, NCRA-TIFR, Pune, India kdbuch@gmrt.ncra.tifr.res.in
More informationDocument downloaded from:
Document downloaded from: http://hdl.handle.net/1251/64738 This paper must be cited as: Reaño González, C.; Pérez López, F.; Silla Jiménez, F. (215). On the design of a demo for exhibiting rcuda. 15th
More informationCosmic Rays with LOFAR
Cosmic Rays with LOFAR Andreas Horneffer for the LOFAR-CR Team Cosmic Rays High energy particles Dominated by hadrons (atomic nuclei) Similar in composition to solar system Broad range in flux and energy
More informationGPU-accelerated track reconstruction in the ALICE High Level Trigger
GPU-accelerated track reconstruction in the ALICE High Level Trigger David Rohr for the ALICE Collaboration Frankfurt Institute for Advanced Studies CHEP 2016, San Francisco ALICE at the LHC The Large
More informationComputational Efficiency of the GF and the RMF Transforms for Quaternary Logic Functions on CPUs and GPUs
5 th International Conference on Logic and Application LAP 2016 Dubrovnik, Croatia, September 19-23, 2016 Computational Efficiency of the GF and the RMF Transforms for Quaternary Logic Functions on CPUs
More informationCasper Instrumentation at Green Bank
Casper Instrumentation at Green Bank John Ford September 28, 2009 The NRAO is operated for the National Science Foundation (NSF) by Associated Universities, Inc. (AUI), under a cooperative agreement. GBT
More informationSmart Antennas in Radio Astronomy
Smart Antennas in Radio Astronomy Wim van Cappellen cappellen@astron.nl Netherlands Institute for Radio Astronomy Our mission is to make radio-astronomical discoveries happen ASTRON is an institute for
More informationData acquisition and Trigger (with emphasis on LHC)
Lecture 2 Data acquisition and Trigger (with emphasis on LHC) Introduction Data handling requirements for LHC Design issues: Architectures Front-end, event selection levels Trigger Future evolutions Conclusion
More informationThe SKA New Instrumentation: Aperture Arrays
The SKA New Instrumentation: Aperture Arrays A. van Ardenne, A.J. Faulkner, and J.G. bij de Vaate Abstract The radio frequency window of the Square Kilometre Array is planned to cover the wavelength regime
More informationSOFTWARE CORRELATOR CONCEPT DESCRIPTION
SOFTWARE CORRELATOR CONCEPT DESCRIPTION Document number... WP2 040.040.010 TD 002 Revision... 1 Author... Dominic Ford Date... 2011 03 29 Status... Approved for release Name Designation Affiliation Date
More informationHIGH PERFORMANCE COMPUTING USING GPGPU FOR RADAR APPLICATIONS
HIGH PERFORMANCE COMPUTING USING GPGPU FOR RADAR APPLICATIONS Viswam Gampala 1 (visgam@yahoo.co.in), Akshay BM 1, A Vengadarajan 1, PS Avadhani 2 1. Electronics & Radar Development Establishment, DRDO,
More informationGPU-accelerated SDR Implementation of Multi-User Detector for Satellite Return Links
DLR.de Chart 1 GPU-accelerated SDR Implementation of Multi-User Detector for Satellite Return Links Chen Tang chen.tang@dlr.de Institute of Communication and Navigation German Aerospace Center DLR.de Chart
More informationThe SKA LOW correlator design challenges
The SKA LOW correlator design challenges John Bunton CSP System Engineer C4SKA, Auckland, 9-10 February, 2017 CSIRO ASTRONOMY AND SPACE SCIENCE SKA1 Low antenna station (Australia) Station beamforming
More informationSoftware Spectrometer for an ASTE Multi-beam Receiver. Jongsoo Kim Korea Astronomy and Space Science Institute
Software Spectrometer for an ASTE Multi-beam Receiver Jongsoo Kim Korea Astronomy and Space Science Institute Design Consideration software spectrometer for a near future ASTE multi-beam receiver spectrometer
More informationExascale computer system design : the square kilometre array Jongerius, R.
Exascale computer system design : the square kilometre array Jongerius, R. Published: 20/09/2016 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and volume
More informationOverview of the SKA. P. Dewdney International SKA Project Engineer Nov 9, 2009
Overview of the SKA P. Dewdney International SKA Project Engineer Nov 9, 2009 Outline* 1. SKA Science Drivers. 2. The SKA System. 3. SKA technologies. 4. Trade-off space. 5. Scaling. 6. Data Rates & Data
More informationSKA1 low Baseline Design: Lowest Frequency Aspects & EoR Science
SKA1 low Baseline Design: Lowest Frequency Aspects & EoR Science 1 st science Assessment WS, Jodrell Bank P. Dewdney Mar 27, 2013 Intent of the Baseline Design Basic architecture: 3-telescope, 2-system
More informationPhased Array Feeds & Primary Beams
Phased Array Feeds & Primary Beams Aidan Hotan ASKAP Deputy Project Scientist 3 rd October 2014 CSIRO ASTRONOMY AND SPACE SCIENCE Outline Review of parabolic (dish) antennas. Focal plane response to a
More informationPhased Array Feeds A new technology for wide-field radio astronomy
Phased Array Feeds A new technology for wide-field radio astronomy Aidan Hotan ASKAP Project Scientist 29 th September 2017 CSIRO ASTRONOMY AND SPACE SCIENCE Outline Review of radio astronomy concepts
More informationSome Notes on Beamforming.
The Medicina IRA-SKA Engineering Group Some Notes on Beamforming. S. Montebugnoli, G. Bianchi, A. Cattani, F. Ghelfi, A. Maccaferri, F. Perini. IRA N. 353/04 1) Introduction: consideration on beamforming
More informationThe Australian SKA Pathfinder Project. ASKAP Digital Signal Processing Systems System Description & Overview of Industry Opportunities
The Australian SKA Pathfinder Project ASKAP Digital Signal Processing Systems System Description & Overview of Industry Opportunities This paper describes the delivery of the digital signal processing
More informationSupporting x86-64 Address Translation for 100s of GPU Lanes. Jason Power, Mark D. Hill, David A. Wood
Supporting x86-64 Address Translation for 100s of GPU s Jason Power, Mark D. Hill, David A. Wood Summary Challenges: CPU&GPUs physically integrated, but logically separate; This reduces theoretical bandwidth,
More informationSynthetic Aperture Beamformation using the GPU
Paper presented at the IEEE International Ultrasonics Symposium, Orlando, Florida, 211: Synthetic Aperture Beamformation using the GPU Jens Munk Hansen, Dana Schaa and Jørgen Arendt Jensen Center for Fast
More informationMulti-core Platforms for
20 JUNE 2011 Multi-core Platforms for Immersive-Audio Applications Course: Advanced Computer Architectures Teacher: Prof. Cristina Silvano Student: Silvio La Blasca 771338 Introduction on Immersive-Audio
More informationLOFAR: Special Issues
Netherlands Institute for Radio Astronomy LOFAR: Special Issues John McKean (ASTRON) ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) 1 Preamble http://www.astron.nl/~mckean/eris-2011-2.pdf
More informationLOFAR DATA SCHOOL 2016
LOFAR DATA SCHOOL 2016 Tied Array Imaging (II), with contributions from: RRL group Scintillation (R. Fallows) Pulsar Working Group Radio Observatory Outline Tools Calibration (Cyg A imaging) Beams Scientific
More informationMarch Phased Array Technology. Andrew Faulkner
Aperture Arrays Michael Kramer Sparse Type of AA selection 1000 Sparse AA-low Sky Brightness Temperature (K) 100 10 T sky A eff Fully sampled AA-mid Becoming sparse Aeff / T sys (m 2 / K) Dense A eff /T
More informationOverview of Survey KSP meeting Leiden March 2010
Netherlands Institute for Radio Astronomy Overview of Survey KSP meeting Leiden March 2010 George Heald LSM 20100317 ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) 1 Topics
More informationReal-time Pulsar Timing signal processing on GPUs
Real-Time Pulsar Timing Signal Processing on GPUs Plan : Pulsar Timing Instrumentations LPC2E, CNRS Orléans - FRANCE Ismaël Cognard, Gilles Theureau, Grégory Desvignes, Cédric Viou, Dalal Ait-Allal Pulsars
More informationComputational Simulations of The World s Biggest Eye on GPUs
Computational Simulations of The World s Biggest Eye on GPUs Hatem Ltaief Extreme Computing Research Center King Abdullah University of Science and Technology, Saudi Arabia NVIDIA GTC at San Jose, CA April
More informationPhased Array Feeds A new technology for multi-beam radio astronomy
Phased Array Feeds A new technology for multi-beam radio astronomy Aidan Hotan ASKAP Deputy Project Scientist 2 nd October 2015 CSIRO ASTRONOMY AND SPACE SCIENCE Outline Review of radio astronomy concepts.
More informationBen Baker. Sponsored by:
Ben Baker Sponsored by: Background Agenda GPU Computing Digital Image Processing at FamilySearch Potential GPU based solutions Performance Testing Results Conclusions and Future Work 2 CPU vs. GPU Architecture
More informationNovember SKA Low Frequency Aperture Array. Andrew Faulkner
SKA Phase 1 Implementation Southern Africa Australia SKA 1 -mid 250 15m dia. Dishes 0.4-3GHz SKA 1 -low 256,000 antennas Aperture Array Stations 50 350/650MHz SKA 1 -survey 90 15m dia. Dishes 0.7-1.7GHz
More informationMonte Carlo integration and event generation on GPU and their application to particle physics
Monte Carlo integration and event generation on GPU and their application to particle physics Junichi Kanzaki (KEK) GPU2016 @ Rome, Italy Sep. 26, 2016 Motivation Increase of amount of LHC data (raw &
More informationReal-Time RFI Mitigation for Single-Dish Radio Telescopes. Richard Prestage, GBO
Real-Time RFI Mitigation for Single-Dish Radio Telescopes Richard Prestage, GBO Collaborators Cedric Viou, Jessica Masson Station de radioastronomie de Nançay Observatoire de Paris, PSL Research University,
More informationIHV means Independent Hardware Vendor. Example is Qualcomm Technologies Inc. that makes Snapdragon processors. OEM means Original Equipment
1 2 IHV means Independent Hardware Vendor. Example is Qualcomm Technologies Inc. that makes Snapdragon processors. OEM means Original Equipment Manufacturer. Examples are smartphone manufacturers. Tuning
More informationHigh Performance Imaging Using Large Camera Arrays
High Performance Imaging Using Large Camera Arrays Presentation of the original paper by Bennett Wilburn, Neel Joshi, Vaibhav Vaish, Eino-Ville Talvala, Emilio Antunez, Adam Barth, Andrew Adams, Mark Horowitz,
More informationData processing with the RTS A GPU-accelerated calibration & imaging stream processor
Data processing with the RTS A GPU-accelerated calibration & imaging stream processor Daniel Mitchell 2018 ICRAR/CASS Radio School CSIRO ASTRONOMY AND SPACE SCIENCE The RTS (Real-Time System) A GPU-accelerated
More informationMulti-octave radio frequency systems: Developments of antenna technology in radio astronomy and imaging systems
Multi-octave radio frequency systems: Developments of antenna technology in radio astronomy and imaging systems Professor Tony Brown School of Electrical and Electronic Engineering University of Manchester
More informationStatus of LOFAR. Ronald Nijboer (ASTRON) On behalf of the LOFAR team
Status of LOFAR Ronald Nijboer (ASTRON) On behalf of the LOFAR team ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) -1- LOFAR: LOw Frequency ARray LBA: 10/30 80 MHz; HBA: 120
More informationTrack and Vertex Reconstruction on GPUs for the Mu3e Experiment
Track and Vertex Reconstruction on GPUs for the Mu3e Experiment Dorothea vom Bruch for the Mu3e Collaboration GPU Computing in High Energy Physics, Pisa September 11th, 2014 Physikalisches Institut Heidelberg
More informationMay AA Communications. Portugal
SKA Top-level description A large radio telescope for transformational science Up to 1 million m 2 collecting area Operating from 70 MHz to 10 GHz (4m-3cm) Two or more detector technologies Connected to
More informationATA memo 73. The ATA Correlator. W.L. Urry, M. Wright, M. Dexter, D. MacMahon. 16 Feb Abstract
ATA memo 73 The ATA Correlator W.L. Urry, M. Wright, M. Dexter, D. MacMahon 16 Feb. 2007 Abstract This memo describes the current status of the ATA correlator. The correlator is designed for 352 antennas,
More informationHow different FPGA firmware options enable digitizer platforms to address and facilitate multiple applications
How different FPGA firmware options enable digitizer platforms to address and facilitate multiple applications 1 st of April 2019 Marc.Stackler@Teledyne.com March 19 1 Digitizer definition and application
More informationTowards SKA Multi-beam concepts and technology
Towards SKA Multi-beam concepts and technology SKA meeting Meudon Observatory, 16 June 2009 Philippe Picard Station de Radioastronomie de Nançay philippe.picard@obs-nancay.fr 1 Square Kilometre Array:
More information2-PAD: An Introduction. The 2-PAD Team
2-PAD: An Introduction The 2-PAD Team Workshop, Jodrell Bank, 10 Presented th November 2009 by 2-PAD: Dr An Georgina Introduction Harris Georgina Harris for the 2-PAD Team 1 2-PAD Objectives Demonstrate
More informationApplying full polarization A-Projection to very-wide fields of view instruments: An imager for LOFAR Cyril Tasse
Applying full polarization A-Projection to very-wide fields of view instruments: An imager for LOFAR Cyril Tasse ASTRON/Leiden: Joris van Zwieten, Bas van der Tol, Ger van Diepen NRAO: Sanjay Bhatnagar
More informationThe LOFAR Telescope: System Architecture and Signal Processing
The LOFAR Telescope: System Architecture and Signal Processing M. de Vos, A.W. Gunst, R. Nijboer ASTRON, P.O Box 2, 7990 AA Dwingeloo, The Netherlands Abstract The Low Frequency Array (LOFAR) is a large
More informationCP2K PERFORMANCE FROM CRAY XT3 TO XC30. Iain Bethune Fiona Reid Alfio Lazzaro
CP2K PERFORMANCE FROM CRAY XT3 TO XC30 Iain Bethune (ibethune@epcc.ed.ac.uk) Fiona Reid Alfio Lazzaro Outline CP2K Overview Features Parallel Algorithms Cray HPC Systems Trends Water Benchmarks 2005 2013
More informationPulsar polarimetry. with. Charlotte Sobey. Dr. Aris Noutsos & Prof. Michael Kramer
Pulsar polarimetry with Dr. Aris Noutsos & Prof. Michael Kramer Outline Introduction Observations Ionosphere Outline Pulsars as objects Pulsars as probes of the ISM Faraday rotation using RM synthesis
More informationDetection of Radio Pulses from Air Showers with LOPES
Detection of Radio Pulses from Air Showers with LOPES Andreas Horneffer for the LOPES Collaboration Radboud University Nijmegen Radio Emission from Air Showers air showers are known since 1965 to emit
More informationData acquisi*on and Trigger - Trigger -
Experimental Methods in Par3cle Physics (HS 2014) Data acquisi*on and Trigger - Trigger - Lea Caminada lea.caminada@physik.uzh.ch 1 Interlude: LHC opera3on Data rates at LHC Trigger overview Coincidence
More informationASKAP Industry technical briefing. Tim Cornwell, ASKAP Computing Project Lead Australian Square Kilometre Array Pathfinder
! ASKAP Industry technical briefing Tim Cornwell, ASKAP Computing Project Lead Australian Square Kilometre Array Pathfinder The Square Kilometre Array 2020 era radio telescope Very large collecting area
More informationChallenges in Transition
Challenges in Transition Keynote talk at International Workshop on Software Engineering Methods for Parallel and High Performance Applications (SEM4HPC 2016) 1 Kazuaki Ishizaki IBM Research Tokyo kiszk@acm.org
More informationDense Aperture Array for SKA
Dense Aperture Array for SKA Steve Torchinsky EMBRACE Why a Square Kilometre? Detection of HI in emission at cosmological distances R. Ekers, SKA Memo #4, 2001 P. Wilkinson, 1991 J. Heidmann, 1966! SKA
More informationComponents of Imaging at Low Frequencies: Status & Challenges
Components of Imaging at Low Frequencies: Status & Challenges Dec. 12th 2013 S. Bhatnagar NRAO Collaborators: T.J. Cornwell, R. Nityananda, K. Golap, U. Rau J. Uson, R. Perley, F. Owen Telescope sensitivity
More informationImproving GPU Performance via Large Warps and Two-Level Warp Scheduling
Improving GPU Performance via Large Warps and Two-Level Warp Scheduling Veynu Narasiman The University of Texas at Austin Michael Shebanow NVIDIA Chang Joo Lee Intel Rustam Miftakhutdinov The University
More informationThe AAMID consortium: Mid Frequency Aperture Array
The consortium: Mid Frequency Aperture Array Wim van Cappellen, Consortium Lead Livingstone curves Brought to our attention by Ron Ekers Technological capability leads to discovery in astronomy A single
More informationLOFAR: From raw visibilities to calibrated data
Netherlands Institute for Radio Astronomy LOFAR: From raw visibilities to calibrated data John McKean (ASTRON) [subbing in for Manu] ASTRON is part of the Netherlands Organisation for Scientific Research
More informationTHE LOW-FREQUENCY ARRAY (LOFAR) is a new antenna array that observes the sky. The LOFAR radio environment. Chapter 5
Chapter 5 The LOFAR radio environment Based on: The LOFAR radio environment (Offringa et al., in preparation) THE LOW-FREQUENCY ARRAY (LOFAR) is a new antenna array that observes the sky from 1 9 and 11
More informationTHE purpose of beamforming is to precisely align the
1 Beamforming Techniques for Large-N Aperture Arrays K. Zarb-Adami, A. Faulkner, J.G. Bij de Vaate, G.W. Kant and P.Picard arxiv:1008.4047v1 [astro-ph.im] 24 Aug 2010 Abstract Beamforming is central to
More informationMemo 65 SKA Signal processing costs
Memo 65 SKA Signal processing costs John Bunton, CSIRO ICT Centre 12/08/05 www.skatelescope.org/pages/page_memos.htm Introduction The delay in the building of the SKA has a significant impact on the signal
More informationSKA Phase 1: Costs of Computation. Duncan Hall CALIM 2010
SKA Phase 1: Costs of Computation Duncan Hall CALIM 2010 2010 August 24, 27 Outline Motivation Phase 1 in a nutshell Benchmark from 2001 [EVLA Memo 24] Some questions Amdahl s law overrides Moore s law!
More informationHardware-accelerated CCD readout smear correction for Fast Solar Polarimeter
Welcome Hardware-accelerated CCD readout smear correction for Fast Solar Polarimeter Stefan Tabel and Korbinian Weikl Semiconductor Laboratory of the Max Planck Society, Munich, Germany Walter Stechele
More informationHigh Performance Computing for Engineers
High Performance Computing for Engineers David Thomas dt10@ic.ac.uk / https://github.com/m8pple Room 903 http://cas.ee.ic.ac.uk/people/dt10/teaching/2014/hpce HPCE / dt10/ 2015 / 0.1 High Performance Computing
More informationCORRECTED VISION. Here be underscores THE ROLE OF CAMERA AND LENS PARAMETERS IN REAL-WORLD MEASUREMENT
Here be underscores CORRECTED VISION THE ROLE OF CAMERA AND LENS PARAMETERS IN REAL-WORLD MEASUREMENT JOSEPH HOWSE, NUMMIST MEDIA CIG-GANS WORKSHOP: 3-D COLLECTION, ANALYSIS AND VISUALIZATION LAWRENCETOWN,
More informationGreen Bank Instrumentation circa 2030
Green Bank Instrumentation circa 2030 Dan Werthimer and 800 CASPER Collaborators http://casper.berkeley.edu Upcoming Nobel Prizes with Radio Instrumentation Gravitational Wave Detection (pulsar timing)
More informationPhased Array Feeds for the SKA. WP2.2.3 PAFSKA Consortium CSIRO ASTRON DRAO NRAO BYU OdP Nancay Cornell U Manchester
Phased Array Feeds for the SKA WP2.2.3 PAFSKA Consortium CSIRO ASTRON DRAO NRAO BYU OdP Nancay Cornell U Manchester Dish Array Hierarchy Dish Array L5 Elements PAF Dish Single Pixel Feeds L4 Sub systems
More informationLOFAR: Lessons Learnt
LOFAR: Lessons Learnt Michiel van Haarlem van Weeren, Bonafede, Ferrari, Orrù, Pizzo, Shulevski, van der Tol, Macario Jason Hessels & Pulsar Team LOFAR 40 stations in NL and 8 stations throughout Europe
More informationASKAP Phased Array Feed Digital Beamformer Design Overview and Performance Characteristics
ASKAP Phased Array Feed Digital Beamformer Design Overview and Performance Characteristics John Tuthill, Tim Bateman, Grant Hampson, John Bunton, Andrew Brown, Daniel George, Mia Baquiran August 2016 CASS
More informationHardware Software Science Co-design in the Human Brain Project
Hardware Software Science Co-design in the Human Brain Project Wouter Klijn 29-11-2016 Pune, India 1 Content The Human Brain Project Hardware - HBP Pilot machines Software - A Neuron - NestMC: NEST Multi
More informationImage Processing Architectures (and their future requirements)
Lecture 17: Image Processing Architectures (and their future requirements) Visual Computing Systems Smart phone processing resources Qualcomm snapdragon Image credit: Qualcomm Apple A7 (iphone 5s) Chipworks
More informationNRC Herzberg Astronomy & Astrophysics
NRC Herzberg Astronomy & Astrophysics SKA Pre-Construction Update Séverin Gaudet, Canadian Astronomy Data Centre David Loop, Director Astronomy Technology June 2016 update SKA Pre-Construction NRC Involvement
More informationTime-Frequency System Builds and Timing Strategy Research of VHF Band Antenna Array
Journal of Computer and Communications, 2016, 4, 116-125 Published Online March 2016 in SciRes. http://www.scirp.org/journal/jcc http://dx.doi.org/10.4236/jcc.2016.43018 Time-Frequency System Builds and
More informationREVOLUTIONIZING THE COMPUTING LANDSCAPE AND BEYOND.
December 3-6, 2018 Santa Clara Convention Center CA, USA REVOLUTIONIZING THE COMPUTING LANDSCAPE AND BEYOND. https://tmt.knect365.com/risc-v-summit @risc_v ACCELERATING INFERENCING ON THE EDGE WITH RISC-V
More informationCorrelator Development at Haystack. Roger Cappallo Haystack-NRAO Technical Mtg
Correlator Development at Haystack Roger Cappallo Haystack-NRAO Technical Mtg. 2006.10.26 History of Correlator Development at Haystack ~1973 Mk I 360 Kb/s x 2 stns. 1981 Mk III 112 Mb/s x 4 stns. 1986
More informationA report on KAT7 and MeerKAT status and plans
A report on KAT7 and MeerKAT status and plans SKA SA, Cape Town Office 3rd Floor, The Park, Park Road, Pinelands, Cape Town, South Africa E mail: tony@hartrao.ac.za This is a short memo on the current
More informationNew Paradigm in Testing Heads & Media for HDD. Dr. Lutz Henckels September 2010
New Paradigm in Testing Heads & Media for HDD Dr. Lutz Henckels September 2010 1 WOW an amazing industry 40%+ per year aerial density growth Source: Coughlin Associates 2010 2 WOW an amazing industry Aerial
More informationNumerical Approach for the Analysis and Optimization of Phased Array Feed Systems
Numerical Approach for the Analysis and Optimization of Phased Array Feed Systems The Netherlands Institute for Radio Astronomy (ASTRON) Supported by part: - The Netherlands Organization for Scientific
More informationA Multi-Fielding SKA Covering the Range 100 MHz 22 GHz. Peter Hall and Aaron Chippendale, CSIRO ATNF 24 November 2003
A Multi-Fielding SKA Covering the Range 100 MHz 22 GHz Peter Hall and Aaron Chippendale, CSIRO ATNF 24 November 2003 1. Background Various analyses, including the recent IEMT report [1], have noted that
More informationTechnology Drivers, SKA Pathfinders P. Dewdney
Technology Drivers, SKA Pathfinders P. Dewdney Dominion Radio Astrophysical Observatory Herzberg Institute of Astrophysics National Research Council Canada National Research Council Canada Conseil national
More informationASTRON/LOFAR Reproduction in whole or in part is prohibited without written consent of the 1au
The Data Explosion in Radio-Astronomy Virtual Instruments and E-LOFAR Marco de Vos ASTRON Director of R&D (devos@astron.nl) Drenthe-light Early history and near future Start of radio-astronomy: Grote Reber,
More informationVideo Enhancement Algorithms on System on Chip
International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012 1 Video Enhancement Algorithms on System on Chip Dr.Ch. Ravikumar, Dr. S.K. Srivatsa Abstract- This paper presents
More information2002 IEEE International Solid-State Circuits Conference 2002 IEEE
Outline 802.11a Overview Medium Access Control Design Baseband Transmitter Design Baseband Receiver Design Chip Details What is 802.11a? IEEE standard approved in September, 1999 12 20MHz channels at 5.15-5.35
More informationReal-Time Software Receiver Using Massively Parallel
Real-Time Software Receiver Using Massively Parallel Processors for GPS Adaptive Antenna Array Processing Jiwon Seo, David De Lorenzo, Sherman Lo, Per Enge, Stanford University Yu-Hsuan Chen, National
More information