escience: Pulsar searching on GPUs

Size: px
Start display at page:

Download "escience: Pulsar searching on GPUs"

Transcription

1 escience: Pulsar searching on GPUs Alessio Sclocco Ana Lucia Varbanescu Karel van der Veldt John Romein Joeri van Leeuwen Jason Hessels Rob van Nieuwpoort And many others!

2 Netherlands escience center Science itself is changing We need to change with it escience = Enhanced science Apply ICT in the broadest sense Data-driven research across all scientific disciplines Develop generic escience tools Collaboration between disciplines Coordinate escience approaches Contribute to scientific breakthroughs

3 Powered by SURF and NWO Also a funding organization Currently Big Data call escience integrators Top researchers in a variety of disciplines Form bridge between domain and escience escience engineers The innovators that do the real work

4 Priority areas 1. Chemistry & Materials 2. Life Sciences (Green Genetics, Life Sciences & Translational Research, Foods and Cogntion/Neuroscience) 3. escience Methodology & Big Data (Astronomy) 4. Humanities & Social Sciences 5. Sustainability & Environment (Climate, Water management, Energy and Ecology)

5 Big Data Many novel instruments and applications produce lots of data LHC, telescopes, climate simulations, genetics, medical scanners, facebook, twitter, These applications cannot work without computer science anymore A lot of unexploited knowledge that we can only find if the data across disciplines is accessible and usable Challenges: data handling, processing Need large-scale parallelism

6 Searching for pulsars Rotating neutron starts Discovered in 1967 About 2000 are known Big mass, precise period Lab in the sky Probe space and gravitation Investigate general relativity Work done by Alessio Sclocco

7 Pulsar pipeline Real-Time Central Processing Poly-Phase Filter RFI mitigation Beam Former Dedispersion Folding Signal-to-Noise Pulsar Pipeline

8 Portability and Flexibility Many different instruments Software telescopes Life time of an instrument is much longer than life time of compute hardware Big Data, Big Compute and heterogeneity DAS-4 is ideal research platform

9 OpenCL: The Khronos group

10 OpenCL: Open Compute Language Architecture independent Explicit support for many-cores Low-level host API Uses C library, no language extensions Separate high-level kernel language Explicit support for vectorization Backend for high-level models: Pieter Hijma

11 Performance portability Many different observation types and parameters Only known at run time E.g. # frequency channels, # stations, longest baseline, filter quality, observation type Use runtime compilation and auto-tuning Map specific problem instance efficiently to hardware Auto tune platform-specific parameters Portability across different platforms / families

12 Pulsar pipeline Real-Time Central Processing Poly-Phase Filter RFI mitigation Beam Former Dedispersion Folding Signal-to-Noise Pulsar Pipeline

13 Beam forming Apply correct delays Combine stations Weighted addition of all stations Alessio Sclocco [IPDPS 2012]

14 Auto-tuning GTX580 OpenCL

15 Auto-tuning GTX580 CUDA

16 Beam forming performance GFLOPS IBM Blue Gene/P Intel Xeon E5620 (OpenMP/SSE) Intel Xeon E5620 (OpenCL) AMD Opteron AMD Opteron (OpenCL) (OpenMP/SSE) AMD HD6970 (OpenCL) NVIDIA GTX580 (CUDA) NVIDIA GTX580 (OpenCL)

17 Beam forming power efficiency GFLOPS / Watt IBM Blue Gene/P Intel Xeon E5620 (OpenMP/SSE) Intel Xeon E5620 (OpenCL) AMD Opteron 6172 (OpenMP/SSE) AMD Opteron 6172 (OpenCL) AMD HD6970 (OpenCL) NVIDIA GTX580 (CUDA) NVIDIA GTX580 (OpenCL)

18 Pulsar pipeline Real-Time Central Processing Poly-Phase Filter RFI mitigation Beam Former Dedispersion Folding Signal-to-Noise Pulsar Pipeline

19 Real-time RFI mitigation Some pipelines run in real time pulsar searching RFI is much stronger than pulsars Beam forming takes union of RFI of all receivers

20 Real-time RFI mitigation challenges Limited amount of samples Only 1 second, no data from the future, only statistics from the past Fewer frequencies Only one frequency band per node Each second, a node gets a different frequency Real time We can afford only few operations per byte

21 RFI mitigation

22 Pulsar pipeline Real-Time Central Processing Poly-Phase Filter RFI mitigation Beam Former Dedispersion Folding Signal-to-Noise Pulsar Pipeline

23 Pulsar pipeline DM Three unknowns: Location: create many beams on the sky Dispersion: measure of distance of pulsar Period Brute force search across all parameters Everything is trivially parallel (or is it?) Our approach: keep everything in the GPU period

24 Dedispersion Frequencies have different delays through ISM Dispersion measure (DM) Brute-force search Output has no channels anymore

25 Data reuse

26 Performance on GPUs

27 Period search: Folding Stream of samples Period 8: Period 4:

28 Period search: Folding Build a tree of periods to maximize reuse Each thread block computes a path from leaf to root

29 Signal-to-noise SNR = (max mean) / RMS Computed on folded data Run time negligible compared to dedispersion and folding Output a list of candidates Reduced terabytes to only 15 numbers

30 Pulse profile Signal power time

31 Conclusions Cuda / OpenCL performance gap gone Portable: OpenCL and auto-tuning Entire pipeline on GPUs No PCI-e bottleneck moving data back and forth Trivially parallel? Yes, but not trivial! It s all about the memory escience approach works! Requires commitment from both sides

Signal Processing on GPUs for Radio Telescopes

Signal Processing on GPUs for Radio Telescopes Signal Processing on GPUs for Radio Telescopes John W. Romein Netherlands Institute for Radio Astronomy (ASTRON) Dwingeloo, the Netherlands 1 Overview radio telescopes motivation processing pipelines signal-processing

More information

Real-time Pulsar Timing signal processing on GPUs

Real-time Pulsar Timing signal processing on GPUs Real-Time Pulsar Timing Signal Processing on GPUs Plan : Pulsar Timing Instrumentations LPC2E, CNRS Orléans - FRANCE Ismaël Cognard, Gilles Theureau, Grégory Desvignes, Cédric Viou, Dalal Ait-Allal Pulsars

More information

A Polyphase Filter for GPUs and Multi-Core Processors

A Polyphase Filter for GPUs and Multi-Core Processors A Polyphase Filter for GPUs and Multi-Core Processors Karel van der Veldt Universiteit van Amsterdam The Netherlands karel.vd.veldt@uva.nl Ana Lucia Varbanescu Technische Universiteit Delft The Netherlands

More information

Computational Efficiency of the GF and the RMF Transforms for Quaternary Logic Functions on CPUs and GPUs

Computational Efficiency of the GF and the RMF Transforms for Quaternary Logic Functions on CPUs and GPUs 5 th International Conference on Logic and Application LAP 2016 Dubrovnik, Croatia, September 19-23, 2016 Computational Efficiency of the GF and the RMF Transforms for Quaternary Logic Functions on CPUs

More information

SKA NON IMAGING PROCESSING CONCEPT DESCRIPTION: GPU PROCESSING FOR REAL TIME ISOLATED RADIO PULSE DETECTION

SKA NON IMAGING PROCESSING CONCEPT DESCRIPTION: GPU PROCESSING FOR REAL TIME ISOLATED RADIO PULSE DETECTION SKA NON IMAGING PROCESSING CONCEPT DESCRIPTION: GPU PROCESSING FOR REAL TIME ISOLATED RADIO PULSE DETECTION Document number... WP2 040.130.010 TD 001 Revision... 1 Author... Aris Karastergiou Date... 2011

More information

Processing Real-Time LOFAR Telescope Data on a Blue Gene/P

Processing Real-Time LOFAR Telescope Data on a Blue Gene/P Processing Real-Time LOFAR Telescope Data on a Blue Gene/P John W. Romein Stichting ASTRON (Netherlands Institute for Radio Astronomy) Dwingeloo, the Netherlands 1 LOw Frequency ARray radio telescope 10

More information

A Scalable Computer Architecture for

A Scalable Computer Architecture for A Scalable Computer Architecture for On-line Pulsar Search on the SKA - Draft Version - G. Knittel, A. Horneffer MPI for Radio Astronomy Bonn with help from: M. Kramer, B. Klein, R. Eatough GPU-Based Pulsar

More information

arxiv: v1 [astro-ph.im] 1 Sep 2015

arxiv: v1 [astro-ph.im] 1 Sep 2015 Experimental Astronomy manuscript No. (will be inserted by the editor) A Real-time Coherent Dedispersion Pipeline for the Giant Metrewave Radio Telescope Kishalay De Yashwant Gupta arxiv:1509.00186v1 [astro-ph.im]

More information

LWA Users Meeting Pulsars II: Software and Survey Kevin Stovall

LWA Users Meeting Pulsars II: Software and Survey Kevin Stovall LWA Users Meeting 2012 Pulsars II: Software and Survey Kevin Stovall Pulsar Software - PRESTO - PSRCHIVE - TEMPO/TEMPO2 - writepsrfits.py - Coherent Dedispersion Status Pulsar/Transient Survey - All Sky

More information

Joeri van Leeuwen The dynamic radio sky: Pulsars

Joeri van Leeuwen The dynamic radio sky: Pulsars Joeri van Leeuwen The dynamic radio sky: Pulsars Joeri van Leeuwen The dynamic radio sky: Pulsars Coenen, van Leeuwen et al. 2015 Joeri van Leeuwen The dynamic radio sky: Pulsars Joeri van Leeuwen The

More information

GPU-accelerated track reconstruction in the ALICE High Level Trigger

GPU-accelerated track reconstruction in the ALICE High Level Trigger GPU-accelerated track reconstruction in the ALICE High Level Trigger David Rohr for the ALICE Collaboration Frankfurt Institute for Advanced Studies CHEP 2016, San Francisco ALICE at the LHC The Large

More information

Casper Instrumentation at Green Bank

Casper Instrumentation at Green Bank Casper Instrumentation at Green Bank John Ford September 28, 2009 The NRAO is operated for the National Science Foundation (NSF) by Associated Universities, Inc. (AUI), under a cooperative agreement. GBT

More information

arxiv: v1 [astro-ph.im] 7 Dec 2010

arxiv: v1 [astro-ph.im] 7 Dec 2010 arxiv:1012.1583v1 [astro-ph.im] 7 Dec 2010 University of Amsterdam (UvA), Amsterdam, The Netherlands E-mail: a.alexov@uva.nl Jason W. T. Hessels Netherlands Institute for Radio Astronomy (ASTRON), Dwingeloo,

More information

Scalable Multi-Precision Simulation of Spiking Neural Networks on GPU with OpenCL

Scalable Multi-Precision Simulation of Spiking Neural Networks on GPU with OpenCL Scalable Multi-Precision Simulation of Spiking Neural Networks on GPU with OpenCL Dmitri Yudanov (Advanced Micro Devices, USA) Leon Reznik (Rochester Institute of Technology, USA) WCCI 2012, IJCNN, June

More information

GPU-accelerated SDR Implementation of Multi-User Detector for Satellite Return Links

GPU-accelerated SDR Implementation of Multi-User Detector for Satellite Return Links DLR.de Chart 1 GPU-accelerated SDR Implementation of Multi-User Detector for Satellite Return Links Chen Tang chen.tang@dlr.de Institute of Communication and Navigation German Aerospace Center DLR.de Chart

More information

Real-Time RFI Mitigation for Single-Dish Radio Telescopes. Richard Prestage, GBO

Real-Time RFI Mitigation for Single-Dish Radio Telescopes. Richard Prestage, GBO Real-Time RFI Mitigation for Single-Dish Radio Telescopes Richard Prestage, GBO Collaborators Cedric Viou, Jessica Masson Station de radioastronomie de Nançay Observatoire de Paris, PSL Research University,

More information

Green Bank Instrumentation circa 2030

Green Bank Instrumentation circa 2030 Green Bank Instrumentation circa 2030 Dan Werthimer and 800 CASPER Collaborators http://casper.berkeley.edu Upcoming Nobel Prizes with Radio Instrumentation Gravitational Wave Detection (pulsar timing)

More information

The LOFAR Known Pulsar Data Pipeline Alexov, A.; Hessels, J.W.T.; Mol, J.D.; Stappers, B.W.; van Leeuwen, A.G.J.

The LOFAR Known Pulsar Data Pipeline Alexov, A.; Hessels, J.W.T.; Mol, J.D.; Stappers, B.W.; van Leeuwen, A.G.J. UvA-DARE (Digital Academic Repository) The LOFAR Known Pulsar Data Pipeline Alexov, A.; Hessels, J.W.T.; Mol, J.D.; Stappers, B.W.; van Leeuwen, A.G.J. Published in: Proceedings of science Link to publication

More information

Challenges in Transition

Challenges in Transition Challenges in Transition Keynote talk at International Workshop on Software Engineering Methods for Parallel and High Performance Applications (SEM4HPC 2016) 1 Kazuaki Ishizaki IBM Research Tokyo kiszk@acm.org

More information

PROJECT CHARTER. (DRAFT Version 1.0, 16-AUG-07)

PROJECT CHARTER. (DRAFT Version 1.0, 16-AUG-07) PROJECT CHARTER (DRAFT Version 1.0, 16-AUG-07) PROJECT NAME Next Generation, Common User NRAO Pulsar Backend (AKA Scott Ransom s Dream Pulsar Machine ) PROJECT PERSONNEL Project Sponsor Scott Ransom Project

More information

Perspective platforms for BOINC distributed computing network

Perspective platforms for BOINC distributed computing network Perspective platforms for BOINC distributed computing network Vitalii Koshura Lohika Odessa, Ukraine lestat.de.lionkur@gmail.com Profile page: https://www.linkedin.com/in/aenbleidd/ Abstract This paper

More information

Pulsar polarimetry. with. Charlotte Sobey. Dr. Aris Noutsos & Prof. Michael Kramer

Pulsar polarimetry. with. Charlotte Sobey. Dr. Aris Noutsos & Prof. Michael Kramer Pulsar polarimetry with Dr. Aris Noutsos & Prof. Michael Kramer Outline Introduction Observations Ionosphere Outline Pulsars as objects Pulsars as probes of the ISM Faraday rotation using RM synthesis

More information

Track and Vertex Reconstruction on GPUs for the Mu3e Experiment

Track and Vertex Reconstruction on GPUs for the Mu3e Experiment Track and Vertex Reconstruction on GPUs for the Mu3e Experiment Dorothea vom Bruch for the Mu3e Collaboration GPU Computing in High Energy Physics, Pisa September 11th, 2014 Physikalisches Institut Heidelberg

More information

Programming and Optimization with Intel Xeon Phi Coprocessors. Colfax Developer Training One-day Boot Camp

Programming and Optimization with Intel Xeon Phi Coprocessors. Colfax Developer Training One-day Boot Camp Programming and Optimization with Intel Xeon Phi Coprocessors Colfax Developer Training One-day Boot Camp Abstract: Colfax Developer Training (CDT) is an in-depth intensive course on efficient parallel

More information

NRC Workshop on NASA s Modeling, Simulation, and Information Systems and Processing Technology

NRC Workshop on NASA s Modeling, Simulation, and Information Systems and Processing Technology NRC Workshop on NASA s Modeling, Simulation, and Information Systems and Processing Technology Bronson Messer Director of Science National Center for Computational Sciences & Senior R&D Staff Oak Ridge

More information

Synthetic Aperture Beamformation using the GPU

Synthetic Aperture Beamformation using the GPU Paper presented at the IEEE International Ultrasonics Symposium, Orlando, Florida, 211: Synthetic Aperture Beamformation using the GPU Jens Munk Hansen, Dana Schaa and Jørgen Arendt Jensen Center for Fast

More information

Eyedentify MMR SDK. Technical sheet. Version Eyedea Recognition, s.r.o.

Eyedentify MMR SDK. Technical sheet. Version Eyedea Recognition, s.r.o. Eyedentify MMR SDK Technical sheet Version 2.3.1 010001010111100101100101011001000110010101100001001000000 101001001100101011000110110111101100111011011100110100101 110100011010010110111101101110010001010111100101100101011

More information

Broadening the Scope and Impact of escience. Frank Seinstra. Director escience Program Netherlands escience Center

Broadening the Scope and Impact of escience. Frank Seinstra. Director escience Program Netherlands escience Center Broadening the Scope and Impact of escience Frank Seinstra Director escience Program Netherlands escience Center Big Science & ICT Big Science Today s Scientific Challenges are Big in many ways: Big Data

More information

Recent Advances in Simulation Techniques and Tools

Recent Advances in Simulation Techniques and Tools Recent Advances in Simulation Techniques and Tools Yuyang Li, li.yuyang(at)wustl.edu (A paper written under the guidance of Prof. Raj Jain) Download Abstract: Simulation refers to using specified kind

More information

Programming and Optimization with Intel Xeon Phi Coprocessors. Colfax Developer Training One-day Labs CDT 102

Programming and Optimization with Intel Xeon Phi Coprocessors. Colfax Developer Training One-day Labs CDT 102 Programming and Optimization with Intel Xeon Phi Coprocessors Colfax Developer Training One-day Labs CDT 102 Abstract: Colfax Developer Training (CDT) is an in-depth intensive course on efficient parallel

More information

Session Three: Pulsar Data and Dispersion Measure

Session Three: Pulsar Data and Dispersion Measure Slide 1 Session Three: Pulsar Data and Dispersion Measure Sue Ann Heatherly and Sarah Scoles Slide 2 Plot Review Average pulse profile Time domain Reduced χ 2 Recall that last week, we learned about three

More information

A

A PLAN-E Monday September 29 10.00-10.30 Registration 10.30-10.35 Opening, logistics and Introduction, Patrick Aerts 10.35-10.50 Welcome address by Wilco Hazeleger, Director/CEO NLeSC 10.50-11.10 Goals for

More information

Image-Domain Gridding on Accelerators

Image-Domain Gridding on Accelerators Netherlands Institute for Radio Astronomy Image-Domain Gridding on Accelerators Bram Veenboer Monday 26th March, 2018, GPU Technology Conference 2018, San Jose, USA ASTRON is part of the Netherlands Organisation

More information

Contribution to the Smecy Project

Contribution to the Smecy Project Alessio Pascucci Contribution to the Smecy Project Study some performance critical parts of Signal Processing Applications Study the parallelization methodology in order to achieve best performances on

More information

GPU Computing for Cognitive Robotics

GPU Computing for Cognitive Robotics GPU Computing for Cognitive Robotics Martin Peniak, Davide Marocco, Angelo Cangelosi GPU Technology Conference, San Jose, California, 25 March, 2014 Acknowledgements This study was financed by: EU Integrating

More information

Oculus Rift Getting Started Guide

Oculus Rift Getting Started Guide Oculus Rift Getting Started Guide Version 1.7.0 2 Introduction Oculus Rift Copyrights and Trademarks 2017 Oculus VR, LLC. All Rights Reserved. OCULUS VR, OCULUS, and RIFT are trademarks of Oculus VR, LLC.

More information

Supporting x86-64 Address Translation for 100s of GPU Lanes. Jason Power, Mark D. Hill, David A. Wood

Supporting x86-64 Address Translation for 100s of GPU Lanes. Jason Power, Mark D. Hill, David A. Wood Supporting x86-64 Address Translation for 100s of GPU s Jason Power, Mark D. Hill, David A. Wood Summary Challenges: CPU&GPUs physically integrated, but logically separate; This reduces theoretical bandwidth,

More information

Table of Contents HOL ADV

Table of Contents HOL ADV Table of Contents Lab Overview - - Horizon 7.1: Graphics Acceleartion for 3D Workloads and vgpu... 2 Lab Guidance... 3 Module 1-3D Options in Horizon 7 (15 minutes - Basic)... 5 Introduction... 6 3D Desktop

More information

Monte Carlo integration and event generation on GPU and their application to particle physics

Monte Carlo integration and event generation on GPU and their application to particle physics Monte Carlo integration and event generation on GPU and their application to particle physics Junichi Kanzaki (KEK) GPU2016 @ Rome, Italy Sep. 26, 2016 Motivation Increase of amount of LHC data (raw &

More information

Software Correlators for Dish and Sparse Aperture Arrays of the SKA Phase I

Software Correlators for Dish and Sparse Aperture Arrays of the SKA Phase I Software Correlators for Dsh and Sparse Aperture Arrays of the SKA Phase I Jongsoo Km Korea Astronomy and Space Scence Insttute Collaborators: Paul Alexander (Unv. of Cambrdge) Andrew Faulkner (Unv. of

More information

Real-time RFI Mitigation in Radio Astronomy

Real-time RFI Mitigation in Radio Astronomy Washington University in St. Louis Washington University Open Scholarship Senior Honors Papers / Undergraduate Theses Undergraduate Research Fall 5-18-2019 Real-time RFI Mitigation in Radio Astronomy Emily

More information

DG Research&Innovation

DG Research&Innovation PLAN-E meets DG Research&Innovation Dr. Patrick J.C. Aerts PLAN-E 1 escience escience is a new direction, with a problem driven, holistic approach, encompassing computational science, data research, making

More information

I( ) 1 P a g e. Estimation of modulation index of pulsar

I( ) 1 P a g e. Estimation of modulation index of pulsar 1. Aim of the experiment The aim of this experiment is to find out the pulse to pulse intensity modulation of the pulsar radio emission. ORT will be used to carry out observations of a select list of pulsars

More information

A software baseband receiver for pulsar astronomy at GMRT

A software baseband receiver for pulsar astronomy at GMRT Bull. Astr. Soc. India (26) 34, 41 412 A software baseband receiver for pulsar astronomy at GMRT B. C. Joshi 1 and Sunil Ramakrishna 2,3 1 National Centre for Radio Astrophysics (TIFR), Pune 411 7, India

More information

Exploiting Coarse-Grained Task, Data, and Pipeline Parallelism in Stream Programs

Exploiting Coarse-Grained Task, Data, and Pipeline Parallelism in Stream Programs Exploiting Coarse-Grained Task, Data, and Pipeline Parallelism in Stream Programs Michael Gordon, William Thies, and Saman Amarasinghe Massachusetts Institute of Technology ASPLOS October 2006 San Jose,

More information

Document downloaded from:

Document downloaded from: Document downloaded from: http://hdl.handle.net/1251/64738 This paper must be cited as: Reaño González, C.; Pérez López, F.; Silla Jiménez, F. (215). On the design of a demo for exhibiting rcuda. 15th

More information

Oculus Rift Getting Started Guide

Oculus Rift Getting Started Guide Oculus Rift Getting Started Guide Version 1.23 2 Introduction Oculus Rift Copyrights and Trademarks 2017 Oculus VR, LLC. All Rights Reserved. OCULUS VR, OCULUS, and RIFT are trademarks of Oculus VR, LLC.

More information

LOFAR: Lessons Learnt

LOFAR: Lessons Learnt LOFAR: Lessons Learnt Michiel van Haarlem van Weeren, Bonafede, Ferrari, Orrù, Pizzo, Shulevski, van der Tol, Macario Jason Hessels & Pulsar Team LOFAR 40 stations in NL and 8 stations throughout Europe

More information

Improving GPU Performance via Large Warps and Two-Level Warp Scheduling

Improving GPU Performance via Large Warps and Two-Level Warp Scheduling Improving GPU Performance via Large Warps and Two-Level Warp Scheduling Veynu Narasiman The University of Texas at Austin Michael Shebanow NVIDIA Chang Joo Lee Intel Rustam Miftakhutdinov The University

More information

CUDA-Accelerated Satellite Communication Demodulation

CUDA-Accelerated Satellite Communication Demodulation CUDA-Accelerated Satellite Communication Demodulation Renliang Zhao, Ying Liu, Liheng Jian, Zhongya Wang School of Computer and Control University of Chinese Academy of Sciences Outline Motivation Related

More information

Ben Baker. Sponsored by:

Ben Baker. Sponsored by: Ben Baker Sponsored by: Background Agenda GPU Computing Digital Image Processing at FamilySearch Potential GPU based solutions Performance Testing Results Conclusions and Future Work 2 CPU vs. GPU Architecture

More information

Impact from Industrial use of HPC HPC User Forum #59 Munich, Germany October 2015

Impact from Industrial use of HPC HPC User Forum #59 Munich, Germany October 2015 Impact from Industrial use of HPC HPC User Forum #59 Munich, Germany October 2015 Merle Giles Director, Private Sector Program and Economic Impact HPC is a gauge of relative technological prowess of nations

More information

NUIT Support of Researchers

NUIT Support of Researchers NUIT Support of Researchers RACC Meeting September 13, 2010 Bob Taylor Director, Academic and Research Technologies Research Support Focus FY2011 High Performance Computing (HPC) Capabilities Research

More information

An FPGA-Based Back End for Real Time, Multi-Beam Transient Searches Over a Wide Dispersion Measure Range

An FPGA-Based Back End for Real Time, Multi-Beam Transient Searches Over a Wide Dispersion Measure Range An FPGA-Based Back End for Real Time, Multi-Beam Transient Searches Over a Wide Dispersion Measure Range Larry D'Addario 1, Nathan Clarke 2, Robert Navarro 1, and Joseph Trinh 1 1 Jet Propulsion Laboratory,

More information

LOFAR Data Products. First LOFAR Data Processing School 10 February Michael Wise

LOFAR Data Products. First LOFAR Data Processing School 10 February Michael Wise LOFAR Data Products First LOFAR Data Processing School 10 February 2009 Michael Wise MAC and Input section Aux. processing section system processing Input section Aux. processing section system processing

More information

ARTEMIS The Embedded Systems European Technology Platform

ARTEMIS The Embedded Systems European Technology Platform ARTEMIS The Embedded Systems European Technology Platform Technology Platforms : the concept Conditions A recipe for success Industry in the Lead Flexibility Transparency and clear rules of participation

More information

Use Nvidia Performance Primitives (NPP) in Deep Learning Training. Yang Song

Use Nvidia Performance Primitives (NPP) in Deep Learning Training. Yang Song Use Nvidia Performance Primitives (NPP) in Deep Learning Training Yang Song Outline Introduction Function Categories Performance Results Deep Learning Specific Further Information What is NPP? Image+Signal

More information

Hardware Software Science Co-design in the Human Brain Project

Hardware Software Science Co-design in the Human Brain Project Hardware Software Science Co-design in the Human Brain Project Wouter Klijn 29-11-2016 Pune, India 1 Content The Human Brain Project Hardware - HBP Pilot machines Software - A Neuron - NestMC: NEST Multi

More information

HIGH PERFORMANCE COMPUTING USING GPGPU FOR RADAR APPLICATIONS

HIGH PERFORMANCE COMPUTING USING GPGPU FOR RADAR APPLICATIONS HIGH PERFORMANCE COMPUTING USING GPGPU FOR RADAR APPLICATIONS Viswam Gampala 1 (visgam@yahoo.co.in), Akshay BM 1, A Vengadarajan 1, PS Avadhani 2 1. Electronics & Radar Development Establishment, DRDO,

More information

CORRECTED VISION. Here be underscores THE ROLE OF CAMERA AND LENS PARAMETERS IN REAL-WORLD MEASUREMENT

CORRECTED VISION. Here be underscores THE ROLE OF CAMERA AND LENS PARAMETERS IN REAL-WORLD MEASUREMENT Here be underscores CORRECTED VISION THE ROLE OF CAMERA AND LENS PARAMETERS IN REAL-WORLD MEASUREMENT JOSEPH HOWSE, NUMMIST MEDIA CIG-GANS WORKSHOP: 3-D COLLECTION, ANALYSIS AND VISUALIZATION LAWRENCETOWN,

More information

GPU ACCELERATED DEEP LEARNING WITH CUDNN

GPU ACCELERATED DEEP LEARNING WITH CUDNN GPU ACCELERATED DEEP LEARNING WITH CUDNN Larry Brown Ph.D. March 2015 AGENDA 1 Introducing cudnn and GPUs 2 Deep Learning Context 3 cudnn V2 4 Using cudnn 2 Introducing cudnn and GPUs 3 HOW GPU ACCELERATION

More information

Liu Yang, Bong-Joo Jang, Sanghun Lim, Ki-Chang Kwon, Suk-Hwan Lee, Ki-Ryong Kwon 1. INTRODUCTION

Liu Yang, Bong-Joo Jang, Sanghun Lim, Ki-Chang Kwon, Suk-Hwan Lee, Ki-Ryong Kwon 1. INTRODUCTION Liu Yang, Bong-Joo Jang, Sanghun Lim, Ki-Chang Kwon, Suk-Hwan Lee, Ki-Ryong Kwon 1. INTRODUCTION 2. RELATED WORKS 3. PROPOSED WEATHER RADAR IMAGING BASED ON CUDA 3.1 Weather radar image format and generation

More information

GPU-based data analysis for Synthetic Aperture Microwave Imaging

GPU-based data analysis for Synthetic Aperture Microwave Imaging GPU-based data analysis for Synthetic Aperture Microwave Imaging 1 st IAEA Technical Meeting on Fusion Data Processing, Validation and Analysis 1 st -3 rd June 2015 J.C. Chorley 1, K.J. Brunner 1, N.A.

More information

Image Processing Architectures (and their future requirements)

Image Processing Architectures (and their future requirements) Lecture 17: Image Processing Architectures (and their future requirements) Visual Computing Systems Smart phone processing resources Qualcomm snapdragon Image credit: Qualcomm Apple A7 (iphone 5s) Chipworks

More information

Investigating the Post Processing of LS-DYNA in a Fully Immersive Workflow Environment

Investigating the Post Processing of LS-DYNA in a Fully Immersive Workflow Environment Investigating the Post Processing of LS-DYNA in a Fully Immersive Workflow Environment Ed Helwig 1, Facundo Del Pin 2 1 Livermore Software Technology Corporation, Livermore CA 2 Livermore Software Technology

More information

www.ixpug.org @IXPUG1 What is IXPUG? http://www.ixpug.org/ Now Intel extreme Performance Users Group Global community-driven organization (independently ran) Fosters technical collaboration around tuning

More information

High Performance Computing for Engineers

High Performance Computing for Engineers High Performance Computing for Engineers David Thomas dt10@ic.ac.uk / https://github.com/m8pple Room 903 http://cas.ee.ic.ac.uk/people/dt10/teaching/2014/hpce HPCE / dt10/ 2015 / 0.1 High Performance Computing

More information

The Long Tail of Research Data

The Long Tail of Research Data The Long Tail of Research Data Peter Doorn Director DANS PLAN-E Plenary Paris, 19-20 Apr 2018 @pkdoorn @dansknaw www.dans.knaw.nl DANS is an institute of KNAW and NWO Presentation topics Data big & small:

More information

Establishment of a Multiplexed Thredds Installation and a Ramadda Collaboration Environment for Community Access to Climate Change Data

Establishment of a Multiplexed Thredds Installation and a Ramadda Collaboration Environment for Community Access to Climate Change Data Establishment of a Multiplexed Thredds Installation and a Ramadda Collaboration Environment for Community Access to Climate Change Data Prof. Giovanni Aloisio Professor of Information Processing Systems

More information

LOFAR: From raw visibilities to calibrated data

LOFAR: From raw visibilities to calibrated data Netherlands Institute for Radio Astronomy LOFAR: From raw visibilities to calibrated data John McKean (ASTRON) [subbing in for Manu] ASTRON is part of the Netherlands Organisation for Scientific Research

More information

Interaction Design in Digital Libraries : Some critical issues

Interaction Design in Digital Libraries : Some critical issues Interaction Design in Digital Libraries : Some critical issues Constantine Stephanidis Foundation for Research and Technology-Hellas (FORTH) Institute of Computer Science (ICS) Science and Technology Park

More information

Radio Astronomy Transformed

Radio Astronomy Transformed Radio Astronomy Transformed - Aperture Arrays: Past, Present & Future Prof. Michael Garrett ASTRON, the Netherlands Institute for Radio Astronomy Leiden University. Mike Garrett / NAC 1 Early Antenna Arrays

More information

Warp-Aware Trace Scheduling for GPUS. James Jablin (Brown) Thomas Jablin (UIUC) Onur Mutlu (CMU) Maurice Herlihy (Brown)

Warp-Aware Trace Scheduling for GPUS. James Jablin (Brown) Thomas Jablin (UIUC) Onur Mutlu (CMU) Maurice Herlihy (Brown) Warp-Aware Trace Scheduling for GPUS James Jablin (Brown) Thomas Jablin (UIUC) Onur Mutlu (CMU) Maurice Herlihy (Brown) Historical Trends in GFLOPS: CPUs vs. GPUs Theoretical GFLOP/s 3250 3000 2750 2500

More information

CP2K PERFORMANCE FROM CRAY XT3 TO XC30. Iain Bethune Fiona Reid Alfio Lazzaro

CP2K PERFORMANCE FROM CRAY XT3 TO XC30. Iain Bethune Fiona Reid Alfio Lazzaro CP2K PERFORMANCE FROM CRAY XT3 TO XC30 Iain Bethune (ibethune@epcc.ed.ac.uk) Fiona Reid Alfio Lazzaro Outline CP2K Overview Features Parallel Algorithms Cray HPC Systems Trends Water Benchmarks 2005 2013

More information

MISCELLANEOUS CORRECTIONS TO THE BASELINE DESIGN

MISCELLANEOUS CORRECTIONS TO THE BASELINE DESIGN MISCELLANEOUS CORRECTIONS TO THE BASELINE DESIGN Document number... SKA-TEL.SKO-DD-003 Revision... 1 Author...R.McCool, T. Cornwell Date... 2013-10-27 Status... Released Name Designation Affiliation Date

More information

Autonomous spacecraft navigation using millisecond pulsars. Vincent Trung Michael Hecht Vincent Fish

Autonomous spacecraft navigation using millisecond pulsars. Vincent Trung Michael Hecht Vincent Fish Autonomous spacecraft navigation using millisecond pulsars Vincent Trung Michael Hecht Vincent Fish Overview 1. Project description 2. Data collection 3. Methods 4. What does it tell us? 5. Results 6.

More information

Application of Wiener and Adaptive Filters to GPS and Glonass Data from the Rapid Prototyping Array

Application of Wiener and Adaptive Filters to GPS and Glonass Data from the Rapid Prototyping Array ATA Memo #31 2 August 2001 Application of Wiener and Adaptive Filters to GPS and Glonass Data from the Rapid Prototyping Array Geoffrey C. Bower ABSTRACT Wiener and adaptive filters can be used to cancel

More information

Specifications for the GBT spectrometer

Specifications for the GBT spectrometer GBT memo No. 292 Specifications for the GBT spectrometer Authors: D. Anish Roshi 1, Green Bank Scientific Staff, J. Richard Fisher 2, John Ford 1 Affiliation: 1 NRAO, Green Bank, WV 24944. 2 NRAO, Charlottesville,

More information

Job Title: DATA SCIENTIST. Location: Champaign, Illinois. Monsanto Innovation Center - Let s Reimagine Together

Job Title: DATA SCIENTIST. Location: Champaign, Illinois. Monsanto Innovation Center - Let s Reimagine Together Job Title: DATA SCIENTIST Employees at the Innovation Center will help accelerate Monsanto s growth in emerging technologies and capabilities including engineering, data science, advanced analytics, operations

More information

CUDA Threads. Terminology. How it works. Terminology. Streaming Multiprocessor (SM) A SM processes block of threads

CUDA Threads. Terminology. How it works. Terminology. Streaming Multiprocessor (SM) A SM processes block of threads Terminology CUDA Threads Bedrich Benes, Ph.D. Purdue University Department of Computer Graphics Streaming Multiprocessor (SM) A SM processes block of threads Streaming Processors (SP) also called CUDA

More information

New Paradigm in Testing Heads & Media for HDD. Dr. Lutz Henckels September 2010

New Paradigm in Testing Heads & Media for HDD. Dr. Lutz Henckels September 2010 New Paradigm in Testing Heads & Media for HDD Dr. Lutz Henckels September 2010 1 WOW an amazing industry 40%+ per year aerial density growth Source: Coughlin Associates 2010 2 WOW an amazing industry Aerial

More information

Automatic Mapping of Real Time Radio Astronomy Signal Processing Pipelines onto Heterogeneous Clusters

Automatic Mapping of Real Time Radio Astronomy Signal Processing Pipelines onto Heterogeneous Clusters Automatic Mapping of Real Time Radio Astronomy Signal Processing Pipelines onto Heterogeneous Clusters Terry Esther Filiba Electrical Engineering and Computer Sciences University of California at Berkeley

More information

Software Spectrometer for an ASTE Multi-beam Receiver. Jongsoo Kim Korea Astronomy and Space Science Institute

Software Spectrometer for an ASTE Multi-beam Receiver. Jongsoo Kim Korea Astronomy and Space Science Institute Software Spectrometer for an ASTE Multi-beam Receiver Jongsoo Kim Korea Astronomy and Space Science Institute Design Consideration software spectrometer for a near future ASTE multi-beam receiver spectrometer

More information

Leading by design: Q&A with Dr. Raghuram Tupuri, AMD Chris Hall, DigiTimes.com, Taipei [Monday 12 December 2005]

Leading by design: Q&A with Dr. Raghuram Tupuri, AMD Chris Hall, DigiTimes.com, Taipei [Monday 12 December 2005] Leading by design: Q&A with Dr. Raghuram Tupuri, AMD Chris Hall, DigiTimes.com, Taipei [Monday 12 December 2005] AMD s drive to 64-bit processors surprised everyone with its speed, even as detractors commented

More information

How fast is cryptography? D. J. Bernstein University of Illinois at Chicago

How fast is cryptography? D. J. Bernstein University of Illinois at Chicago How fast is cryptography? D. J. Bernstein University of Illinois at Chicago Joint work with: Tanja Lange Technische Universiteit Eindhoven Part of the ebats project (ECRYPT Benchmarking of Asymmetric Systems):

More information

What can POP do for you?

What can POP do for you? What can POP do for you? Mike Dewar, NAG Ltd EU H2020 Center of Excellence (CoE) 1 October 2015 31 March 2018 Grant Agreement No 676553 Outline Overview of codes investigated Code audit & plan examples

More information

Trend of Software R&D for Numerical Simulation Hardware for parallel and distributed computing and software automatic tuning

Trend of Software R&D for Numerical Simulation Hardware for parallel and distributed computing and software automatic tuning SCIENCE & TECHNOLOGY TRENDS 4 Trend of Software R&D for Numerical Simulation Hardware for parallel and distributed computing and software automatic tuning Takao Furukawa Promoted Fields Unit Minoru Nomura

More information

Artificial Intelligence Machine learning and Deep Learning: Trends and Tools. Dr. Shaona

Artificial Intelligence Machine learning and Deep Learning: Trends and Tools. Dr. Shaona Artificial Intelligence Machine learning and Deep Learning: Trends and Tools Dr. Shaona Ghosh @shaonaghosh What is Machine Learning? Computer algorithms that learn patterns in data automatically from large

More information

Informatica Universiteit van Amsterdam. Performance optimization of Rush Hour board generation. Jelle van Dijk. June 8, Bachelor Informatica

Informatica Universiteit van Amsterdam. Performance optimization of Rush Hour board generation. Jelle van Dijk. June 8, Bachelor Informatica Bachelor Informatica Informatica Universiteit van Amsterdam Performance optimization of Rush Hour board generation. Jelle van Dijk June 8, 2018 Supervisor(s): dr. ir. A.L. (Ana) Varbanescu Signed: Signees

More information

Building a Cell Ecosystem. David A. Bader

Building a Cell Ecosystem. David A. Bader Building a Cell Ecosystem David A. Bader Acknowledgment of Support National Science Foundation CSR: A Framework for Optimizing Scientific Applications (06-14915) CAREER: High-Performance Algorithms for

More information

1 st V4Design Newsletter (March 2018)

1 st V4Design Newsletter (March 2018) V4Design Visual and textual content re-purposing FOR(4) architecture, Design ang virtual reality games This project has received funding from the European Union s H2020 Research and Innovation Programme,

More information

Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes

Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes Rachata Ausavarungnirun Joshua Landgraf Vance Miller Saugata Ghose Jayneel Gandhi Christopher J. Rossbach Onur

More information

The Who. Intel - no introduction required.

The Who. Intel - no introduction required. Delivering Demand-Based Worlds with Intel SSD GDC 2011 The Who Intel - no introduction required. Digital Extremes - In addition to be great developers of AAA games, they are also the authors of the Evolution

More information

Where the brightest scientific minds thrive. IMED Early Talent and Post Doc programmes

Where the brightest scientific minds thrive. IMED Early Talent and Post Doc programmes Where the brightest scientific minds thrive. IMED Early Talent and Post Doc programmes Scientific innovation is at the heart of everything we do. Scientific innovation is at the heart of our business and

More information

Dr Myat Su Hlaing Asia Research Center, Yangon University, Myanmar. Data programming model for an operation based parallel image processing system

Dr Myat Su Hlaing Asia Research Center, Yangon University, Myanmar. Data programming model for an operation based parallel image processing system Name: Affiliation: Field of research: Specific Field of Study: Proposed Research Topic: Dr Myat Su Hlaing Asia Research Center, Yangon University, Myanmar Information Science and Technology Computer Science

More information

Pulsar Observation with the Effelsberg LOFAR station (Stand alone mode)

Pulsar Observation with the Effelsberg LOFAR station (Stand alone mode) Pulsar Observation with the Effelsberg LOFAR station (Stand alone mode) + some results with Superterp Masaya Kuniyoshi (MPIfR) (MKSP and Pulsar working Group) On behalf of LOFAR collaboration Outline Difference

More information

Connecting Science and Society. NWO strategy

Connecting Science and Society. NWO strategy Connecting Science and Society NWO strategy 2019-2022 Connecting Science and Society This strategic plan describes NWO's course for the period 2019 to 2022. In this plan, NWO emphasises its connecting

More information

Plan for Imaging Algorithm Research and Development

Plan for Imaging Algorithm Research and Development Plan for Imaging Algorithm Research and Development S. Bhatnagar July 05, 2009 Abstract Many scientific deliverables of the next generation radio telescopes require wide-field imaging or high dynamic range

More information

Parallel Programming Design of BPSK Signal Demodulation Based on CUDA

Parallel Programming Design of BPSK Signal Demodulation Based on CUDA Int. J. Communications, Network and System Sciences, 216, 9, 126-134 Published Online May 216 in SciRes. http://www.scirp.org/journal/ijcns http://dx.doi.org/1.4236/ijcns.216.9511 Parallel Programming

More information

신경망기반자동번역기술. Konkuk University Computational Intelligence Lab. 김강일

신경망기반자동번역기술. Konkuk University Computational Intelligence Lab.  김강일 신경망기반자동번역기술 Konkuk University Computational Intelligence Lab. http://ci.konkuk.ac.kr kikim01@kunkuk.ac.kr 김강일 Index Issues in AI and Deep Learning Overview of Machine Translation Advanced Techniques in

More information