Parallel Programming I! (Fall 2016, Prof.dr. H. Wijshoff)

Similar documents
December 10, Why HPC? Daniel Lucio.

The end of Moore s law and the race for performance

High Performance Computing and Modern Science Prof. Dr. Thomas Ludwig

The Exponential Promise of High Performance Computing Prof. Dr. Thomas Ludwig

The Race for Leadership in Supercomputers Does India Stand A Chance?

The impact of e-infrastructures for science and innovation in Europe Achim Bachem, Forschungszentrum Jülich

The Future of Intelligence, Artificial and Natural. HI-TECH NATION April 21, 2018 Ray Kurzweil

IESP AND APPLICATIONS. IESP BOF, SC09 Portland, Oregon Paul Messina November 18, 2009

Building a Cell Ecosystem. David A. Bader

National e-infrastructure for Science. Jacko Koster UNINETT Sigma

SKA Phase 1: Costs of Computation. Duncan Hall CALIM 2010

The Bump in the Road to Exaflops and Rethinking LINPACK

Exascale Initiatives in Europe

The objective of this book is to present the concepts of supercomputing, explore its technologies and their applications, and develop a broad understa

THE EARTH SIMULATOR CHAPTER 2. Jack Dongarra

The Future of Technology

Computer Science as a Discipline

CS4961 Parallel Programming. Lecture 1: Introduction 08/24/2010. Course Details Time and Location: TuTh, 9:10-10:30 AM, WEB L112 Course Website

A Balanced Introduction to Computer Science, 3/E

Sourcing in Scientific Computing

High Performance Computing Facility for North East India through Information and Communication Technology

The Spanish Supercomputing Network (RES)

Computational Science and Engineering Introduction

CS Computer Architecture Spring Lecture 04: Understanding Performance

CSC384 Intro to Artificial Intelligence* *The following slides are based on Fahiem Bacchus course lecture notes.

The Next-Generation Supercomputer Project and the Future of High End Computing in Japan

European View on Supercomputing

Challenges in Transition

High Performance Computing i el sector agro-alimentari Fundació Catalana per la Recerca CAFÈ AMB LA RECERCA

Outline. PRACE A Mid-Term Update Dietmar Erwin, Forschungszentrum Jülich ORAP, Lille, March 26, 2009

CSCI-564 Advanced Computer Architecture

Special Contribution Supercomputing in Industrial Manufacturing

Supercomputers have become critically important tools for driving innovation and discovery

High Performance Computing Scientific Discovery and the Importance of Collaboration

A School in Computational Science &

DEEP LEARNING A NEW COMPUTING MODEL. Sundara R Nagalingam Head Deep Learning Practice

Exascale Challenges for the Computational Science Community

Enabling Scientific Breakthroughs at the Petascale

Deep Learning Overview

H2020 and HPC: A view from INFN

Enabling technologies for beyond exascale computing

The PRACE Scientific Steering Committee

PoS(ISGC 2013)025. Challenges of Big Data Analytics. Speaker. Simon C. Lin 1. Eric Yen

Special Contribution Japan s K computer Project

Early Adopter : Multiprocessor Programming in the Undergraduate Program. NSF/TCPP Curriculum: Early Adoption at the University of Central Florida

Computing center for research and Technology - CCRT

What can POP do for you?

Parallel Computing 2020: Preparing for the Post-Moore Era. Marc Snir

ECS15: Introduction to Computers

11/11/ PARTNERSHIP FOR ADVANCED COMPUTING IN EUROPE

HIGH-LEVEL SUPPORT FOR SIMULATIONS IN ASTRO- AND ELEMENTARY PARTICLE PHYSICS

Mathematics of Magic Squares and Sudoku

IBM Research - Zurich Research Laboratory

Facts and Figures. RESEARCH TEACHING INNOVATION

Comparative Analysis of Air Conditioning System Using PID and Neural Network Controller

First Experience with PCP in the PRACE Project: PCP at any cost? F. Berberich, Forschungszentrum Jülich, May 8, 2012, IHK Düsseldorf

This list supersedes the one published in the November 2002 issue of CR.

What We Talk About When We Talk About AI

DIGITAL TECHNOLOGIES FOR A BETTER WORLD. NanoPC HPC

An Introduction to SIMDAT a Proposal for an Integrated Project on EU FP6 Topic. Grids for Integrated Problem Solving Environments

Executive summary. AI is the new electricity. I can hardly imagine an industry which is not going to be transformed by AI.

NOV View Ahead. DAVID REID Chief Marketing Officer. ExxonMobil 2040 Outlook Simplified: Demand MORE PEOPLE MORE MONEY MORE ENERGY.

CS10 Final Exam Review. August 5, 2012

Exascale computer system design : the square kilometre array Jongerius, R.

Global Artificial Intelligence (AI) Semiconductor Market: Size, Trends & Forecasts ( ) August 2018

RAPS ECMWF. RAPS Chairman. 20th ORAP Forum Slide 1

GO for IT. Guillaume Chaslot. Mark Winands

RSE in UK Academia. Paul Richmond University of Sheffield (UK)

Center for Hybrid Multicore Productivity Research (CHMPR)

STEM University UMSL Boy Scout College

Introduction to SHAPE Removing barriers to HPC for SMEs

Smarter oil and gas exploration with IBM

The Innovation Machine and the Role of Research! Infrastructure Investment:! Part 3!

In University Supercomputing, the Fastest May No Longer Be the Best - Technology

The Past and Future of Computing in Geotechnical Engineering: The Inside-out View

ECONOMIC COMPLEXITY BRIEFING NEW APPROACH PREDICTS ECONOMIC GROWTH. How does an economy grow? What exactly is Economic Complexity?

Engineering Connected Intelligence

Performance Metrics, Amdahl s Law

CS4617 Computer Architecture

Electronics Science and Technology Program

The Department of Instrument Science and Engineering (ISE) Program Overview

Part II. Numerical Simulation

Measuring and Evaluating Computer System Performance

Success Stories of Interdisciplinary Research and Development Worldwide

Graduate Studies in Computational Science at U-M. Graduate Certificate in Computational Discovery and Engineering. and

Global Alzheimer s Association Interactive Network. Imagine GAAIN

EESI Presentation at IESP

PRACE A Mid-Term Update Dietmar Erwin, Forschungszentrum Jülich Event, Location, Date

The future of work. Nav Singh Managing Partner, Boston McKinsey & Company

The Path To Extreme Computing

B.E. (CGPA) (All Branches) Sem-VII & VIII) Examinations Nov/Dec, 2018

CS6700: The Emergence of Intelligent Machines. Prof. Carla Gomes Prof. Bart Selman Cornell University

What is Artificial Intelligence? Alternate Definitions (Russell + Norvig) Human intelligence

Fujitsu Laboratories Advanced Technology Symposium 2017

Activity. Image Representation

Impact Impact Factor Impact Factor Impact Factor Bil Journal Title eissn NO. Factor Applied Physics Letter

UNIT 5: RATIO, PROPORTION, AND PERCENT WEEK 20: Student Packet

South's Perspective on Adapating Industrial Policies for Economic Diversification and Strucutural Transformation to a Digital World

Journal Title ISSN 5. MIS QUARTERLY BRIEFINGS IN BIOINFORMATICS

Architecting Systems of the Future, page 1

Transcription:

Parallel Programming I! (Fall 2016, Prof.dr. H. Wijshoff) Four parts: Introduction to Parallel Programming and Parallel Architectures (partly based on slides from Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar accompanying ``Introduction to Parallel Computing'', Addison Wesley, 2003.) Parallel Algorithms Parallel Algorithm Design Parallel Numerical Computing Parallel Graph Computing Parallel Sorting Existing Programming Paradigms New Programming Paradigms

Lab/Homework/Assignments During the course of the semester, programming assignments. These assignments make for 40% of the total load. Open book final exam will make up for remainder 60%.

A Long History The advent of parallel computing dates back to the fifties of the last century IBM introduced the 704 (full parallel floating point arithmetic) in 1954, through a project in which Gene Amdahl was one of the principal architects. In April 1958, Stanley Gill (Ferranti Ltd, inventor of the subroutine) discussed parallel programming and the need for branching and waiting. Also in 1958, IBM researchers John Cocke and Daniel Slotnick discussed the use of parallelism in numerical calculations for the first time. In 1969, US company Honeywelll introduced its first Multics system, a symmetric multiprocessor system capable of running up to eight processors in parallel. The ILLIAC IV (1971) was one of the first attempts to build a massively parallel computer. One of a series of research machines (the ILLIACs from the University of Illinois), the ILLIAC IV design featured fairly high parallelism with up to 256 processors.

Milestones ( the race to the bottom) 1972: First Supercomputer: CRAY 1 1 MFLOP = 1 000 000 operaties/sec. 1989: CRAY YMP 1 GFLOP = 1 000 000 000 operaties/sec. 1996: ASCI red (Intel based parallel processor) 1 TFLOP = 1 000 000 000 000 oper./sec. 2008: IBM Roadrunner 1 PFLOP = 1 000 000 000 000 000 oper./sec. In Nov. 2014: NUDT (China): 54.9 PFLOP (0.054 EXAFLOP) 54 900 000 000 000 000 oper./sec. achieved by 3 120 000 cores using up to 18 MW

Liquid Cooled Cray 2

In December 2014 (http://top500.org/lists/2014/11)

As of November 2015 (http://top500.org/lists/2015/11/)

As of June 2016 (http://top500.org/lists/2016/06/)

In Fact Press Release June 2016: China maintained its No. 1 ranking on the 47th edition of the TOP500 list of the world s top supercomputers, but with a new system built entirely using processors designed and made in China. Sunway TaihuLight is the new No. 1 system with 93 petaflop/s (quadrillions of calculations per second) on the LINPACK benchmark.

What does 18 MW mean? Country Population Power per capita (W/p)* 18 MW equivalent in people China 1.360.000.000 458 39.000 United States 318.000.000 1683 11.000 European Union 504.000.000 688 26.000 India 1.243.000.000 101 178.000 Netherlands 17.000.000 764 24.000 Syria 22.000.000 147 122.000 Afghanistan 30.000.000 1 18.000.000 * Taken from https://en.wikipedia.org/wiki/list_of_countries_by_electricity_consumption

What does 1 PFLOP mean? Multiplying 2 numbers with 15 decimals Paper and Pencil: 1 per 4 minutes Calculator: 10 per minute (based on 300 cpm (character per minute) (750 cpm world champion, Guinness, 2014) ) World Population of 6 000 000 000 (6*10 9 ), gives a speed of 60 000 000 000 per minute by calculators, which is equivalent to 60 GFLOPS 1 PFLOP is 1000/60 = 17 faster! (And 1 ExaFLOP is 17000 faster!!!)

WHY do we need to compute at these rates? Exponential growth of computational complexity (Easy) example: CHESS Assume an average of 10 possible moves per turn Average chess match: 80 turns So 10^80 different possible outcomes With 1 PFLOP: 10^65 sec = 4x10^57 years = 4x10^54 centuries = 10^48 x the existence of the universe

WHY (II) Large Scale of computations Example: Weather Forecasting

Computation (Simulation) For each grid point the interaction with its neighbor gridpoints are computed with respect to temperature, air pressure, moisture, etc Europe s surface: 5 700 000 km^2 Air heigth: 10 km With a 1m x 1m x 1m grid this results in: 57 000 000 000 000 000 = 5.7 x 10^16 grid points

Computation (II) Several operations per grid point: Assume for each second and for 5 variables, then for a prediction of 12 hours: 5 x 12 x 60 x 60 = 216 000 operations per grid point With a 1 PFLOP computer this takes: 5.7 x 10^16 X 216 x 10^3 / 10^15= 12 x 10^21 / 10^15 = 12 x 10^6 sec. = 3333 hours = 138 days!!!!!!!!! For a 1mm x 1mm x 1mm grid: 10^9 x 138 days = 380.000 centuries compute time required at 1 PFLOP rate for a 12 hours forecast

HPC Grand Challenges "A Research and Development Strategy for High Performance Computing", Executive Office of the President, Office of Science and Technology Policy, November 20, 1987 Prediction of weather, climate, and global change Challenges in materials sciences Semiconductor design Superconductivity Structural biology Design of pharmaceutical drugs Human genome Quantum chromodynamics Astronomy Challenges in Transportation Vehicle Signature Turbulence Vehicle dynamics Nuclear fusion Efficiency of combustion systems Enhanced oil and gas recovery Computational ocean sciences Speech Vision Undersea surveillance for anti-submarine warfare

Recently this list of applications was enlarged significantly Next to applications in engineering and design, we have Scientific Applications: structural characterization of genes and proteins, new materials: understanding chemical pathways, bio-informatics and astrophysics, etc Commercial Applications: servers for large scale web servers (google, facebook, etc.), trading systems, etc. Applications in Computer Systems (the Internet itself): intrusion detection, cryptography, etc. Applications for social networks: online data mining, Data Mining at large