NICIS: Stepping stone to a SA Cyberinfrastructure Commons? CHAIN REDS Conference Open Science at the Global Scale: Sharing e- Infrastructures, Sharing Knowledge, Sharing Progress 20150331 Prof Colin J Wright cjwright@csir.co.za 25/03/2015 CHAIN-REDS 1
Conference Title: Open Science at the Global Scale: - Sharing e-infrastructures - Sharing Knowledge - Sharing Progress Open Science (Wikipedia) make scientific research, data and dissemination accessible to all levels of an inquiring society, amateur or professional. Encompasses publishing open research, campaigning for open access, encouraging scientists to practice open notebook science and generally making it easier to publish and communicate scientific knowledge. Began in the 1600s with advent of the academic journal. H2020 Excellent Science pillar has main four specific objectives, one of which is: Research infrastructure (including e-infrastructures) will develop European research infrastructure for 2020 and beyond, foster their innovation potential and human capital, and complement this with the related Union policy and international cooperation. NICIS 25/03/2015 CHAIN-REDS 2
e-research and the Fourth Paradigm I. Thousand years ago Experimental Science o Description of natural phenomena II. Last few hundred years Theoretical Science o Newton s Laws, Maxwell s Equations III. Last few decades Computational Science o Simulation of complex phenomena IV. Today 4 th Paradigm Data Intensive Science o Researchers overwhelmed with data from different sources: Preserve data Analyse and visualise Access. o Research Data Infrastructures (Wikipedia) e-science: The application of computer technology to the undertaking of modern scientific investigation, including preparation, experimentation, data collection, results dissemination, and long-term storage and accessibility of all materials generated through scientific process. These may include data modelling and analysis, electronic / digitized laboratory notebooks, raw and fitted data sets, manuscript production and draft versions, pre-prints, print and/or electronic publications. 25/03/2015 CHAIN-REDS 3
Research- and e-infrastructures are needed to drive Innovation Knowledge Triangle comprising Research, Education and Innovation must form an ecosystem with People, Industries and Universities / Institutes Research Landscape is changing Data driving innovation & discovery: access, ease-of-use, interoperability of data Traditional modes of research recognition evolving: new approaches to collaboration /competition, publication, citation, analysis all involving digital data Persuade/ cajole/ encourage right behaviour Educational Landscape should change University curricula becoming more data driven From cradle to grave computational and data literacy e-i supported education Workforce is changing Data literacy required from everyone Computational and data science embedded in everything Data scientists / engineers increasingly critical for competitiveness and leadership 25/03/2015 CHAIN-REDS 4
Various views of CI Capability Industrial Sector Awareness Connections Computing Hardware Software Data Networks Security Computing & Data Skills Sector Domain Knowledge D. Tildesley: Vision of integrated e-infrastructure ecosystem 25/03/2015 CHAIN-REDS 5
South African National Cyberinfrastructures Strategic Science Imperatives led DST into the arena VLDB / DIRISA 25/03/2015 CHAIN-REDS 6
2012 Review: NICIS National Integrated Cyber-Infrastructure System NICIS VISION: take national leadership in the provision of a comprehensive Cyber- Infrastructure essential to 21st century advances for South Africa in research, education and innovation. NICIS MISSION: increase knowledge creation through provision of a national platform of essential Cyber-Infrastructure. Governance NICIS PRINCIPLES Joint planning and budgeting Good governance Visibility of CI services Sustainability Constructive stakeholder engagement Strategy Advisory Board Senior Management NICIS: TIER 1 User Inputs Advanced Services 25/03/2015 CHAIN-REDS 7
NICIS Networking Services: NREN Important for DST to have significant involvement in the NREN both because it will be a primary funder and so will need to account for the spending of public funds and because it is in the position of having strategic oversight and can take into account all national interests. Other key parties such as the universities will also have a stake in the process and be able to hold the service area to account. Long-term networking services plans; Value-added services. + SANReN services Joined REANNZ and ESNet in development of experimental Software Defined Networking (SDN) test-bed perfsonar eduroam FIM Video & Web Conferencing CSIRT GOLE Lightpath services DMZ Mirror Sites 25/03/2015 CHAIN-REDS 8
NICIS Components SAGrid SAGrid Infrastructure interoperability: efforts to bring the Africa-Arabia Regional Operations Centre (AAROC) to production; Infrastructure automation services; Application and community services: Continuous Integration platform, Application Delivery and Science Gateways; Operations and Virtual Research Communities (VRC) support: support to the Worldwide LHC Computing Grid (WLCG) activities in SA. Data Infrastructure: PID service in data infrastructure offering; Identity Infrastructure: bring a pilot service of Perun ID management to production; and Training. CHAIN-REDS EGI NICIS: Cloud and Grid Computing Support for Tier 1 Cloud and Grid Computing endorsed. 25/03/2015 CHAIN-REDS 9
NICIS: Centre for High Performance Computing T1 Machines Tsessebe 61.5Tf Blue Gene/P 11.5Tf iqudu 2.5Tf GPU cluster 16Tf VLDB data storage T3 T2 CHPC in essentially its current form should take on the role of the Computing Services area, with some changes to its mandate. Compute-intensive, Communication-intensive Data-intensive Sustainable, impactful, user sensitive Visualisation, Cybersecurity 25/03/2015 CHAIN-REDS 10
And then there was DATA!! Data is the new oil! Consequences of data for science Data: Computing, sensors, satellites, social media, imaging,.. Data enables discovery Scientists interact with data in multiple and multi-disciplinary ways Science increasingly depends on networks for success The very structure of modern science assumes there is a network interconnecting all parts of the collaboration Services must be common and consistent across multiple infrastructure domains Scientists collaborate, exchange data, conduct experiments irrespective of geography 11/06/12 3/25/2015 25/03/2015 CHAIN-REDS 11
NICIS SC: Data Services NICIS Data Services Area (should) be leading organisation within South Africa to advocate for and implement data initiatives across the research community. NICIS (should) work with the community to develop an ambitious proposal on data services to DST. New funding necessary for NICIS expanded Data Services. 25/03/2015 CHAIN-REDS 12
NICIS Components: Skills and Training Services Area The shortage of e-i skills is global phenomenon. Should offer effective coordination of e-i Skills and Training services within a sustainable framework. Collaborations between the national organisation and the universities, particularly in the area of human skills and training. Role is to work with and through HEIs and RCs to grow e-i savvy cohort. Build cohort of data professionals to support research infrastructure development. Examples E-Research Data Science Computational. HPC Data analytics. Main Targets Cyber-infrastructure professionals, for developing operational knowledge of CI. Researchers. Link to next generation of students in computer science who could become future CI experts and students in other disciplines who could become future users. 25/03/2015 CHAIN-REDS 13
Skills & Training 1. Co-develop and co-ordinate courses with HE and other institutions 2. Across disciplines & faculties 3. Develop skills at multiple levels Entry level researchers Mid-career researchers Data skills for researchers in all disciplines 4. Appropriate skills and training skills are not all at the same place 5. Collaboration (e.g. UK Doctoral Training Centres) 6. Silos? 7. Training Specialist data and computational scientists Discipline focussed researchers with some insight into e-skills 8. Provide students with real-world research and development experience to encourage them to pursue a future career or advanced degree in digital science/s 25/03/2015 CHAIN-REDS 14
Further NICIS Principles Consultation, Stakeholders and Accountability Accountability of the NICIS sectors and NICIS as a whole to stakeholders. User ans Special Interest Groups. Stakeholder membership of Strategic Advisory Panel. Commence implementation of new NICIS on 1 April 2015. 25/03/2015 CHAIN-REDS 15
Common current phenomenon: Data sharing? Efficiency? Sustainability? The goal: CIaaS / commons Interoperability Software advantages Data sharing Cloud / grid 25/03/2015 CHAIN-REDS 16
e-research NICIS National data integrative enabler supporting MTSF Research Strategy SARIR, Overarching coordination & national strategy National (Tier1) Institutional (Tier2) Amalgamated, physically distributed cyber platform for data intensive research Data Networking Computing Crosscutting S&T Physical-Service Support Skills Application Phy Sci & Eng. Earth & Environment Materials & Manuf. Computing Services (CHPC +) Energy Networking Services (SANREN) Health, Bio & Food Astro Humans & Society Data Services (DIRISA +) Academia Science councils : RI s Skills & Training / VREs / Collaboration (Comp Sci-s, Data Sc, Stats, NA, Visualisation, Soft Eng,...) Data intensive research environments (SA_Grid, Cloud. FIM, SDN, perfsonar, VREs, ) Core Services Networked resources NICIS ecosystem 25/03/2015 CHAIN-REDS 17
What can and should e-infrastructure providers do to set innovation partnerships in new initiatives, so stimulating and facilitating access to advanced? e-infrastructures as platform for e-research Joint strategic planning and consultation Adapt e-i to each phase of the scientific process Develop e-i Commons Stakeholder involvement E-Infrastructures Collaboration: e-is, RIs and industry Coherent Policies and legal frameworks Access across disciplines and industry PPPs; Incubate e-i enhanced SMEs Trust, accountability and engagement Optimal use of appropriate Cyberinfrastructures: advanced services, VREs Collaborate internationally Changes in research institutions Change mind-set of institutions Culture change: RIs as well as institutions and industry Institutional leadership People! Training : Enhance synergies with training and education programmes for engineers, young scientists and technicians to work in/with RIs & e- RIs; Data literacy Professional development & recognition New careers 25/03/2015 CHAIN-REDS 18