Big Data Analytics in Science and Research: New Drivers for Growth and Global Challenges

Similar documents
Re-engineering Collaborative Mechanisms and Knowledge Networks to Accelerate Innovation for Alzheimer s

THE DIGITAL ECONOMY. BIAC OECD Business Day 7 November 2014 Panel on the Business Case for Innovation

Banning Garrett, PhD

THE BIOMEDICAL ENGINEERING TEACHING & INNOVATION CENTER. at Boston University s College of Engineering

Horizon Scanning. Why & how to launch it in Lithuania? Prof. Dr. Rafael Popper

Convergence of Knowledge, Technology, and Society: Beyond Convergence of Nano-Bio-Info-Cognitive Technologies

Science and Innovation Policies at the Digital Age. Dominique Guellec Science and Technology Policy OECD

g~:~: P Holdren ~\k, rjj/1~

European Commission. 6 th Framework Programme Anticipating scientific and technological needs NEST. New and Emerging Science and Technology

President Barack Obama The White House Washington, DC June 19, Dear Mr. President,

From the foundation of innovation to the future of innovation

Research Centers. MTL ANNUAL RESEARCH REPORT 2016 Research Centers 147

The Biological and Medical Sciences Research Infrastructures on the ESFRI Roadmap

Mission: Materials innovation

Artificial Intelligence and Robotics Getting More Human

FDA Centers of Excellence in Regulatory and Information Sciences

THE СONCEPT OF THE MOSCOW INTERNATIONAL FORUM FOR INNOVATIVE DEVELOPMENT OPEN INNOVATIONS * October 31 November 2, * as of April

Looking ahead : Technology trends driving business innovation.

Science of Science & Innovation Policy (SciSIP) Julia Lane

COMPUTATIONAL SOCIAL SCIENCE AND ADVANCED COMPUTING INFRASTRUCTURE: CHALLENGES AND OPPORTUNITIES

Advanced Manufacturing and Disruptive Technologies: Implications for Strategic Competitiveness

PROJECT FACT SHEET GREEK-GERMANY CO-FUNDED PROJECT. project proposal to the funding measure

Spatial Computing, Synthetic Biology, and Emerging IP Challenges. Jacob Beal November, 2010

Three Work/Technology Global Scenarios for 2050

Front Digital page Strategy and Leadership

Esri and Autodesk What s Next?

Making Precision Medicine A Reality: Molecular Diagnostics, Remote Health Status Monitoring and the Big Data Challenge

Overview of the NSF Programs

G7 SCIENCE MINISTERS COMMUNIQUÉ

Scientific Transparency, Integrity, and Reproducibility

Seoul Initiative on the 4 th Industrial Revolution

Reason and imagination are fundamental to problem solving and critical examination of self and others.

The New Imperative: Collaborative Innovation. Dr. Anil Menon Vice President, Corporate Strategy IBM Growth Markets

Health Care Analytics: Driving Innovation

ACADEMY PROGRAMMES 1 ACADEMY OF FINLAND 2016

KÜNSTLICHE INTELLIGENZ JOBKILLER VON MORGEN?

9 th AU Private Sector Forum

Innovation for Defence Excellence and Security (IDEaS)

Educating Leaders for the 21 st Century Role of Engineering

Institute of Physical and Chemical Research Flowcharts for Achieving Mid to Long-term Objectives

Hamburg, 25 March nd International Science 2.0 Conference Keynote. (does not represent an official point of view of the EC)

Introduction to Exponentials

Innovation system research and policy: Where it came from and Where it might go

Front Digital page Strategy and leadership

Reason and imagination are fundamental to problem solving and critical examination of self and others.

OECD WORK ON ARTIFICIAL INTELLIGENCE

Job Title: DATA SCIENTIST. Location: Champaign, Illinois. Monsanto Innovation Center - Let s Reimagine Together

TECHNOLOGY IMPACT ON ECONOMY AND SOCIETY

On the moral economy of digital infrastructures: Sharing, usability and publicness

Innovation Economy. Creating the. Dr. G. Wayne Clough President, Georgia Institute of Technology

ARTEMIS The Embedded Systems European Technology Platform

Modelling and Mapping the Dynamics and Transfer of Knowledge. A Co-Creation Indicators Factory Design

Meta Scientific Discovery Beyond Search CHAN ZUCKERBERG INITIATIVE

DATA AT THE CENTER. Esri and Autodesk What s Next? February 2018

Global Alzheimer s Association Interactive Network. Imagine GAAIN

Thoughts on Reimagining The University. Rajiv Ramnath. Program Director, Software Cluster, NSF/OAC. Version: 03/09/17 00:15

TRACING THE EVOLUTION OF DESIGN

The Tech Megatrends: 2018

Embracing a Digital Future Vanson Bourne research findings & benchmark methodology

Disrupt or be Disrupted: Research Findings from the CDO Project & Policy Implications

Disrupting our way to a Very Human City

MEGATRENDS THE TREND TOWARDS

TRUSTING THE MIND OF A MACHINE

The marginalisation of cross-cutting issues in CCUS Mission Innovation PRDs

Opening Science & Scholarship

Framework Programme 7

Sociotechnical Imaginaries in Research and Innovation Policy

UNIT 2 TOPICS IN COMPUTER SCIENCE. Emerging Technologies and Society

Corporate Mind 2015 Corporate Responsibility Report

Deep Learning Overview

Computer Science as a Discipline

Medicines Manufacturing in the UK 2017

Convergence, Grand Challenges, Team Science, and Inclusion

Climate Change Innovation and Technology Framework 2017

The Human Genome, Second Edition: A User's Guide (Elsevier Science In Society) By Julia E. Richards, R. Scott Hawley

The ERC: a contribution to society and the knowledge-based economy

1.INTRODUCTION: Scientific and Technological Revolutions and Global Industry 1890s- 2010s

Written response to the public consultation on the European Commission Green Paper: From

South Africa s 4th Industrial Revolution belongs to the youth

DIGITAL FINLAND FRAMEWORK FRAMEWORK FOR TURNING DIGITAL TRANSFORMATION TO SOLUTIONS TO GRAND CHALLENGES

Why Foresight: Staying Alert to Future Opportunities MARSHA RHEA, CAE, PRESIDENT, SIGNATURE I, LLC

TECHNICAL PROPOSAL FOR 3D PRINTING

AN INTERNATIONAL REVIEW OF INDUSTRIAL INNOVATION POLICIES:

A Balanced Introduction to Computer Science, 3/E

Manufacturing the Future: the 4th Industrial Revolution and the 2030 Development Agenda

Ken Buetow, Ph.D. Director, Computation Science and Informatics, Complex Adaptive ASU Professor, School of Life Science


Chapter 1: Introduction to Neuro-Fuzzy (NF) and Soft Computing (SC)

The future of work. Nav Singh Managing Partner, Boston McKinsey & Company

COUNCIL OF THE EUROPEAN UNION. Brussels, 9 December 2008 (16.12) (OR. fr) 16767/08 RECH 410 COMPET 550

Whiting School of Engineering Interdisciplinary Centers and Institutes. Education. Research. Translation.

Education and Outreach: Nanotechnology Activity Guides

Nagoya Protocol & Open Science Time for scientists to speak out! Philippe Desmeth MOSAICC, MOSAICS & TRUST Coordinator WFCC Past President

UKRI Artificial Intelligence Centres for Doctoral Training: Priority Area Descriptions

Imminent Transformations in Health

Advances and Perspectives in Health Information Standards

Tutorial: Open Data. Open Source EHR Summit & Workshop October 17-18, 2012 National Harbor, MD

A New Path for Science?

EPD ENGINEERING PRODUCT DEVELOPMENT

TRAINING THE NEXT GENERATION OF QUANTITATIVE BIOLOGISTS IN THE ERA OF BIG DATA

Transcription:

Big Data Analytics in Science and Research: New Drivers for Growth and Global Challenges Richard A. Johnson CEO, Global Helix LLC and BLS, National Academy of Sciences ICCP Foresight Forum Big Data Analytics and Policies 22 October 2012 johnsri@alum.mit.edu

Session 3: 4 Questions for Discussion Q1 Importance of data openness and interoperability for science and research, especially in biomedicine and health? Q2 Are current IPR regimes data-intensive scientific discovery? Q3 Do we still need scientific methods (and traditional domain scientists) in an era of big data analytics? Q4 How, and why, does this matter for policy?

Convergence of Biology with Physical Sciences & Engineering through Data and Data Analytics = the New Biology or Third Revolution in the Life Sciences Foundational trend in STI for next 20 years NAS (2010); MIT (2011)

Genomic Data is Increasing Faster than Computing Power Convergence of 3 key DATA DRIVERS with RESEARCH and ECONOMIC VALUE: (1)Sequencing + (2) Synthesis + (3) Reading AND Writing DNA Data Tools in the Life Sciences: Moore s Law on Steroids Gene Expression Data Sets (Nature 2012)

Life Sciences and Biomedical Research as an Information Science: Quantitative, Data-driven, Simulation-oriented, Predictive Science

Data and Convergence Driving the Future: Data Analytic Tools, Platforms, and Measurement for New Sources of Growth Technology Convergence, Data Analytics and Metrology as Interdependent Drivers (Agilent 2012) Energy and the Environment Advancing High Growth Economies Portable, Mobile and Out-of-Lab Nanotechnology Food Safety Personalized Medicine Single Cells and Microbiome Synthetic Biology Intern Executive Speaker Series 6

Beyond Interoperability, The Power of Interconvertibility: FROM PHYSICAL LIVING MATERIAL/DNA to DIGITAL DATA, and back 1 s and 0 s A, C, T, G s IT from Bits (Poste 2012) Programming: increasing ability to both Read and Write DNA Tools to Edit and Write Genomes: MAGE + CAGE (Church/Isaacs 2011, 2012) DNA Construction (analog to Read/Write; 1 s and 0 s manipulation) - Genetic Expression Operating Systems; Scale DNA construction engineering Data enables Decoupling: biological processes from evolutionbased descent and replication + design from fabrication

Big Data and Data Analytics Drive new 21 st Century Infrastructures and KNMs, and Create Opportunities for New Research, Better Health Outcomes, and Value Creation (Toward Precision Medicine: Building a Knowledge Network for Biomedical Research and New Taxonomy of Disease: NAS 2011)

The Creative Destruction of Medicine (Topol 2012)

Data Sharing, Disease Modeling and Biomarkers to Accelerate the Development

Big Data and Engineering Biology as the Transformative New Normal in the Life Sciences Driving New Sources of Growth Synthetic Biology - Standardization, Abstraction and Modularity Predictive Platforms for Engineering Biology and Predictable Integration of new Genetic Designs built on Massive Data an Engineering METHODOLOGY to construct complex systems and novel properties based on biological components (EU-US Task Force, June 2010)

Data-driven and Engineering Biology Value Proposition Increasingly Drives Science, New Sources of Growth, and our ability to meet societal Grand Challenges NAS 2011

Neuroscience a 21 st Century Frontier for Human Understanding and Grand Challenges Traversing the scales at all levels in understanding the brain from molecular and cellular to systems neurons (100 Billion)/synapses (150 Trillion), and neural signaling Human Connectome Project = mapping neural networks with >1 million more connections than the genome has letters of DNA, and linking all this to other life experience data sets

ENCODE: the Encyclopedia of DNA Elements Big Data, Data Analytics, and Big Science increasingly change how we do science (Sept. 2012)

The Plasticity of IPR/Open Science Meanings and lots of rethinking in different domains about IPR, Openness and Scientific Research IPR and Competing Visions of Openness Open Science (Public domain; BioBricks library/bbf) v. Open Source (IPR-driven; GPL, BSD, CC) v. Open Standards v. Open Development v. Open Access (including reuse and sharing public-funded data) v. Open Innovation (depends on strong, well-functioning IPR system) Innovative New Thinking e.g., Semi-commons as a new lens to view Data interacting common and private uses that are dynamic/scalable over the same resources and that can adjust through contracting and other mechanisms Knowledge Networks and Markets (KNMs) and Knowledgebased Capital KBC) major OECD initiatives on-going Growing Counter-intuitive View that Role of IPR Increasingly Important as a Tool to Promote Openness, Transparency, and Diffusion, e.g., Algorithms, Data Exchanges, Tools and Re-use

Growing Linkage of Data-intensive Science, IPR, and New Models of Innovation: Big Data Analytics Intersect with Open Innovation, Multi-directional S&T, University-Industry Partnering, New Business Models, Forward-looking IPR, and New Public-Private Collaborative Mechanisms to Enable Cutting-edge Research and Innovation

The Fourth Paradigm, the Internet of Things, Automated Data Extraction Methods, and Big Data Analytics the Need for a New Generation of Scientific computing tools and platforms to manage, visualize and analyze Big Data for Research (Gray 2009)

Wide Range of New Data Analytic Convergence Challenges with Policy Implications (Gray 2009) Risks to Scientific Research from (Bad) Data Analytics? - Jeopardize reproducibility - Retard pace of research - Produce poorly written code/bad algorithms on which science relies - Create serious errors in scientific outcomes, and the interpretations of them

New Day-to-day Science Research Implications of Big Data: Data Analytics Challenges Which data to keep in what format? for how long? What about emergent properties? resulting from elaborate networks of interactions and data patterns How to deal with data distributed across many locations, formats, scales, etc., and merge them? How to model large complex data, and derive valuable knowledge from analytics/models? How to infuse data into complex computations to enable simulations of predictive value? How to deal with different kinds of big data (temporal, spatial, dimensional, heterogeneous) Massive data High-dimensional data Multi-modal data Real-time and Streaming data

In a data-driven science era, should we still fund, incentivize and value Empirical, Theoretical, Model-based Approaches to Scientific Discovery? Is Popper s scientific method paradigm outdated? I believe that math is trumping science. What I mean by that is you don't really have to know why, you just have to know that if a and b happen, c will happen. Vivek Ranadivé, entrepreneur and CEO, financialdata software company TIBCO (2011) With enough numbers, the data speak for themselves Chris Anderson, Editor-in-Chief, Wired, The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (2008) All models are wrong, and increasingly you can succeed without them. Peter Norvig, Director of Research, Google The numbers have no way of speaking for themselves.data-driven predictions can succeed and they can fail. It is when we deny our role in the process that the odds of failure rise. Before we demand more of our data, we need to demand more of ourselves. Nate Silver, The Signal and the Noise: Why So Many Predictions Fail but Some Don t (2012) The invalid assumption that correlation implies cause is probably among the two or three most serious and common errors of human reasoning. Stephen Jay Gould, American evolutionary biologist (1981)

Thank you! Contact Information -- Richard A. Johnson CEO, Global Helix LLC richard.johnson@globalhelix.net MIT johnsri@alum.mit.edu