Scholarly Communica/on Mee/ng Pi4sburgh, January 14-15, 2013

Similar documents
Digital Repositories, Publishing and Scholarship

Preparing the Future Workforce for Careers in Science and Engineering. Steven I. Gordon

EU- US Industry and Academic Co- opera7on on Smart Ci7es. Haydn Thompson THHINK Group

Introduc)on to So,ware Engineering

Research Crea*on. an introduc+on

Why Are Data Sharing and Reuse So Difficult?

UCLA Presentations. Title. Permalink. Author. Publication Date. If Data Sharing is the Answer, What is the Question?

Rafael C. Alvarado Instruc4onal Technology Adviser The Instruc4onal Technology Group University of Virginia

Introducing Foresight Alliance

Cyberinfrastructure Uptake Some observations over 20 years. Nancy Wilkins-Diehr San Diego Supercomputer Center

Melanie Agnew, University of Wisconsin, U.S.A. Rhonda Friesen, University of Manitoba, Canada Douglas Proctor, University of Melbourne, Australia

CASI-F: a common framework for the assessment and management of sustainable innovation

Coopera've Research, Intellectual Property & Technology Transfer

Reproducibility: Gold or Fool s Gold in Digital Social Research?

Building an Infrastructure for Data Science Data and the Librarians Role. IAMSLIC, Anchorage August, 2012 Linda Pikula, NOAA and IODE GEMIM

INFO/CS 4302 Web Informa6on Systems

Human Computer Interac6on in the early 21st century: a stable discipline, a nascent science, and the growth of the long tail.

SHARE UPDATE. CNI Membership Meeting, 8 December 2014

International Symposium on Knowledge Communities 2012

University of Massachusetts Amherst Libraries. Digital Preservation Policy, Version 1.3

Data is the New Currency. SLA- AGC 2014 Sayeed Choudhury

WIA Repeater and Beacon Licence Recommenda3on Policy. December Background

Computational Reproducibility in Medical Research:

Service Design, Social Innovation and Public sector innovation!

The Cyberne*cs Thought Collec*ve Project

Transferability of Data-Related Roles and Competencies

European Associa.on for Biometrics

Reproducibility Interest Group

Crea%ng Value Through System Thinking

The ARK Iden+fier Scheme at Ten Years Old

STRATEGIC FRAMEWORK Updated August 2017

Sustaining Domain Repositories for Digital Data: A Call for Change from an Interdisciplinary Working Group of Domain Repositories

Dr. Salsabeel Alabbady 1

Thinking. Design. Principles of. Thinking Like a Designer From Idea to Business

What Is Technical Assistance?

GAME THEORY. By: Rishika and Nithya 12/04/13

The Stewardship Gap INTRODUCTION

Inventor-Driven Product Development

European Coordination for Coastal HF Radar: EuroGOOS HF Radar Task Team

Scientific Transparency, Integrity, and Reproducibility

Provide input and learn more about the study at

Microsimulation, Big Data and Predictive Analytics

Understanding Microgrids as the Essen1al Architecture of Smart Energy

How Big- Web and DevOps Changes Academic Programs in System Administra>on

Philosophy of data- intensive science. Sabina Leonelli Department of Sociology, Philosophy and Anthropology & Egenis University of Exeter

Analysis of Informa.on - III

AAAI 2017 Community Mee2ng and Business Mee2ng. Subbarao Kambhampa2 President, AAAI 2/8/2017

Digitisation Plan

Model-Driven Software Engineering -Promises and Challenges

Finding Articles Finding Patents Finding Standards

How New Computational Methods are Changing Social Science

Open Data, Open Science, Open Access

GOLEM Integrated Microelectronics Solutions GmbH Serguei Golovanov, PhD, Dipl.Eng

Horizon 2020 R&I Opportuni2es

Introduction to Data- PASS

The Next Generation Science Standards, AP Physics 1 and 2, and the Revised MCAT: The New Face of STEM Education

Institutional Repositories: A Disruptive Response To an Established Paradigm

New landscape of compu)ng Personalized and targeted compu)ng

The Europeana Data Model: tackling interoperability via modelling

What is AI? Ar)ficial Intelligence. What is AI? What is AI? 9/4/09

SI Digital Libraries, Winter 2008

In This Issue. Seasons December, Pilvi Kolk. NSCF News: Annual Conference NSCF Scholars travelling reports

Comunicare la Resilienza ed I Cambiamen8 Clima8ci. Piero Pelizzaro, Kyoto Club

University of Kansas. The University of Kansas Libraries

Inside the Psychology of the Agent Informa)on, Associa)on, A/rac)on and Repulsion

Fundamentals Introduc1on to Technology Transfer and the FLC

Principles for Responsible Innovation. Building trust and trustworthiness in business innova3on. Consulta)on Dra-

"Radio Navigation and Radio Communication - Synergies and Conflicts"

A Solution for Public Access

How CRISs are key to the future of research libraries INCONECSS April 2016 Berlin

Shared Networks and the Evolu;on towards 5G

A MODEL OF SCHOLARLY COMMUNICATION IN TOURISM AND AN OPEN ACCESS INITIATIVE

GOALS AND FOUNDATION. land-grant university with interna2onal dis2nc2on. challenges and opportuni2es facing higher educa2on ins2tu2ons.

Guidelines for the Professional Evaluation of Digital Scholarship by Historians

Advanced Game AI. Level 6 Search in Games. Prof Alexiei Dingli

Harnessing the Potential of Data Scientists and Big Data for Scientific Discovery

ADOPTING DIGITAL TWIN FOR INTEGRATED DESIGN AND MANUFACTURING TO REDUCE PROCESS TIME

For more information about how to cite these materials visit

RECOMMENDATIONS. COMMISSION RECOMMENDATION (EU) 2018/790 of 25 April 2018 on access to and preservation of scientific information

Research Data Preservation in Canada A White Paper

Philip and Sewall Wright: The Inven5on of Instrumental Variables Regression

The City of the Future Living Lab Sauro Vicini

ITIL Manifesto. Turning Grouped Ideas into Principles. Deadline for submissions: 31 May 2015

Bamboo Technology Proposal (Public)

IAIP: INTELLIGENT SYSTEMS APPLIED TO INDUSTRIAL PROCESSES SPECIAL SESSION AT INTELLI 2017

Developing Science Literacy by Involving Youth in Journalism

A Comparison of Secure Two- Party Computa7on Frameworks

Introduction. amy e. earhart and andrew jewell

Digital Preserva.on: what I wish I knew before I started William Kilbride

Simula'on of e-astrogam

Health Technology Assessment (HTA): A Primer for Procurement Professionals

Advanced Cyberinfrastructure for Science, Engineering, and Public Policy 1

Lisa Pratt, MEPAG Chair Report to PSS March 10-11, 2016

Tools for Academic Research: Resolving the Credibility Crisis in Computational Science

Computational Methods for Analysis of Footwear Impression Evidence

ICSU World Data System Strategic Plan Trusted Data Services for Global Science

High Performance Computing Systems and Scalable Networks for. Information Technology. Joint White Paper from the

Towards Intelligent business process modelling

FEMA REGION X March 2017 COMMUNICATIONS EXERCISE PLAN

The role of MNEs for innova1on networks. Petra Ahrweiler Michel Schilperoord UCD Innova1on Research Unit IRU

Transcription:

Scholarly Communica/on Mee/ng Pi4sburgh, January 14-15, 2013 Par%cipants Ron Larsen, U Pi2sburgh Steve Griffin, U Pi2sburgh Bill Arms, Cornell U Johan Bollen, Indiana U Fran Berman, Rensselaer Poly Bob Pego, Carnegie Mellon U Micah Altman, MIT Libraries Greg Crane, TuGs U Spencer Keralis, U North Texas Josh Greenberg, Sloan Founda%on Victoria Stodden, Columbia U Tom Moritz, consultant Ed Fox, VPI Chuck Henry, CLIR Carole Goble, U Manchester Unable to a2end due to travel/other circumstances: Lewis Lancaster, U Cal, Berkeley Don Waters, Mellon Founda%on Carl Lagoze, U Michigan Sandy Paye2e, Cornell U John Unsworth, Brandeis U

Mee/ng Goals The mee%ng goals were to iden%fy new means and opportuni%es for enhancing scholarly communica%on across disciplines and to explore new models for documen%ng and dissemina%ng a comprehensive record of computa%onal and data- centered research

New Research and Study Approaches Associated with Data and Computa/on theore/cal/analy/cal (new theories are formulated and proven using a priori axioms and defini%ons) empirical/observa/onal (inquiry based on detectable and measurable evidence; hypotheses driven and ogen aimed at theory building which in turn can yield new hypotheses and iden%fy poten%al new theories) computa%onal (typically, large- scale computa%on applied to mathema%cal models using high performance computers to produce simula%ons of physical phenomena ogen displayed in the form of scien%fic visualiza%ons [e- Science]) data- driven (analysis of very large numerical or textual data sets with the goal of elucida%ng pa2erns or discovering new correla%ons or rela%onships from which hypotheses might be constructed) data- intensive (inquiry involving the explora%on and use of large, diverse data stores containing complex data objects with the goal of advancing interdisciplinary and domain scholarship) Recent developments that can also be considered as extensions or new forms of theore%cal and empirical methods.

New computa/on and data- centered modes of inquiry and experimenta/on are onen referred to as cyberscholarship or digital scholarship

Digital Scholarship Exploring New Modes Of Inquiry In The Digital Era

Repository Development Trends %me Very large repositories and global data infrastructures (2010) federa/on at data level via seman/c web tech- nologies, linked open data principles interlinked data over the web using URIs, RDF, links, vocabularies, rela%ons; abstrac%ons; graphs new developments Ins/tu/onal and disciplinary repositories (2000) federated across repositories [DSpace; Fedora; eprints] Digital repositories (1990s) Digital Libraries and other repositories Scien/fic DBs interoperability across repositories via OAI- PMH compound object packaging formats, etc.. func%onal individual repositories; metadata catalogues and diverse informa%on objects Middleware services layer sogware between the network and the applica%ons providing authen%ca%on, iden%fica%on, authoriza%on, directories, security ITC Infrastructure Processors, memory, network processors, networks, storage, codes, compilers, tools, algorithms, sogware libraries increasing capaci%es and capabili%es %me

Complex Networks of Repositories Based on Linked Open Data and Seman/c Web Technologies Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. h2p://lod- cloud.net/

h2p://datahub.io/dataset/fishes- of- texas

Mee/ng Presenta/ons and Group Discussion Foci new methodologies, reach and affordances of digital scholarship technologies and ac%vi%es to capture of a more complete record of stages in the scholarly research workflow effec%ve frameworks (exis%ng and proposed) to accelerate the repurposing and reuse of open data resul%ng from scholarly work and research robust document models for presen%ng and bundling" the processes, resources, outputs and poten%al impacts of scholarly work new means for dissemina%on to increase the diffusion and reach of new concepts and findings accurate measures to ensure appropriate and fair a2ribu%on, acknowledgement, credit and reward for those involved in carrying out the work

A Persistent and Recurring Theme - The Burden of Evidence - Defining features of science include repeatability and reproducibility. Repeatability refers to the ability to duplicate an experiment under the same condi%ons many %mes and obtain the same result. Reproducibility refers to the ability for others to replicate the work in different environments and obtain the same results, selng the stage for extending the work in new direc%ons. These requirements hold for theore%cal and empirical research and apply to the formal, natural and social sciences. Replica%on of results using proven, rigorous methodologies confirms the veracity of a research process and outcome. Carole Goble Victoria Stodden Tom Moritz

Some necessary condi/ons for reproducibility access to a comprehensive record of the research process and scholarly workflow including: process records: algorithms, sogware pipelines and versioning, datasets and transforma%ons, storage formats and protocols, event tracing,... resource descrip%ons: journals, logs, tools, methods, dialog, collabora%ve ac%vi%es and external contribu%ons,... intermediate forms: temporary models, concept changes, recursion points, sogware versions, external dialogs and contribu%ons... workflow ar%facts: transcrip%ons, transla%ons, annota%ons, steps taken to acknowledge distribu%on of effort, a2ribu%on and credit,...

Working Group Assignments Charge to Break Out Group #1 What can be done to effec%vely capture, document, and prepare the informa%on flow associated with each stage of a research project or scholarly work so that they can become part of a larger, global knowledge and scholarly communica%ons infrastructure. Charge to Break Out Group #2 Digital scholarship ogen involves new types of informa%on objects, data analy%c processes, resources, tools and heuris%c representa%on of findings that cannot be accurately or completely described or communicated in tradi%onal print or in print + electronic venues. Seman%c access is also inherently limited. What new expressive forms, document models, prac%ces and venues might help remedy this situa%on.

Charge to Break Out Group #1 What can be done to capture, document, and prepare elements of research and scholarly workflows so that essen8al components can become part of the overall repor8ng of the effort and eventually be integrated into global knowledge and scholarly communica8ons infrastructure. Points to consider: The primary researchers will be disinclined to do this for mul%ple reasons. Is there a possibility of automa%ng this? If so, would this be most tractable during the individual stages of the research workflow or a>er the project is complete. What might this entail? Should there be meta- research ac/vi/es as part of major research projects? The purpose would be to document the research process and prepare resources and ar%facts to enable reproducibility. Is this already being done to a certain degree in some disciplinary areas? Why? What has been the benefit to the larger scholarly communi%es involved?

A Very General Scholarly Research Workflow Model - example of simple and tradi/onal form - mix of dialog, data and resources from individuals, the web, libraries, archives, etc. journal ar%cles, monographs, conference papers (copyright) t Libraries, Academic Departments, Individuals,... primarily informal processes primarily formal processes Informa%on flows into and out of the project at each stage ac/vity inspira%on, explore, discover area of interest formulate problem, design research, collect data conduct research, analyze results prepare findings, disseminate results data discovered, referenced, accessed, gathered, transformed, analyzed, presented high low

Current Scholarly Research Workflow and Communica/on Model recently emerging global data and resources infrastructures data and research cyberinfrastructure: digital libraries, scien%fic databases, reports, publica%ons, ETDs, sogware & code libraries, executable documents, 1 st and 2 nd genera%on repositories (linked open data; seman%c web technologies...), processing, storage and grid services conversant/discursive web: social media, blogs, chat rooms, project sites, commentaries,... hos%ng ins%tu%ons (libraries, archives, other content and service providers) t Informa%on flows into and out of project subscrip%on & open access journals, self- published documents & pre- prints, hybrid dissemina%on models ac%vity: data: inspira%on, explore discover area of interest discovered, studied, formulate problem, design research, collect data conduct research, analyze results prepare findings, disseminate results accessed, collected, transformed, analyzed, prepared, presented and, some%mes: loosely organized ac%vi%es to collect and prepare ar%facts for future repurposing and reuse by others [event tracing, versioning, logs, journals, data documenta%on, intermediate forms, temporary models, concept changes, recursion points, transcrip%on, transla%on, annota%on,...]

A New General Model for Scholarly Communica/on Infrastructure Based on Scholarly Workflow scholarly communica%ons layer: dynamic research reports with detailed descrip%ve informa%on of workflow, methods and concepts as well as access to sogware, data and other experimental assets, provenance and cita%on linkages, etc. mee%ng community- endorsed prac%ces for presenta%on, access, preserva%on and archiving global data and research cyberinfrastructure: research data infrastructures, digital libraries, scien%fic databases, reports, publica%ons, ETDs, sogware & code libraries, executable documents, 1 st and 2 nd genera%on repositories (linked open data; seman%c web technologies...), processing, storage, cloud and grid services conversant/discursive web: social media, blogs, chat rooms, project sites, commentaries,... managing and services ins%tu%ons evalua%on mechanisms t workflow informa%on management mechanisms Informa%on Flow Into and From Workflow Stages inspira%on, explore, discover area of interest formulate problem, design research, collect data conduct research, analyze results prepare findings, disseminate results prepare and deliver research assets for reuse meta- research ac%vi%es collec%ng and preparing workflow ar%facts for reference, repurposing and reuse

New Roles for Libraries, Archives and Service Providers In this model the role of Libraries evolves from one of holders and providers of knowledge resources to one of being an ac%ve partner in the research process. Libraries and librarians provide tools and exper%se that expedite research and scholarship. Libraries have the ins%tu%onal structure and many of the resources needed to advance and sustain scholarly workflows.

Summary Recommenda/ons from Breakout Group #1 capture a comprehensive record of research process and scholarly produc%on to support verifica%on and reproducibility of results create full research process record: logs, applica%ons, methods, datasets, dialog, collabora%ve ac%vi%es prepare workflow ar%facts for repurposing and reuse develop a protocol model for scholarly output that allows for modularity, distributes effort and credit, and facilitates democra%c access develop methods for managing release of components of scholarly output from all stages of the scholarly workflow meta- research process necessary to accomplish this

Charge to Break Out Group #2 Digital scholarship o>en involves new types of informa8on objects, data analy8c processes, resources, tools and heuris8c representa8on of findings that cannot be accurately or completely described or communicated in tradi8onal print or print + electronic venues. Seman8c access is also inherently limited. What new expressive forms, document models, prac8ces and venues might help remedy this situa8on. Points to consider: The primary researchers may be disinclined to do this for mul%ple reasons. For example: extra work in prepara%on of their project results; ins%tu%onal pushback from publishing in non- conformist or new experimental venues; complexi%es introduced into review processes.

Challenges for Document Models: Describing Complex Projects Research Informa%on Network and Bri%sh library Pa2erns of informa%on use and exchange: case studies of researchers in the life sciences h2p://www.rin.ac.uk/system/files/a2achments/pa2erns_informa%on_use- REPORT_Nov09.pdf

A Richer Document Model: One Example Stefan Gradmann

New Tools Can Help

Dynamic Project Sites are Complex Scholarly Documents www.ecai.org h2p://ocw.mit.edu/ans7870/21f/21f.027/home/index.html

Summary Recommenda/ons from Breakout Group #2 Need a new bundled modular research document model in which elements are linked seman%cally, released when ready and capable of being recombined at any %me and in different environments Provides a variety of presenta%on forms to accommodate disciplinary domain requiring different expressive forms Facilitates its own automated retrieval Gives direct access to datasets, tools and other workflow elements An%cipates future needs for storage, access and use (cura%on, stewardship, provenance issues) Capable of aggrega%on at the component level with other research documents Annota%on and rela%onship friendly; indefinite versioning Greg Crane Carole Goble

Benefits from Richer Document Models One Example: Cultural Historical Research Cultural historical research means understanding 'possible pasts', the facts, events, material, social and psychological influences and mo%va%ons. It lives from understanding contexts, by pulling together bits and pieces of related facts from disparate resources, which can typically not be classified under subjects in an obvious way. It lives from taking into account all known facts. Under these condi%ons, the global network of knowledge can reveal deep stories built out of an immense number of concatenated primary facts, and a thing impossible for a tradi%onal library. Mar%n Doerr - Principal Researcher, Forth - Hellas

Next Mee/ng TBA: Focus on the Humani/es Thank You Mee/ng Web Site is Under Construc/on: Comments and sugges/ons are welcome! Contact Steve Griffin at sgriffin@pi4.edu