COMPARE 2012 Comparative Empirical Evaluation of Reasoning Systems

Similar documents
FLoC/SAT 10 Edinburgh, Scotland, UK

Carsten Sinz Nina Amla João Marques Silva Emmanuel Zarpas Daniel Le Berre Laurent Simon

Software Quality Days 2019 January 15 th -18 th 2019, Vienna, Austria

8th Workshop on Algorithmic Approaches for Transportation Modeling, Optimization, and Systems

Christoph Hochreiner Stefan Schulte ZEUS th ZEUS Workshop, ZEUS 2016, Vienna, Austria, January 2016 Proceedings

24 Challenges in Deductive Software Verification

HIGH PERFORMANCE COMPUTING IN FLUID DYNAMICS

Cristian Mattarei, PhD

SafeNano Norway in from concept to reality?

EUROPEAN MANUFACTURING SURVEY EMS

SECTEUR Ascertaining user needs

Lecture Notes in Artificial Intelligence. Lecture Notes in Computer Science

INTERNATIONAL STANDARD

Advanced Information and Knowledge Processing

RecTour nd Workshop on Recommenders in Tourism. Proceedings. Co-located with the 11th ACM Conference on Recommender Systems (RecSys 2017)

13th International Symposium on Component Based Software Engineering (CBSE-2010)

Public Consultation: Science 2.0 : science in transition

EBA Master Class The Benefits of International Collaboration. Steve Morgan Co-Chair, EBA Benchmarking Group

Nature makes polysaccharides, EPNOE turns them into products

Common Features and National Differences - preliminary findings -

1. Introduction. defining and producing new materials with advanced properties, or optimizing industrial processes.

Information Sheet. Background. Cadarache-Château France May 2018 with optional field & nuclear facilities visits - 17 th May 2018

H2020 Excellent science arie Skłodowska-Curie Actions. Your research career in Europe. 17 November 2015

Modeling and Validation

Implementing the International Safety Framework for Space Nuclear Power Sources at ESA Options and Open Questions

Exploring Predictability of SAT/SMT Solvers

THE 12 COUNTRIES IN OUR SAMPLE

COOP 2016: Proceedings of the 12th International Conference on the Design of Cooperative Systems, May 2016, Trento, Italy

Architecture Design and Validation Methods

Lecture Notes in Computer Science

Document downloaded from: This paper must be cited as:

Communications in Computer and Information Science 85

International Standard

SHAPES 3.0 The Shape of Things

ASSESSMENT OF DYNAMICS OF THE INDEX OF THE OF THE INNOVATION AND ITS INFLUENCE ON GROSS DOMESTIC PRODUCT OF LATVIA

Changes to university IPR regulations in Europe and their impact on academic patenting

Task 26 Extension Proposal

Towards a New IP Consciousness in Universities and R&D Institutions: Case Show

Technical Meeting on Heat Transfer, Thermal-Hydraulics and System Design for Supercritical Water Cooled Reactors

Oliver Kopp Jörg Lenhard Cesare Pautasso ZEUS th ZEUS Workshop, ZEUS 2017, Lugano, Switzerland, February 2017 Proceedings

Re-Engineering the Scientific Publishing Process for the Internetworked Global Academic Community

COMPASS3. Marco Bozzano - Fondazione Bruno Kessler Harold Bruintjes - RWTH Aachen University. TEC-ED & TEC-SW Final Presentation Days

CURRICULUM VITAE. Oct 2005 Dec MSc in Computer Science. Faculty of Mathematics,

ICT for the Next Five Billion People

ASQF e.v. (ed.) Arbeitskreis Software-Qualität und -Fortbildung e.v. Software Quality in Service-Oriented Architectures

Munkaanyag

The JEF-2.2 Nuclear Data Library

ICAR relations to ISO

Ref: Overview of the implementation of the TRIPS Agreement (patents) in the EPC contracting states and observer countries

Components for virtual environments Michael Haller, Roland Holm, Markus Priglinger, Jens Volkert, and Roland Wagner Johannes Kepler University of Linz

This document is a preview generated by EVS

S-BPM in the Production Industry

CDP-EIF ITAtech Equity Platform

MECHANICAL DESIGN LEARNING ENVIRONMENTS BASED ON VIRTUAL REALITY TECHNOLOGIES

First Workshop on Business Process Management and Ontologies (BPMO 2016)

Understanding Knowledge Societies Report of UNDESA/DPADM. Measurement Aspects. Irene Tinagli Tunis, 17 Nov World Summit on Information Society

Debugging SENT Automotive Buses with an Oscilloscope APPLICATION NOTE

Power Measurement and Analysis Software

developments from early material design stage. This chapter collects seven papers on material and their property related research.

Information Systems Frontiers CALL FOR PAPERS. Special Issue on: Digital transformation for a sustainable society in the 21st century

THE WASTE AND THE BACKYARD

LiM - Lasers in Manufacturing 2011 Part 1

SR&ED International R&D Tax Credit Strategies

Advanced Test Equipment Rentals ATEC (2832)

Job opportunities for scientists and engineers

Software verification

Science & Technology Cooperation Workshop

1 Publishable summary

OECD s Innovation Strategy: Key Findings and Policy Messages

Newsletter: Standardisation Efforts on Industrial and Service Robots

DAB+ Digital Radio. Global update. Vasant Venkatramani, WorldDAB IFTV Broadcast Istanbul, November 2018

Verifying Power Supply Sequencing with an 8-Channel Oscilloscope APPLICATION NOTE

BRIDGING THE GAP BETWEEN PRODUCT DESIGN AND PRODUCT ENGINEERING

Lecture Notes in Computer Science. Edited by G. Goos, J. Hartmanis and J. van Leeuwen

Visual Triggering. Technical Brief

Olympus RAW File Import Plug-in. Instructions. RAW File Import Plug-in Utility for Olympus Digital Cameras

DOTSEVEN Towards 0.7 Terahertz Silicon Germanium Heterojunction Bipolar Technology FP7 Contract Number:

On the Benefits of Enhancing Optimization Modulo Theories with Sorting Jul 1, Networks 2016 for 1 / MAXS 31

Innovation in Europe: Where s it going? How does it happen? Stephen Roper Aston Business School, Birmingham, UK

A German study about Research Software Engineers (RSEs) The people writing software for Science

Dr. Eberhard Bessey. Daimler AG Group Research and Advanced Engineering (GR&AE)

Oslo IMIA Board and General Assembly Meetings August 27-28, GA Agenda Item: 15 Board Agenda Item: 15. August 2011

Conference on Patent Statistics for Policy Decision Making

Lecture Notes in Computer Science

Christina Miller Director, UK Research Office

Twinning cases selected Deliverable D3.2 INNO-Partnering Forum

EU Ecolabel EMAS Environmental Technology Verification (ETV) State-of-play and evaluations

Whole of Society Conflict Prevention and Peacebuilding

TAIC PART 2007 and Mutation 2007 Special Issue Editorial

Durham Research Online

Committee on Quality and Regulations. Liverpool, 7 October 2014 Wim Huisman

PPP InfoDay Brussels, July 2012

OBSTACLES AND OPPORTUNITIES FOR THE PECS INDUSTRY TO PARTICIPATE IN ESA PROGRAMMES SPACE4SME PROJECT. Prague April 25, 2008

CHAPTER 2 METHODOLOGY AND ORGANISATION OF THE STUDY

When is it Time to Transition to a Higher Bandwidth Oscilloscope?

Confidence in SKYLON. Success on future engine test would mean "a major breakthrough in propulsion worldwide"

RAPS ECMWF. RAPS Chairman. 20th ORAP Forum Slide 1

High Performance Computing Scientific Discovery and the Importance of Collaboration

The present volume was organized into 6 thematic parts.

Keysight Technologies MATLAB Data Analysis Software Packages

Transcription:

(Eds.) COMPARE 2012 Comparative Empirical Evaluation of Reasoning Systems Proceedings of the International Workshop June 30, 2012, Manchester, United Kingdom

Editors Karlsruhe Institute of Technology Institute for Theoretical Informatics Am Fasanengarten 5, 76131 Karlsruhe, Germany Email: klebanov@kit.edu Karlsruhe Institute of Technology Institute for Theoretical Informatics Am Fasanengarten 5, 76131 Karlsruhe, Germany Email: beckert@kit.edu Institute for Formal Models and Verification Johannes Kepler University Altenbergerstr. 69, 4040 Linz, Austria Email: biere@jku.at Department of Computer Science University of Miami P.O. Box 248154, Coral Gables, FL 33124-4245, USA Email: geoff@cs.miami.edu Copyright 2012 for the individual papers by the papers authors. Copying permitted for private and academic purposes. This volume is published and copyrighted by its editors.

Preface This volume contains the proceedings of the 1st International Workshop on Comparative Empirical Evaluation of Reasoning Systems (COMPARE 2012), held on June 30th, 2012 in Manchester, UK, in conjunction with the International Joint Conference on Automated Reasoning (IJCAR). It has become accepted wisdom that regular comparative evaluation of reasoning systems helps to focus research, identify relevant problems, bolster development, and advance the field in general. Benchmark libraries and competitions are two popular approaches to do so. The number of competitions has been rapidly increasing lately. At the moment, we are aware of about a dozen benchmark collections and two dozen competitions for reasoning systems of different kinds. It is time to compare notes. What are the proper empirical approaches and criteria for effective comparative evaluation of reasoning systems? What are the appropriate hardware and software environments? How to assess usability of reasoning systems, and in particular of systems that are used interactively? How to design, acquire, structure, publish, and use benchmarks and problem collections? The aim of the workshop was to advance comparative empirical evaluation by bringing together current and future competition organizers and participants, maintainers of benchmark collections, as well as practitioners and the general scientific public interested in the topic. We wish to sincerely thank all the authors who submitted their work for consideration. All submitted papers were peer-reviewed, and we would like to thank the Program Committee members as well as the additional referees for their great effort and professional work in the review and selection process. Their names are listed on the following pages. We are deeply grateful to our invited speakers Leonardo de Moura (Microsoft Research) and Cesare Tinelli (University of Iowa) for accepting the invitation to address the workshop participants. We thank Sarah Grebing for her help in organizing the workshop and compiling this volume. June 2012 III COMPARE 2012

Program Committee Christoph Benzmüller Dirk Beyer Vinay Chaudhri Koen Claessen Alberto Griggio Marieke Huisman Radu Iosif Rosemary Monahan Micha l Moskal Jens Otten Franck Pommereau Sylvie Putot Olivier Roussel Albert Rubio Aaron Stump Free University Berlin, Germany University of Passau, Germany Johannes Kepler University Linz, Austria SRI International, USA Chalmers Technical University, Sweden Fondazione Bruno Kessler, Italy University of Twente, the Netherlands Verimag/CNRS/University of Grenoble, France National University of Ireland Maynooth Microsoft Research, USA University of Potsdam, Germany University of Évry, France CEA-LIST, France CNRS, France Universitat Politècnica de Catalunya, Spain University of Iowa, USA University of Miami, USA Program Co-Chairs Johannes Kepler University Linz, Austria University of Miami, USA Organising Committee Sarah Grebing Additional Referees Sarah Grebing COMPARE 2012 IV

Table of Contents Abstracts of Invited Talks Regression Tests and the Inventor s Dilemma......................... 1 Leonardo de Moura Introducing StarExec: a Cross-Community Infrastructure for Logic Solving......................................................... 2 Aaron Stump,, and Cesare Tinelli Contributed Papers Evaluating the Usability of Interactive Verification Systems............ 3 and Sarah Grebing Broadening the Scope of SMT-COMP: the Application Track........... 18 Roberto Bruttomesso and Alberto Griggio A Simple Complexity Measurement for Software Verification and Software Testing.................................................. 28 Zheng Cheng, Rosemary Monahan, and James Power Benchmarking Static Analyzers..................................... 32 Pascal Cuoq, Florent Kirchner, and Boris Yakobowski The 2nd Verified Software Competition: Experience Report............ 36 Jean-Christophe Filliâtre, Andrei Paskevich, and Aaron Stump On the Organisation of Program Verification Competitions............. 50 Marieke Huisman,, and Rosemary Monahan Challenges in Comparing Software Verification Tools for C............. 60 Florian Merz, Carsten Sinz, and Stephan Falke Behind the Scene of Solvers Competitions: the evaluation Experience.. 66 Olivier Roussel Author Index................................................ 78 V COMPARE 2012

COMPARE 2012 VI