Digital Humanities, Computational Linguistics, and Natural Language Processing

Similar documents
Have Elisha and Emily ever delivered food? No, they haven t. They have never delivered food. But Emily has already delivered newspapers.

Faculty of Humanities and Social Sciences

Supplementary questionnaire on the 2011 Population and Housing Census BELGIUM

Methodology for Agent-Oriented Software

Jeu Find your best friend! Niveau Lieu Classroom Vocabulaire Classe! Grammaire Durée >15min Compétence Expression orale Matériel Doc

Sharing and distributing environmental data: Study cases in the French and Brazilian Amazonian context.

Funding line 1: Cultural Heritage and History

Creating Scientific Concepts

SIZE OF THE AFRICAN CONTINENT COMPARED TO OTHER LAND MASSES

v Pairing Instructions for: GENERAL MOTORS REPLACEMENT FLIP KEYS

J A M E S C O S U L L I VA N J O S U L L I VA N. O R G U N I V E R S I T Y O F S H E F F I E L D

Gis-Based Monitoring Systems.

Pser G uide oduct Manual

LEVEL 1 WORLD LANGUAGE UNIT 3 EXAM (FRENCH) Name: Date: Period:

Scientific communication in the Humanities. Ida Raffaelli Department of linguistics, University of Zagreb

~. a.\\ l. å ~ t 1 ~ ~, Department of Cultural Anthropology and Ethnology

Common Core Structure Final Recommendation to the Chancellor City University of New York Pathways Task Force December 1, 2011

Research and Change Call for abstracts Nr. 2

Cambridge Assessment International Education Cambridge Ordinary Level. Published

New societal challenges for the European Union New challenges for social sciences and the humanities

Design for value DfV

Can Linguistics Lead a Digital Revolution in the Humanities?

Scientific Methodology Workshop

Geolocalized data in France Benefits from the GeoStat project Geostat++(FR) Jean-Luc LIPATZ INSEE - France October 2010

DS600048C-CL. 48" Sliding Linear Shower Door. 1174~1199mm (46-3/16"~47-3/16")

Programme Curriculum for Master Programme in Economic History

Design Methodology. Šimon Kovář

Methodology. Ben Bogart July 28 th, 2011

Boundary Work for Collaborative Water Resources Management Conceptual and Empirical Insights from a South African Case Study

Sylvain Guillaumet Composer, Interpreter, Teacher

Future Personas Experience the Customer of the Future

The Palaeographical Method under the Light of a. Digital Approach. Dr. Arianna Ciula.

FCC Certification Notice: IC Certification

Maria del Carmen ARANA COURREJOLLES

Trucs pratiques de recherche d emploi

Energy for society: The value and need for interdisciplinary research

Contextual Integrity through the lens of computer science

GENEVA COMMITTEE ON DEVELOPMENT AND INTELLECTUAL PROPERTY (CDIP) Fifth Session Geneva, April 26 to 30, 2010

FACULTY SENATE ACTION TRANSMITTAL FORM TO THE CHANCELLOR

The research and creation approach in digital literature

INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN ICED 03 STOCKHOLM, AUGUST 19-21, 2003

Laver, James The First Decadent. Being The Strange Life of J. K. Huysmans. New York: Citadel Press.

Ensemble de licorne «prêt à tricoter»

Empirical Research on Systems Thinking and Practice in the Engineering Enterprise

PREFACE. Introduction

Following these considerations, this encompasses two main objectives:

THE AXIOMATIC APPROACH IN THE UNIVERSAL DESIGN THEORY

Guidelines for the Professional Evaluation of Digital Scholarship by Historians

The Application of Visual Illusion in the Visual Communication Design

User guide. SmartTags. NT3/SmartTagsST25a

XtremeRange 5. Model: XR5. Compliance Sheet

Chapter 1 The Field of Computing. Slides Modified by Vicky Seno

The following slides will give you a short introduction to Research in Business Informatics.

2 Research Concept. 2.1 Research Approaches in Information Systems

EXECUTIVE BOARD MEETING METHODOLOGY FOR DEVELOPING STRATEGIC NARRATIVES

Building Collaborative Networks for Innovation

4. Analysing, designing and monitoring explicit SITIpolicy instruments: A theoretical framework to organize the information in GO SPIN

1 sur :34

Undisciplined Epistemology: Conceptual Heterogeneity in a Field in the Making

Greta Franzini February 23, 2016 National University of Ireland, Galway

Tuning-CALOHEE Assessment Frameworks for the Subject Area of CIVIL ENGINEERING The Tuning-CALOHEE Assessment Frameworks for Civil Engineering offers

Policy Evaluation as if sustainable development really mattered: Rethinking evaluation in light of Europe s 2050 Agenda

PowerView Remote Control Guide

INTEGRATING DESIGN AND ENGINEERING, II: PRODUCT ARCHITECTURE AND PRODUCT DESIGN

Learning Goals and Related Course Outcomes Applied To 14 Core Requirements

UDIS Programme of Inquiry

Managing Projects, Managing Knowledge

High School PLTW Introduction to Engineering Design Curriculum

TVB-2 INSTRUCTION SHEET. Test Verification Box

INFORMATION PERTAINING TO THE EVALUATION OF STUDENT LEARNING

Science Impact Enhancing the Use of USGS Science

ISO INTERNATIONAL STANDARD NORME INTERNATIONALE. Micrographics - Vocabulary - Image positions and methods of recording. Micrographie - Vocabulaire -

Proposed Curriculum Master of Science in Systems Engineering for The MITRE Corporation

Edgewood College General Education Curriculum Goals

A bit of epistemology...

Georgian-German cooperation in the field of social sciences and humanities

PRODUCT SCOTLAND: BRINGING DESIGNERS, ANTHROPOLOGISTS, ARTISTS AND ENGINEERS TOGETHER

Social Innovation and new pathways to social changefirst insights from the global mapping

Faculty of Arts and Social Sciences. STRUCTUURRAPPORT Chair Digital Arts and Culture

IS0 INTERNATIONAL STANDARD NORME INTERNATIONALE. Textile machinery and accessories - Flat warp knitting machines - Vocabulary -

liberal the habib HABIB UNIVERSITY: UNIVERSITY AVENUE, OFF SHAHRAH-E-FAISAL, GULISTAN-E-JAUHAR, KARACHI

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Issues and Challenges in Coupling Tropos with User-Centred Design

The teaching of construction at Grenoble School of Architecture Nicolas Tixier, Philippe Liveneau

Investing in Knowledge: Insights on the Funding Environment for Research on Inequality Among Young People in the United States

Patterns allow us to see relationships and develop generalizations.

Playware Research Methodological Considerations

StreetSounds STS-170-MMST Mobile Master. User Guide

TITLE: Using collections and worksets in large-scale corpora: Preliminary findings from the Workset Creation for Scholarly Analysis project

Writing a research proposal

FRENCH'GRAMMAR:' When! a!or! de!precedes!the!definite!article!le!or!les!the!preposition!contracts.!

Revised East Carolina University General Education Program

Towards a Consumer-Driven Energy System

Lenovo regulatory notice for wireless adapters

DiMe4Heritage: Design Research for Museum Digital Media

USB WiFi for Projector

Modelling Critical Context in Software Engineering Experience Repository: A Conceptual Schema

MOVADO.COM/SMARTSUPPORT

Background paper: From the Information Society To Knowledge Societies (December 2003)

Strategic Plan for CREE Oslo Centre for Research on Environmentally friendly Energy

Transcription:

Digital Humanities, Computational Linguistics, and Natural Language Processing Dr-Ing Michael Piotrowski Leibniz Institute of European History <piotrowski@ieg-mainzde> Uppsala, March 4, 2016 Defining Digital Humanities Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 1/22

WhatIsDigitalHumanitiescom Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 2/22 Do we really need a definition? Yes, we do If you want to create a program of studies or devise a research agenda, you must commit yourself to some definition However, most definitions focus on methods and say very little about goals Related problem: Are the digital humanities a discipline of their own, an interdisciplinary field, a community of practice, or something else again? Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 3/22

Consensus Relatively broad consensus, that the digital humanities bring together humanities and computer science; thus we have two aspects: A Work on humanities research question using methods and tools from computer science B Work on computer science methods und tools for tackling research questions in the humanities Term is inherently ambiguous Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 4/22 Piotrowski 2012 The emerging field of digital humanities aims to exploit the possibilities offered by digital data for humanities research The digital humanities combine traditional qualitative methods with quantitative, computer-based methods and tools, such as information retrieval, text analytics, data mining, visualization, and geographic information systems (GIS) (Piotrowski 2012, p 6) Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 5/22

Piotrowski 2013 In a narrow sense, digital humanities refers to the application of quantitative, computer-based methods for humanities research, usually complementing traditional qualitative methods [ ] The important point is that it is humanities research, ie, you re applying these methods to answer a humanities research question In a wider sense, it may also refer to the application of computer-based tools in humanities research (note that this definition does not require the use of quantitative methods) For example, creating a digital edition is not digital humanities in the narrow sense (because it does not use quantitative methods), but it is in the wider sense http://nlphisthypothesesorg/114 Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 6/22 Discussion Relatively clearly delimited area of research Uncontroversal, but not arbitrary Actually only a description of practices Nothing is said about motivations or goals of the digital humanities Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 7/22

Why Digital Humanities? Ultimate goal of all science and scholarship: gaining new insights by systematic research ( Erkenntnisgewinn ) What is the benefit of combining humanities and computer science for the humanities? Acceleration of research through digitization? Automatic analyses of large amounts of data? Attractive visualizations? Where is the advancement or innovation? Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 8/22 Piotrowski 2016 Definition (Digital humanities) The digital humanities study the means and methods of constructing formal models in the humanities Definition (Digital history) Digital history is concerned with the construction of formal models of historical circumstances and with the methodology of constructing such models Correspondingly: Digital literary studies, digital philosophy, etc These are subfields of their respective disciplines, characterized by the creation and use of formal models Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 9/22

Formal models A model is a representation of a selected part of the world Model description theory Слово формальный не ознацает ничего, кроме как логически последовательный + однозначный + абсолютно явный The word formal means nothing more than logically coherent + unambiguous + explicit (Gladkij & Mel čuk 1969, p 9) There are different degrees of formalization; here we are primarily interested in a degree of formalization that allows models to be processed and manipulated by computers Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 10/22 Formal models All scientific and scholarly research constructs models of their objects of research In order to understand a complex object (phenomenon, situation, ), you need to understand its parts and how they interrelate with each other This is exactly what a model describes In contrast to the natural sciences, models in the humanities are traditionally not formal and not directly accessible; narratives are not models, but informal descriptions of models Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 11/22

Digital humanities as a metascience Definition (Digital humanities) The digital humanities study the means and methods of constructing formal models in the humanities The digital humanities are concerned with the construction materials for such formal models; thus: a metascience Definition (Digital history) Digital history is concerned with the construction of formal models of historical circumstances and with the methodology of constructing such models Individual digital humanities subfields create concrete formal models of their research objects There is no strict boundary between digital humanities and individual digital humanities subfields Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 12/22 Traditional research process Working Materials Scholar reads and interprets primary and secondary sources Facts and insights are recorded as working materials in a variety of forms (on paper or electronically, as text, in spreadsheets, databases, etc) Using the working materials, scholar constructs mental model to answer research question and describes the model in a narrative Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 13/22

Building on the work of others (traditional process) Working Materials Working Materials Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 14/22 Where do formal models come into play? Formal Model Analysis, Visualization, Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 15/22

Collaboration on a higher level Formal Model Analysis, Visualization, Formal Model Analysis, Visualization, Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 16/22 Collaboration on a higher level Formal Model Analysis, Visualization, Formal Model Analysis, Visualization, Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 17/22

What do we need? Humanities research questions and results are primarily qualitative Digital humanities are primarily qualitative Knowledge representation is central for the creation of formal models in the humanities Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 18/22 DH, CL, and NLP Linguistics has a vantage point for observing the digital humanities, because it has essentially completed the transformation from armchair linguistics to an empirical science using formal models The role of computational linguistics corresponds to that of digital humanities The role of corpus linguistics corresponds to that of the digital humanities subfields (such as digital history) Where is the place of NLP? Applied computational linguistics? Engineers take on linguistics? Computer science? Toolsmiths? What is the role of NLP in digital humanities? Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 19/22

NLP and DH If the humanities seriously want to base their research on large quantities of text (and quantitative methods), they will need NLP as basis for all higher-level analyses For digital historical scholarship, NLP must then be regarded as an auxiliary science of history, similar to diplomatics, codicology, paleography, numismatics etc, which are indispensable for evaluating and using historical sources Il n est pas indispensable que le philologue établisse lui-même le programme, encore que ce soit infiniment souhaitable ; il devrait au moins connaître assez le langage de programmation pour contrôler le travail du technicien ; en effet, l expérience m a appris qu il ne faut pas s en remettre les yeux fermés aux électroniciens, mal préparés par leur formation mathématique à se faire une idée juste de problèmes concrets qui se posent dans la domaine de la philologie (Jacques Froger, 1970) Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 20/22 Summary The digital humanities do not merely aim to accelerate research or to analyze larger amounts of data The key is formal modeling of scholarly knowledge and insights in machine-processable form Formal models increase coherence, precision, and explicitness, encourage cooperation and sharing, and help researchers to directly build upon each other s work Knowledge representation techniques are thus the foremost tools for creating formal models in the humanities The digital humanities discussion can benefit from studying the development of linguistics Digital humanities subfields can learn from corpus linguistics NLP should be considered an auxiliary science as such, DH researchers have to get acquainted with its methods and tools Michael Piotrowski 2016-03-04 Digital Humanities, Computational Linguistics, and NLP 21/22

Digital Humanities, Computational Linguistics, and Natural Language Processing Dr-Ing Michael Piotrowski Leibniz Institute of European History <piotrowski@ieg-mainzde> Uppsala, March 4, 2016