Taxonomic Name Recognition (TNR) in Biodiversity Heritage Library

1 Taxonomic Name Recognition (TNR) in Biodiversity Heritage Library. Qin Wei, Chris Freeland, P. Bryan Heidorn. Missouri Botanical Garden

2 Co-author: Chris Freeland, Director of Biodiversity Heritage Library and IT division manager of Missouri Botanical Garden. 10/26/08 02:01 AM TNR in BHL 2

3 Biodiversity Heritage Library (BHL): Ten major natural history museum libraries, botanical libraries, and research institutions have joined to form the BHL. The group is developing a strategy and operational plan to digitize the published literature of biodiversity held in their respective collections. This literature will be available through a global biodiversity commons. More information about BHL could be found at

4 Participating institutions: American Museum of Natural History (New York, NY); The Field Museum (Chicago, IL); Harvard University Botany Libraries (Cambridge, MA); Harvard University (Cambridge, MA); Marine Biological Laboratory / Woods Hole Oceanographic Institution (Woods Hole, MA); Missouri Botanical Garden (St. Louis, MO); Natural History Museum (London, UK); The New York Botanical Garden (New York, NY); Royal Botanic Gardens, Kew (Richmond, UK); Smithsonian Institution Libraries (Washington, DC)

5 Open Access: The BHL Project strives to establish a major corpus of digitized publications on the Web drawn from the historical biodiversity literature. This material will be available for open access and responsible use as a part of a global Biodiversity Commons. We will work with the global taxonomic community, rights holders, and other interested parties to ensure that this legacy literature is available to all.


7 TNR in BHL: A significant aspect of BHL is the incorporation of algorithmic taxonomic intelligence provided by uBio.org. As materials are scanned, the image files are processed through ABBYY FineReader or PrimeOCR to create text derivatives. Those text files are then submitted to uBio's TaxonFinder web service to identify strings in the text that match the characteristics of scientific names.
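
The idea of finding strings that "match the characteristics of scientific names" can be illustrated with a minimal sketch. The pattern below (a capitalized genus-like word followed by a lowercase epithet-like word) and the function name `candidate_names` are assumptions chosen for illustration; this is not how TaxonFinder works internally, and it does not call the uBio web service.

```python
import re

# Hypothetical heuristic, not TaxonFinder's method: a capitalized word of at
# least three letters followed by a lowercase word of at least three letters.
BINOMIAL = re.compile(r"\b([A-Z][a-z]{2,})\s+([a-z]{3,})\b")

def candidate_names(text):
    """Return (genus, epithet) pairs that look like Latin binomials."""
    return [(m.group(1), m.group(2)) for m in BINOMIAL.finditer(text)]

sample = "Specimens of Quercus alba were collected near Acer rubrum stands."
print(candidate_names(sample))
```

A shape-only rule like this also matches ordinary capitalized phrases at the start of a sentence, which is one reason a recognizer needs a name list or a statistical model behind it.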

8 Two TNR algorithms: TaxonFinder was developed by uBio and uses statistical models created from the validated organism names in NameBank. These models aim to describe the structure and frequency of common character sequences in organism names, so that TaxonFinder can infer whether an unknown word has a structure similar to that of a known organism name. Online only.
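
A rough way to picture a model of "common character sequences" is a character trigram table built from known names. The sketch below is an assumption made for illustration: the toy name list, the trigram choice, and the scoring rule (the fraction of a word's trigrams seen in training) are all hypothetical and are not taken from TaxonFinder or NameBank.

```python
from collections import Counter

def trigrams(word):
    """Character trigrams of a word, with ^ and $ marking its boundaries."""
    padded = f"^{word.lower()}$"
    return [padded[i:i + 3] for i in range(len(padded) - 2)]

def train(names):
    """Count the trigrams that occur in a list of known organism names."""
    counts = Counter()
    for name in names:
        counts.update(trigrams(name))
    return counts

def score(word, counts):
    """Fraction of the word's trigrams that also occur in the known names."""
    grams = trigrams(word)
    if not grams:
        return 0.0
    return sum(1 for g in grams if g in counts) / len(grams)

# Hypothetical toy training set standing in for a validated name list.
model = train(["Quercus", "Acer", "Rosa", "Pinus", "Ficus", "Carex"])
print(score("Picus", model), score("table", model))
```

An unseen word that shares most of its trigrams with known names scores high, while an ordinary English word scores low. A real model would also weight trigrams by frequency rather than only by presence.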

9 Two TNR algorithms: FAT, short for Find All Taxon names, was developed to automatically extract all the taxonomic names from the biological literature. It then uses the parts already classified to build lexica and statistics (dictionary lookup), which are used to classify the rest of the text (Sautter et al.). Offline usage and customized dictionaries.


11 Digitization Process (diagram components: Libraries; OCR, images to texts; TNR, process text; BHL Web; Text Database; uBio Database; BHL (Text Mining Database); Unstructured Data; Text Mining, find names; Structured Data)

12 Digitization Process, error sources (diagram components: Libraries; OCR; TNR; BHL Web; Text Database; uBio Database; BHL (Text Mining Database); Unstructured Data; Text Mining; Structured Data; error labels: OCR Error, TNR Error, Authority File Error)

13 Sample Characteristics (table columns: Number of Pages; Average Number of Tokens; Average Number of Names)

14 Evaluation Measures: Precision is the proportion of matching strings that are valid names. In our case, precision measures the ability of the algorithm to exclude non-valid names from the result. Recall is the proportion of valid names in the whole database that were returned as true positives. It measures the ability to find all valid names in the database. In this evaluation, we also use a single measure, the F-score, which is the harmonic mean of recall (R) and precision (P): F-score = 2 × (Precision × Recall) / (Precision + Recall)
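
The three measures can be computed directly from counts. The sketch below follows the definitions on this slide; the function name `precision_recall_f` and the example counts are hypothetical and are not figures from the evaluation.

```python
def precision_recall_f(found, correct, valid):
    """Compute precision, recall and F-score from raw counts.

    found   -- number of strings the algorithm returned as names
    correct -- number of returned strings that are valid names
    valid   -- number of valid names in the ground truth
    """
    precision = correct / found if found else 0.0
    recall = correct / valid if valid else 0.0
    denom = precision + recall
    f_score = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_score

# Hypothetical counts: 50 strings returned, 40 of them valid, 100 valid names.
p, r, f = precision_recall_f(found=50, correct=40, valid=100)
print(f"P={p:.2%} R={r:.2%} F={f:.2%}")
```

Because the F-score is a harmonic mean, it stays close to the lower of the two values, so a system cannot score well by maximizing precision or recall alone.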

15 Sample Language Distribution (chart: language distribution of the sample)

16 Ground Truth: Total: 3003 valid names. Percentage of valid names: Unique Name 89.01%; One Word 35.76%; Two Words 40.63%; Three Words 17.25%; More Words 6.36%

17 OCR Overall Performance: Total 3003; Wrong OCR 1056; Error Rate 35.16%

18 Error Breakdown (percentage of errors by number of wrong characters): one char 31%; two chars 21%; three chars 11%; four chars 6%; more 31%

19 OCR: Does language matter? (chart: OCR error rate by language; languages shown: lat, unk, ger, eng, dut, spa, fre, swe, por, cze, ita)

20 Top OCR error patterns: 1 Insert Space; 2 Omit Space; 3 e->c; 4 u->i; 5 u->n; 6 i->l; 7 c->e; 8 n->v; 9 l->i; 10 r->i; 11 u->ii; 12 h->l; 13 h->ii; 14 e->o
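
One way such patterns could be put to use is to undo a single confusion at a time and check the result against a list of known names. The sketch below assumes that "x->y" means a printed x was read as y; that reading, the function name `corrections`, and the toy name list are assumptions. The two space-related patterns are not handled.

```python
# (printed, recognized) pairs taken from the character patterns on this slide,
# under the assumption that "x->y" means x was misread as y.
CONFUSIONS = [
    ("e", "c"), ("u", "i"), ("u", "n"), ("i", "l"), ("c", "e"), ("n", "v"),
    ("l", "i"), ("r", "i"), ("u", "ii"), ("h", "l"), ("h", "ii"), ("e", "o"),
]

def corrections(token, known_names):
    """Undo one assumed OCR confusion at a time and keep dictionary hits."""
    hits = set()
    for printed, recognized in CONFUSIONS:
        start = token.find(recognized)
        while start != -1:
            candidate = token[:start] + printed + token[start + len(recognized):]
            if candidate in known_names:
                hits.add(candidate)
            start = token.find(recognized, start + 1)
    return sorted(hits)

known = {"Quercus", "Acer"}  # hypothetical stand-in for an authority file
print(corrections("Qnercus", known), corrections("Accr", known))
```

This only repairs names with a single wrong character, which the error breakdown puts at roughly a third of the OCR errors; longer errors would need repeated substitution or a more general distance measure.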

21 NameBank: For TaxonFinder, the NameBank incompleteness error rate is 6%. (table columns: NoBankID; IsValidName; Found; Total)

22 Error Analysis (values given as TaxonFinder / FAT)
Without_OCR_Error*. Rows: No. of Names (identified by biologist); No. of Names Found by algorithms; Correct; Exact Match; Precision; Recall 36.62% / 23.34%; F-score 38.47% / 25.77%
With_OCR_Error*. Rows: No. of Names (identified by biologist); No. of Names Found by algorithms; Correct; Precision 43.77% / 32.25%; Recall 25.82% / 17.21%; F-score % 28.20% 34.80% 24.73%

23 Overall Performance, error rate by step: OCR 35.16%; TaxonFinder* 41.09%; NameBank 3.06%; Total 79.32%

24 Conclusion: Our results indicate that TaxonFinder is slightly better than FAT. But even TaxonFinder achieved an F-score of only 38.47%, which is low compared to other named entity recognition results. For instance, the best system entered in the Message Understanding Conferences (MUC) achieved an F-score of 93.39%, while human annotators scored 97.60% and 96.95%. There is considerable room to improve the algorithms and obtain better results.

25 Future Work: Artificially intelligent retrieval is the trend. How could we achieve it? Experiments on machine learning methods; using other external sources, e.g. ontologies; automatic OCR correction; fuzzy matching algorithms in IR
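
As a hedged illustration of the fuzzy matching item, the sketch below uses `difflib.get_close_matches` from the Python standard library to map an OCR-damaged token to the closest known name. The function name `best_match`, the 0.8 cutoff, and the toy name list are assumptions; the slides do not specify a matching algorithm.

```python
from difflib import get_close_matches

def best_match(token, known_names, cutoff=0.8):
    """Return the known name most similar to token, or None if none is close."""
    matches = get_close_matches(token, known_names, n=1, cutoff=cutoff)
    return matches[0] if matches else None

names = ["Quercus", "Acer", "Rosa"]  # hypothetical stand-in for a name list
print(best_match("Qucrcus", names))  # token with one misread character
print(best_match("zzzz", names))     # nothing similar enough
```

The cutoff trades recall for precision: a lower value recovers more damaged names but also maps more ordinary words onto names.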

26 References
[1] Rice, S., J. Kanai, and T. Nartker. An evaluation of OCR accuracy. In UNLV Information Science Research Institute Annual Report, pages 9-20, 1993.
[2] Koning, D., N. Sarkar, and T. Moritz. TaxonGrab: Extracting Taxonomic Names from Text. Biodiversity Informatics 2:
[3] Sautter, G., K. Bohm, and D. Agosti. A combining approach to find all taxon names (FAT) in legacy biosystematics literature. Biodiversity Informatics 3,
[4] McCray, A. T., A. R. Aronson, A. C. Browne, T. C. Rindflesch, A. Razi, and S. Srinivasan. UMLS knowledge for biomedical language processing. Bulletin Medical Library Association 81:

27 Questions?

28 Thanks!


More information

List of Journals. International Journals Mechanical Engineering

List of Journals. International Journals Mechanical Engineering List of Journals Mechanical Engineering 1. Journal of Scientific and Industrial Research 2. Bulletin of Material Science 3. Inventi Impact: Mechanical Engineering 4. Claro: Materials Science 5. Claro:

More information

KIPO s plan for AI - Are you ready for AI? - Gyudong HAN, KIPO Republic of Korea

KIPO s plan for AI - Are you ready for AI? - Gyudong HAN, KIPO Republic of Korea KIPO s plan for AI - Are you ready for AI? - Gyudong HAN, KIPO Republic of Korea Table of Contents What is AI? Why AI is necessary? Where and How to apply? With whom? Further things to think about 2 01

More information

A Novel Fault Diagnosis Method for Rolling Element Bearings Using Kernel Independent Component Analysis and Genetic Algorithm Optimized RBF Network

A Novel Fault Diagnosis Method for Rolling Element Bearings Using Kernel Independent Component Analysis and Genetic Algorithm Optimized RBF Network Research Journal of Applied Sciences, Engineering and Technology 6(5): 895-899, 213 ISSN: 24-7459; e-issn: 24-7467 Maxwell Scientific Organization, 213 Submitted: October 3, 212 Accepted: December 15,

More information

Recognition System for Pakistani Paper Currency

Recognition System for Pakistani Paper Currency World Applied Sciences Journal 28 (12): 2069-2075, 2013 ISSN 1818-4952 IDOSI Publications, 2013 DOI: 10.5829/idosi.wasj.2013.28.12.300 Recognition System for Pakistani Paper Currency 1 2 Ahmed Ali and

More information

Combination of Web and Android Application to Implement Automated Meter Reader Based on OCR

Combination of Web and Android Application to Implement Automated Meter Reader Based on OCR Combination of Web and Android Application to Implement Automated Meter Reader Based on OCR 1 Swapnil R. Gawali, 2 Sangram K. Pawar, 3 Amol Kad 1, 2, 3 Department of Information Technology 1, 2, 3 AAEMF's

More information

Simple Large-scale Relation Extraction from Unstructured Text

Simple Large-scale Relation Extraction from Unstructured Text Simple Large-scale Relation Extraction from Unstructured Text Christos Christodoulopoulos and Arpit Mittal Amazon Research Cambridge Alexa Question Answering Alexa, what books did Carrie Fisher write?

More information

IRIS BIOMETRICS FROM SEGMENTATION TO TEMPLATE SECURITY ADVANCES IN INFORMATION SECURITY

IRIS BIOMETRICS FROM SEGMENTATION TO TEMPLATE SECURITY ADVANCES IN INFORMATION SECURITY IRIS BIOMETRICS FROM SEGMENTATION TO TEMPLATE SECURITY ADVANCES IN INFORMATION SECURITY page 1 / 5 page 2 / 5 iris biometrics from segmentation pdf Iris recognition is an automated method of biometric

More information

Comparative Interoperability Project: Collaborative Science, Interoperability Strategies, and Distributing Cognition

Comparative Interoperability Project: Collaborative Science, Interoperability Strategies, and Distributing Cognition Comparative Interoperability Project: Collaborative Science, Interoperability Strategies, and Distributing Cognition Florence Millerand 1, David Ribes 2, Karen S. Baker 3, and Geoffrey C. Bowker 4 1 LCHC/Science

More information

Recap from previous lecture. Information Retrieval. Topics for Today. Recall: Basic structure of an Inverted index. Dictionaries & Tolerant Retrieval

Recap from previous lecture. Information Retrieval. Topics for Today. Recall: Basic structure of an Inverted index. Dictionaries & Tolerant Retrieval Recap from previous lecture nformation Retrieval Dictionaries & Tolerant Retrieval Jörg Tiedemann jorg.tiedemann@lingfil.uu.se Department of Linguistics and Philology Uppsala University nverted indexes

More information

THE SUBJECT COMPOSITION OF THE WORLD'S SCIENTIFIC JOURNALS

THE SUBJECT COMPOSITION OF THE WORLD'S SCIENTIFIC JOURNALS Scientometrics, Vol. 2, No. 1 (198) 53-63 THE SUBJECT COMPOSITION OF THE WORLD'S SCIENTIFIC JOURNALS M. P. CARPENTER, F. NARIN Computer Horizons, Inc., 15 Kings Highway North, Cherry Hill, New Jersey 834

More information

Text Mining for Historical Documents Motivation and Case Studies

Text Mining for Historical Documents Motivation and Case Studies Motivation and Case Studies Computational Linguistics/MMCI Universität des Saarlandes Wintersemester 2011/12 22.02.2012 IT and Cultural Heritage: Why bother? (1) Museums, archives and libraries possess

More information

Medical Intelligence:

Medical Intelligence: Medical Intelligence: Big Data, Predictive Analytics, Machine Learning, and Artificial Intelligence Anthony C. Chang, MD, MBA, MPH Chief Intelligence and Innovation Officer Children s Hospital of Orange

More information

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction A new method to recognize Dimension Sets and its application in Architectural Drawings Yalin Wang, Long Tang, Zesheng Tang P O Box 84-187, Tsinghua University Postoffice Beijing 100084, PRChina Email:

More information

MICROCHIP PATTERN RECOGNITION BASED ON OPTICAL CORRELATOR

MICROCHIP PATTERN RECOGNITION BASED ON OPTICAL CORRELATOR 38 Acta Electrotechnica et Informatica, Vol. 17, No. 2, 2017, 38 42, DOI: 10.15546/aeei-2017-0014 MICROCHIP PATTERN RECOGNITION BASED ON OPTICAL CORRELATOR Dávid SOLUS, Ľuboš OVSENÍK, Ján TURÁN Department

More information

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image Background Computer Vision & Digital Image Processing Introduction to Digital Image Processing Interest comes from two primary backgrounds Improvement of pictorial information for human perception How

More information

Elements of Artificial Intelligence and Expert Systems

Elements of Artificial Intelligence and Expert Systems Elements of Artificial Intelligence and Expert Systems Master in Data Science for Economics, Business & Finance Nicola Basilico Dipartimento di Informatica Via Comelico 39/41-20135 Milano (MI) Ufficio

More information

Roswitha Poll Münster, Germany

Roswitha Poll Münster, Germany Date submitted: 02/06/2009 The Project NUMERIC: Statistics for the Digitisation of the European Cultural Heritage Roswitha Poll Münster, Germany Meeting: 92. Statistics and Evaluation, Information Technology

More information

Application Areas of AI Artificial intelligence is divided into different branches which are mentioned below:

Application Areas of AI   Artificial intelligence is divided into different branches which are mentioned below: Week 2 - o Expert Systems o Natural Language Processing (NLP) o Computer Vision o Speech Recognition And Generation o Robotics o Neural Network o Virtual Reality APPLICATION AREAS OF ARTIFICIAL INTELLIGENCE

More information

IMPORTANT ASPECTS OF DATA MINING & DATA PRIVACY ISSUES. K.P Jayant, Research Scholar JJT University Rajasthan

IMPORTANT ASPECTS OF DATA MINING & DATA PRIVACY ISSUES. K.P Jayant, Research Scholar JJT University Rajasthan IMPORTANT ASPECTS OF DATA MINING & DATA PRIVACY ISSUES K.P Jayant, Research Scholar JJT University Rajasthan ABSTRACT It has made the world a smaller place and has opened up previously inaccessible markets

More information

ACADEMIC YEAR

ACADEMIC YEAR INTERNATIONAL JOURNAL SL.NO. NAME OF THE FACULTY TITLE OF THE PAPER JOURNAL DETAILS 1 Dr.K.Komathy 2 Dr.K.Komathy 3 Dr.K. Komathy 4 Dr.G.S.Anandha Mala 5 Dr.G.S.Anandha Mala 6 Dr.G.S.Anandha Mala 7 Dr.G.S.Anandha

More information

1. Queries are issued to the image archive for information about computed tomographic (CT)

1. Queries are issued to the image archive for information about computed tomographic (CT) Appendix E1 Exposure Extraction Method examinations. 1. Queries are issued to the image archive for information about computed tomographic (CT) 2. Potential dose report screen captures (hereafter, dose

More information

Sentiment Analysis of User-Generated Contents for Pharmaceutical Product Safety

Sentiment Analysis of User-Generated Contents for Pharmaceutical Product Safety Sentiment Analysis of User-Generated Contents for Pharmaceutical Product Safety Haruna Isah, Daniel Neagu and Paul Trundle Artificial Intelligence Research Group University of Bradford, UK Haruna Isah

More information

Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader

Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader Teresa Vania Tjahja 1, Anto Satriyo Nugroho #2, Nur Aziza Azis #, Rose Maulidiyatul Hikmah #, James Purnama Faculty

More information

Curriculum Vitae Bradley A. Malin

Curriculum Vitae Bradley A. Malin Curriculum Vitae Bradley A. Malin Carnegie Mellon University +1 412 268 1097 (tel) School of Computer Science +1 412 268 6708 (fax) 1320 B Wean Hall malin@cs.cmu.edu Pittsburgh, Pennsylvania 15213-3890

More information

Midterm for Name: Good luck! Midterm page 1 of 9

Midterm for Name: Good luck! Midterm page 1 of 9 Midterm for 6.864 Name: 40 30 30 30 Good luck! 6.864 Midterm page 1 of 9 Part #1 10% We define a PCFG where the non-terminals are {S, NP, V P, V t, NN, P P, IN}, the terminal symbols are {Mary,ran,home,with,John},

More information

Implementation of Text to Speech Conversion

Implementation of Text to Speech Conversion Implementation of Text to Speech Conversion Chaw Su Thu Thu 1, Theingi Zin 2 1 Department of Electronic Engineering, Mandalay Technological University, Mandalay 2 Department of Electronic Engineering,

More information

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Olarik Surinta and Rapeeporn Chamchong Department of Management Information Systems and Computer Science Faculty of Informatics,

More information

Advanced Software Developments for Automated Power Quality Assessment Using DFR Data

Advanced Software Developments for Automated Power Quality Assessment Using DFR Data Advanced Software Developments for Automated Power Quality Assessment Using DFR Data M. Kezunovic, X. Xu Texas A&M University Y. Liao ABB ETI, Raleigh, NC Abstract The power quality (PQ) meters are usually

More information

Classification Experiments for Number Plate Recognition Data Set Using Weka

Classification Experiments for Number Plate Recognition Data Set Using Weka Classification Experiments for Number Plate Recognition Data Set Using Weka Atul Kumar 1, Sunila Godara 2 1 Department of Computer Science and Engineering Guru Jambheshwar University of Science and Technology

More information