0 Big data for the analysis of digital economy & society: Beyond bibliometrics
Stephane Berghmans, DVM PhD, VP Academic & Research Relations EU, Elsevier
With support from Judith Kamalski (Analytical Services)
22 September 2015, JRC-IPTS, Seville, Spain
@StefEurope. Data is Elsevier's, opinions are mine. 0000-0001-5414-8674
1 Elsevier's Broad View of the Global World of Research. 130+ years. Albert Einstein (Physics), Shinya Yamanaka (Medicine), Francoise Barre-Sinoussi (Medicine), Marie Curie (Physics, Chemistry), Louis Pasteur (Chemistry), John C. Mather (Physics), Alexander Fleming (Medicine), Craig C. Mello (Medicine)
2 Elsevier partners with global research leaders
National research assessment and benchmarking reports: UK REF, UK BIS reports, ERA (Australia), FCT (Portugal), VQR (Italy)
Global University Rankings: Times Higher World University Rankings, QS rankings, US News rankings (Arab Region)
Pro bono initiatives and reports (select examples): UK Royal Society, Science Europe, European Commission, FENS, HBP, Kavli Foundation, RIKEN BSI, World Bank, EuroStemCell, Kyoto University, Snowball Metrics
3 Decisions by triangulating information Reliable Data DECISION Strategy Policy Expert opinion Methodology
4 1M/Y 11M 53M 13M 700M/Y 3M FTA downloads 50M/Y FTA click-through 80K
5 Elsevier Data
Physical Sciences 6,600; Health Sciences 6,300; Social Sciences 6,350; Life Sciences 4,050
JOURNALS: 21,912 peer-reviewed journals, 367 trade journals
- Full metadata, abstracts and cited references (pre-1996)
- >2,800 fully Open Access titles
- Going back to 1823
- Funding data from acknowledgements
CONFERENCES: 17k events, 6.5M records (10%)
- Conf. expansion: 1,000 conferences, 6,000 conf. events, 400k conf. papers, 5M citations
- Mainly Engineering and Physical Sciences
BOOKS: 421 book series (28K volumes, 925K items); 65,000 books (311K items)
- Books expansion: 75K books by 2015
- Focus on Social Sciences and A&H
Titles from 105 different countries in all different regions; 40 local languages covered
6 Elsevier Data Full-text journal articles platform, used by more than 12,000 institutes worldwide. More than 11 million active users and over 700 million full-text article downloads yearly Usage data: download, engagement, data use, OA data,...
7 UK academic and corporate users are using each other's articles. Figure 7.9: Share of downloads of articles with at least one corporate author by downloading sector, 2003-07 and 2008-12. Figure 7.10: Share of article downloads by corporate sector, 2003-07 and 2008-12.
8 Elsevier Data: next challenge. Beyond article citations. Beyond the Impact Factor.
9 Open Access, Open Data, Connecting to Society, Collaboration, Peer Review, Metrics, Altmetric
10 Elsevier Data: next challenge
11 A high and rising proportion of UK journal articles are cited in patents. Figure 7.8: Relative share of 2007-11 patent citations to articles published 2007-11 for the UK and comparators. Each data point corresponds to the share of each country's total journal article output that year that was cited in patents in the period 2007-11, divided by the share of global journal article output that year that was cited in patents in the same period, which gives a global baseline of 1.0.
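The ratio described in the Figure 7.8 caption can be sketched as follows. This is a minimal illustration, assuming simple article counts per country and per year are available; the function name and the example counts are hypothetical and are not taken from the report.

```python
def relative_patent_citation_share(country_cited, country_total,
                                   world_cited, world_total):
    """Share of a country's articles cited in patents, divided by the
    same share for the world, so that the world baseline equals 1.0."""
    if country_total <= 0 or world_total <= 0 or world_cited <= 0:
        raise ValueError("totals and world_cited must be positive")
    country_share = country_cited / country_total
    world_share = world_cited / world_total
    return country_share / world_share


# Hypothetical counts for one publication year (illustration only).
example = relative_patent_citation_share(
    country_cited=300, country_total=100_000,
    world_cited=2_000, world_total=1_000_000,
)
print(round(example, 2))
```

A value above 1.0 means the country's articles from that year were cited in patents more often than the world average; a value below 1.0 means less often.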
12 External Data: consistency?
13 External Data: coverage? Open Government?
14 Comparing concept trends in grant awards versus publications: what areas of research are being funded?
Set A: Scopus documents related to BNR (brain and neuroscience research)
Set B: Funded grant awards on BNR from NIH RePORTER
Set C: Funded grant awards on BNR from the European Commission
Rank each concept by its normalized frequency and relevance in each document set, then compare the rankings across sets.
Chart: occurrence frequency in Set A versus occurrence frequency in Set B, with four regions:
- Prominent in research, but not in grants
- Low occurrence in both
- High overlap: consistency between funded and published research
- Prominent in grants, but not (yet) reflected in publications
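The four-region comparison can be sketched in a few lines. This is only an illustration under stated assumptions: each document is reduced to a set of concepts, "normalized frequency" is taken to be the fraction of documents containing a concept, the relevance weighting mentioned on the slide is not modeled, and the threshold and sample data are hypothetical.

```python
from collections import Counter


def normalized_frequency(docs):
    """Fraction of documents in which each concept occurs.
    docs: list of sets of concept strings."""
    counts = Counter(concept for doc in docs for concept in set(doc))
    return {concept: k / len(docs) for concept, k in counts.items()}


def quadrant(freq_a, freq_b, threshold):
    """Label each concept by prominence in set A (publications)
    and set B (grants). The threshold is an assumed cut-off."""
    labels = {}
    for concept in set(freq_a) | set(freq_b):
        high_a = freq_a.get(concept, 0.0) >= threshold
        high_b = freq_b.get(concept, 0.0) >= threshold
        if high_a and high_b:
            labels[concept] = "high overlap"
        elif high_a:
            labels[concept] = "prominent in research, not in grants"
        elif high_b:
            labels[concept] = "prominent in grants, not (yet) in publications"
        else:
            labels[concept] = "low occurrence in both"
    return labels


# Hypothetical toy data, not drawn from Scopus or NIH RePORTER.
set_a = [{"neuron", "memory"}, {"neuron", "synapse"},
         {"neuron", "memory"}, {"cortex"}]
set_b = [{"neuron", "stroke"}, {"stroke", "therapy"},
         {"neuron", "stroke"}, {"stroke"}]
labels = quadrant(normalized_frequency(set_a),
                  normalized_frequency(set_b), threshold=0.5)
for concept in sorted(labels):
    print(concept, "->", labels[concept])
```

Comparing rank positions instead of thresholded frequencies would follow the same pattern, with the frequencies sorted per set before comparison.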
15 Disease-related concepts and translational research appear more frequently in NIH grants. Top concepts that ranked highly in Set B but not in Set A. Table 3.7: Top 20 concepts that are dominant in NIH grant descriptions related to brain and neuroscience but not in brain and neuroscience research articles from Scopus, with 1 being the highest ranking.
16 Strong alignment on disorders research. Table 3.10: Top 10 concepts that occurred in brain and neuroscience research articles relating to disorders from document sets A, B, and C, based on the sum of term frequency-inverse document frequency (tf-idf) of the concept in the document set that it belonged to. Figures in parentheses are the frequency with which the concept occurred in the document set. Highlighted in violet are concepts that appeared in the top 10 disorder-related concepts in all three document sets, reflecting common areas of focus. Highlighted in magenta are concepts that only appeared in Set A and Set B. Concepts that are not highlighted were those unique to each document set, indicating different areas of focus in disorder-related concepts in brain and neuroscience research.
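Ranking concepts by their summed tf-idf within one document set can be sketched as below. The slide does not say which tf-idf variant the report uses, so this assumes the plain textbook form (relative term frequency times log(N / document frequency)); the function name and the toy documents are hypothetical.

```python
import math
from collections import Counter


def summed_tfidf(docs):
    """Sum of tf-idf per concept over one document set.
    docs: list of lists of concept strings (repeats allowed)."""
    n_docs = len(docs)
    doc_freq = Counter(c for doc in docs for c in set(doc))
    scores = Counter()
    for doc in docs:
        if not doc:
            continue
        term_counts = Counter(doc)
        for concept, k in term_counts.items():
            tf = k / len(doc)                       # relative term frequency
            idf = math.log(n_docs / doc_freq[concept])
            scores[concept] += tf * idf
    return dict(scores)


# Hypothetical toy document set (illustration only).
docs = [
    ["stroke", "stroke", "epilepsy"],
    ["stroke", "dementia"],
    ["stroke", "epilepsy", "epilepsy", "epilepsy"],
]
scores = summed_tfidf(docs)
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

With this variant a concept that occurs in every document scores zero, which is why tf-idf favours concepts that are frequent but specific to part of the set.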
17 Decisions by triangulating information Reliable Data DECISION Strategy Policy Expert opinion Methodology
18 Experts & Expertise. Internal expertise: Analytics team, 7 staff; SciVal, 14 staff; Scopus, 30+ staff...
19 External Experts & Expertise to define (research) areas. A. Fingerprint recent publications from these journals and derive concepts: starting from the Scopus journal classification area Neuroscience, extract the concepts from these publications.
20 Select relevant and specific BNR (brain and neuroscience research) concepts. 1200+ concepts, 2400+ descendants. From all concepts, only those that are relevant and specific enough to the field are included. Feedback from experts was incorporated: European Commission, FENS, HBP, NIH/NIMH, RIKEN BSI.
21 Use selected concepts as a filter on the entire Scopus database to generate the publication set for analysis. Scopus (55 million articles), filtered by the selected concepts: 1.79 million articles (2009-2013) included for analysis (~16% of world output). Figure 1.3: Word cloud of concepts from the selected document set where the selection rate was 100%, meaning that no relevant documents that contained these concepts were excluded. Size of each concept is weighted by the number of occurrences in the selected document set. Source: Scopus.
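The filtering step can be pictured with a very small sketch. It assumes each document carries a set of fingerprinted concepts and is kept when it shares at least one concept with the selected list; the actual selection rule used for the report is not given on the slide, and the record layout, function name and data here are hypothetical.

```python
def select_documents(documents, selected_concepts):
    """Keep documents that contain at least one selected concept.
    documents: list of dicts with a 'concepts' collection (assumed layout)."""
    selected = set(selected_concepts)
    return [doc for doc in documents if selected & set(doc["concepts"])]


# Hypothetical records standing in for database entries.
documents = [
    {"id": 1, "concepts": {"hippocampus", "memory"}},
    {"id": 2, "concepts": {"graphene"}},
    {"id": 3, "concepts": {"memory", "stroke"}},
]
selected = select_documents(documents, {"hippocampus", "stroke"})
share = len(selected) / len(documents)
print([doc["id"] for doc in selected], round(share, 2))
```

The share computed at the end is the analogue of the "~16% of world output" figure: the selected set divided by everything that was searched.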
22 External Experts & Expertise to formulate policy-relevant questions. Mapping the landscape of brain and neuroscience research:
- What is the quantity and impact of this domain's research?
- Who collaborates the most, and how impactful are those collaborations?
- What are the disciplinary histories of neuroscience researchers?
- What are the most popular and emerging research topics?
- What areas of research are being funded?
The countries that the report will focus on are the EU41, Canada, China, Japan and the USA.
26 External Experts & Expertise to interpret the results
27 Decisions by triangulating information Reliable Data DECISION Strategy Policy Expert opinion Methodology
28 Methodology: always too many ideas and options. What are the disciplinary histories of neuroscience researchers?
OUTFLOW: Researchers who left the area of brain and neuroscience research (BNR)
INFLOW: Researchers who came into the area of BNR
SINGLE DISCIPLINE: Researchers who published only in the area of BNR
MULTIDISCIPLINARY: Researchers who spent fewer than two years in the area of BNR at any given time
29 Methodology: always too many ideas and options. Is an article interdisciplinary? Are the references of the article "far away" from each other in terms of subjects? Are the journals in which the references were published far away from each other? HEFCE: Higher Education Funding Council for England. http://www.hefce.ac.uk/news/newsarchive/2015/name,104889,en.html
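One way to make "far away from each other" concrete is the average distance between all pairs of an article's references. The sketch below is an assumption for illustration, not the method of the HEFCE report: each reference is reduced to a single subject label, and the distance is a crude 0/1 indicator of whether two labels differ. A graded distance between subjects could be passed in the same way.

```python
from itertools import combinations


def mean_pairwise_distance(reference_subjects, distance):
    """Average distance over all pairs of reference subjects.
    Returns 0.0 when there are fewer than two references."""
    pairs = list(combinations(reference_subjects, 2))
    if not pairs:
        return 0.0
    return sum(distance(a, b) for a, b in pairs) / len(pairs)


def different_subject(a, b):
    """Assumed toy distance: 0 for the same subject, 1 otherwise."""
    return 0.0 if a == b else 1.0


# Hypothetical reference lists (illustration only).
mono = mean_pairwise_distance(["Neuroscience"] * 4, different_subject)
mixed = mean_pairwise_distance(
    ["Neuroscience", "Neuroscience", "Computer Science", "Physics"],
    different_subject,
)
print(mono, round(mixed, 3))
```

Under this toy measure an article citing a single subject scores 0, and the score rises toward 1 as its references spread over more subjects.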
30 What new things are we working on? Interdisciplinarity report for HEFCE World of Research 2015 Patent data Funding data Altmetrics Newsflo End of year Snapshot vs. Dynamics Smart publishing data Sustainability report for United Nations and Elsevier Green team Collaboration JRC-CERN Collaboration Spotting Tool Gender for Germany @ Gender Summit Berlin
31 The Brain Science report can be downloaded at: http://www.elsevier.com/solutions/analytical-services Thank you! www.elsevier.com/research-intelligence