CiteGraph: a citation network system for MEDLINE articles and analysis. 2013

Qing Zhang, and Hong Yu
Department of Electrical Engineering and Computer Science, University of Wisconsin-Milwaukee, Milwaukee, WI, USA.

This paper details the development and implementation of CiteGraph, a system for constructing large-scale citation and co-authorship networks from full-text biomedical articles. CiteGraph represents articles and authors by uniquely identified nodes, and connects those nodes through citation and co-authorship relations. CiteGraph network encompasses over 1.65 million full-text articles and 6.35 million citations by 1.37 million unique authors from the Elsevier full-text articles. Our evaluation shows 98% 99% F1-score for mapping a citation to the corresponding article and identifying MEDLINE articles. We further analyzed the characteristics of CiteGraph and found that they are consistent with assumptions made using small-scale bibliometric analysis. We also developed several novel network-based methods for analyzing publication, citation and collaboration patterns. This is the first work to develop a completely automated system for the creation of a large-scale citation network in the biomedical domain, and also to introduce novel findings in researcher publication histories. CiteGraph can be a useful resource to both the biomedical community, and bibliometric research.

UI MeSH Term Description Entries
D008498 Medical Record Linkage The creation and maintenance of medical and vital records in multiple institutions in a manner that will facilitate the combined use of the records of identified individuals. Record Linkage, Medical,Linkage, Medical Record,Linkages, Medical Record,Medical Record Linkages,Record Linkages, Medical
D009323 Natural Language Processing Computer processing of a language with rules that reflect and describe current usage rather than prescribed usage. Language Processing, Natural,Language Processings, Natural,Natural Language Processings,Processing, Natural Language,Processings, Natural Language
D010506 Periodicals as Topic Works about publications issued at stated, more or less regular, intervals. Journals as Topic,Magazines,Newsletters,Magazine,Newsletter
D003628 Database Management Systems Software designed to store, manipulate, manage, and control data for specific uses. Data Base Management Systems,Management System, Data Base,Management Systems, Data Base,System, Data Base Management,Systems, Data Base Management,Database Management System
D001185 Artificial Intelligence Theory and development of COMPUTER SYSTEMS which perform tasks that normally require human intelligence. Such tasks may include speech recognition, LEARNING; VISUAL PERCEPTION; MATHEMATICAL COMPUTING; reasoning, PROBLEM SOLVING, DECISION-MAKING, and translation of language. AI (Artificial Intelligence),Computer Reasoning,Computer Vision Systems,Knowledge Acquisition (Computer),Knowledge Representation (Computer),Machine Intelligence,Computational Intelligence,Acquisition, Knowledge (Computer),Computer Vision System,Intelligence, Artificial,Intelligence, Computational,Intelligence, Machine,Knowledge Representations (Computer),Reasoning, Computer,Representation, Knowledge (Computer),System, Computer Vision,Systems, Computer Vision,Vision System, Computer,Vision Systems, Computer
D015706 Bibliometrics The use of statistical methods in the analysis of a body of literature to reveal the historical development of subject fields and patterns of authorship, publication, and use. Formerly called statistical bibliography. (from The ALA Glossary of Library and Information Science, 1983) Bibliography, Statistical,Analysis, Bibliometric,Bibliographies, Statistical,Bibliometric Analysis,Statistical Bibliographies,Statistical Bibliography,Analyses, Bibliometric,Bibliometric Analyses
D016239 MEDLINE The premier bibliographic database of the NATIONAL LIBRARY OF MEDICINE. MEDLINE® (MEDLARS Online) is the primary subset of PUBMED and can be searched on NLM's Web site in PubMed or the NLM Gateway. MEDLINE references are indexed with MEDICAL SUBJECT HEADINGS (MeSH). Index Medicus
D018875 Vocabulary, Controlled A specified list of terms with a fixed and unalterable meaning, and from which a selection is made when CATALOGING; ABSTRACTING AND INDEXING; or searching BOOKS; JOURNALS AS TOPIC; and other documents. The control is intended to avoid the scattering of related subjects under different headings (SUBJECT HEADINGS). The list may be altered or extended only by the publisher or issuing agency. (From Harrod's Librarians' Glossary, 7th ed, p163) Controlled Vocabulary,Thesaurus,Controlled Thesauri,Controlled Thesaurus,Thesauri,Controlled Vocabularies,Thesauri, Controlled,Thesaurus, Controlled,Vocabularies, Controlled

Related Publications

Qing Zhang, and Hong Yu
September 2015, Journal of reconstructive microsurgery,
Qing Zhang, and Hong Yu
January 2016, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science,
Qing Zhang, and Hong Yu
January 2015, Plastic and reconstructive surgery. Global open,
Qing Zhang, and Hong Yu
January 2005, AMIA ... Annual Symposium proceedings. AMIA Symposium,
Qing Zhang, and Hong Yu
April 2015, BMC bioinformatics,
Qing Zhang, and Hong Yu
January 2019, AMIA ... Annual Symposium proceedings. AMIA Symposium,
Qing Zhang, and Hong Yu
June 2018, The journal of evidence-based dental practice,
Qing Zhang, and Hong Yu
January 2013, Respirology (Carlton, Vic.),
Qing Zhang, and Hong Yu
February 2009, The Annals of thoracic surgery,
Copied contents to your clipboard!