Skip to main content

Research Repository

Advanced Search

The effect of word sense disambiguation accuracy on literature based discovery

Preiss, J; Stevenson, M

The effect of word sense disambiguation accuracy on literature based discovery Thumbnail


Authors

J Preiss

M Stevenson



Abstract

Background
The volume of research published in the biomedical domain has increasingly lead to researchers focussing on specific areas of interest and connections between findings being missed. Literature based discovery (LBD) attempts to address this problem by searching for previously unnoticed connections between published information (also known as “hidden knowledge”). A common approach is to identify hidden knowledge via shared linking terms. However, biomedical documents are highly ambiguous which can lead LBD systems to over generate hidden knowledge by hypothesising connections through different meanings of linking terms. Word Sense Disambiguation (WSD) aims to resolve ambiguities in text by identifying the meaning of ambiguous terms. This study explores the effect of WSD accuracy on LBD performance.

Methods
An existing LBD system is employed and four approaches to WSD of biomedical documents integrated with it. The accuracy of each WSD approach is determined by comparing its output against a standard benchmark. Evaluation of the LBD output is carried out using timeslicing approach, where hidden knowledge is generated from articles published prior to a certain cutoff date and a gold standard extracted from publications after the cutoff date.

Results
WSD accuracy varies depending on the approach used. The connection between the performance of the LBD and WSD systems are analysed to reveal a correlation between WSD accuracy and LBD performance.

Conclusion
This study reveals that LBD performance is sensitive to WSD accuracy. It is therefore concluded that WSD has the potential to improve the output of LBD systems by reducing the amount of spurious hidden knowledge that is generated. It is also suggested that further improvements in WSD accuracy have the potential to improve LBD accuracy.

Citation

Preiss, J., & Stevenson, M. (2016). The effect of word sense disambiguation accuracy on literature based discovery. BMC Medical Informatics and Decision Making, 16(Sup. 1), 57. https://doi.org/10.1186/s12911-016-0296-1

Journal Article Type Article
Publication Date Jul 18, 2016
Deposit Date Nov 11, 2020
Publicly Available Date Nov 11, 2020
Journal BMC Medical Informatics and Decision Making
Publisher Springer Verlag
Volume 16
Issue Sup. 1
Pages 57
DOI https://doi.org/10.1186/s12911-016-0296-1
Publisher URL https://doi.org/10.1186/s12911-016-0296-1
Related Public URLs https://bmcmedinformdecismak.biomedcentral.com/
Additional Information Funders : Engineering and Physical Sciences Research Council (EPSRC)
Grant Number: EP/J008427/1

Files





Downloadable Citations