A Bibliometric Analysis of Techniques for Word Sense Disambiguation in Morphologically Rich Languages

  • Hlaudi D. Masethe*
  • , Mosima A. Masethe
  • , Sunday O. Ojo
  • , Pius A. Owolawi
  • , Fausto Giunchiglia
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Word Sense Disambiguation (WSD) continues to provide considerable difficulty in Natural Language Processing, especially for morphologically rich languages (MRLs), where intricate word forms and inflections exacerbate lexical ambiguity. The absence of extensive linguistic resources and cross-lingual tools for these languages exacerbates the challenges in developing efficient WSD systems. This study does a bibliometric analysis of significant publications and co-citation networks in the domain of WSD, concentrating on MRLs, to investigate how the scientific community has tackled this problem. The research used VOSviewer to illustrate the intellectual framework of the topic by mapping prominent authors, citation trends, and thematic clusters derived from data obtained from major academic databases. The research designates Roberto Navigli as the preeminent academic, with 530 citations and a total link strength of 4914, mostly due to his contributions to BabelNet and graph-based disambiguation techniques. Additional prominent contributions are Eduardo Agirre (semantic similarity), Alessandro Raganato (WSD assessment frameworks), and Martha Palmer (VerbNet and PropBank). Other prominent individuals, like Hwee Tou Ng, Christiane Fellbaum, Rada Mihalcea, and Ted Pedersen, are acknowledged for their contributions to the development of symbolic, statistical, and hybrid word sense disambiguation approaches. The study identifies a cohort of under-cited but significant researchers, such as Pushpak Bhattacharyya, David Yarowsky, and Alexander Gelbukh, whose work underscores the disjointed character of cross-linguistic research in low-resource contexts. The co-citation analysis indicates a robust research foundation focused on common tools and frameworks, while highlighting the essential need for enhanced international cooperation to broaden WSD solutions for under-represented morphologically rich languages.

Original languageEnglish
Title of host publication2025 Annual IEEE Conference on Information Communication Technology and Society, ICTAS 2025 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331531553
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event2025 Annual IEEE Conference on Information Communication Technology and Society, ICTAS 2025 - Durban, South Africa
Duration: 23 Jul 202525 Jul 2025

Publication series

Name2025 Annual IEEE Conference on Information Communication Technology and Society, ICTAS 2025 - Proceedings

Conference

Conference2025 Annual IEEE Conference on Information Communication Technology and Society, ICTAS 2025
Country/TerritorySouth Africa
CityDurban
Period23/Jul/2525/Jul/25

Keywords

  • Bibliometric Analysis
  • Morphologically Rich Languages (MRLs)
  • Natural Language Processing (NLP)
  • VOSviewer
  • Word Sense Disambiguation (WSD)

Fingerprint

Dive into the research topics of 'A Bibliometric Analysis of Techniques for Word Sense Disambiguation in Morphologically Rich Languages'. Together they form a unique fingerprint.

Cite this