What are knowledge graphs and how can they support under-resourced languages?

Sep 6, 2024·
Elwin Huaman
Elwin Huaman
· 0 min read
Image credit: Elwin Huaman
Abstract
While Knowledge Graphs (KGs) have revolutionized applications for high-resource languages like English and Spanish, the majority of the world’s 7,000 languages remain digitally marginalized. This paper addresses this disparity by presenting a robust framework for building KGs specifically designed for under-resourced languages. Utilizing a task-based methodology, we detail a comprehensive lifecycle covering the creation, hosting, curation, and deployment phases. Our results demonstrate that this structured approach not only enhances the digital presence of minority languages but also provides a scalable architecture for representing multilingual data, text, and media. We conclude with actionable best practices to guide future development in indigenous and under-resourced language technologies.
Type
Publication
arXiv
publication
Elwin Huaman
Authors
Research Scientist
Elwin Huaman is a Researcher, Project Manager, and an Activist for Under-Resourced Languages. He has experience creating Knowledge Graphs and applying Semantic Web technologies in academia and industry. He has co-authored the book: Knowledge Graphs - Methodology, Tools and Selected Use Cases, and leads the QICHWABASE Knowledge Graph that supports a harmonization process of the language and knowledge of Quechua communities across the world.