Standardising Pronunciation for a Grapheme-to-Phoneme Converter for Faroese

Sandra Saxov Lamhauge, Iben Nyholm Debess, Carlos Daniel Hernández Mena, Annika Simonsen, Jón Gudnason

Research output: Contribution to conferencePaperpeer-review

5 Downloads (Pure)


Pronunciation dictionaries allow computational modelling of the pronunciation
of words in a certain language and are widely used in speech technologies, especially in the fields of speech recognition and synthesis. On the other hand, a grapheme-to-phoneme tool is a generalization of a pronunciation dictionary that is not limited to a given and finite vocabulary. In this paper, we present a set of standardized phonological rules for the Faroese language; we introduce FARSAMPA, a machine-readable character set suitable for phonetic transcription of Faroese, and we present a set of grapheme-to-phoneme models for Faroese, which are publicly available and shared under a creative commons license. We
present the G2P converter and evaluate the performance. The evaluation shows reliable results that demonstrate the quality of the data.
Original languageEnglish
Number of pages10
Publication statusPublished - May 2023
EventNodalida 2023: Nordic Conference on Computational Linguistics - Tórshavn, Faroe Islands
Duration: 22 May 202324 May 2023
Conference number: 24


ConferenceNodalida 2023
Abbreviated titleNodalida
Country/TerritoryFaroe Islands
Internet address


  • pronunciation dictionaries
  • computational modeling
  • speech technologies
  • grapheme-to-phoneme tool
  • machine-readable character set
  • Faroese language


Dive into the research topics of 'Standardising Pronunciation for a Grapheme-to-Phoneme Converter for Faroese'. Together they form a unique fingerprint.

Cite this