pythainlp.transliterate

The pythainlp.transliterate turns Thai text into a romanized one (put simply, spelled with English).

Modules

pythainlp.transliterate.romanize(text: str, engine: str = 'royin')str[source]

This function renders Thai words in the Latin alphabet or “romanization”, using the Royal Thai General System of Transcription (RTGS) 1. RTGS is the official system published by the Royal Institute of Thailand. (Thai: ถอดเสียงภาษาไทยเป็นอักษรละติน)

Parameters
  • text (str) – Thai text to be romanized

  • engine (str) – ‘royin’ (default) or ‘thai2rom’.

Returns

A string of Thai words rendered in the Latin alphabet.

Return type

str

Options for engines
  • royin - based on the Royal Thai General System of Transcription issued by Royal Institute of Thailand.

  • thai2rom - a deep learning-based Thai romanization engine (require PyTorch).

Example

from pythainlp.transliterate import romanize

romanize("สามารถ", engine="royin")
# output: 'samant'

romanize("สามารถ", engine="thai2rom")
# output: 'samat'

romanize("ภาพยนตร์", engine="royin")
# output: 'phapn'

romanize("ภาพยนตร์", engine="thai2rom")
# output: 'phapphayon'
pythainlp.transliterate.transliterate(text: str, engine: str = 'ipa')str[source]

This function transliterates Thai text.

Parameters
  • text (str) – Thai text to be transliterated

  • engine (str) – ‘ipa’ (International Phonetic Alphabet; default) or ‘icu’.

Returns

A string of Internaitonal Phonetic Alphabets indicating how the text should be pronounced.

Return type

str

Options for engines
  • ipa - (default) International Phonetic Alphabet (IPA)

  • icu - International Components for Unicode (ICU)

Example

from pythainlp.transliterate import transliterate

transliterate("สามารถ", engine="ipa")
# output: 'saːmaːrot'

transliterate("สามารถ", engine="icu")
# output: 's̄āmārt̄h'

transliterate("ภาพยนตร์", engine="ipa")
# output: 'pʰaːpjanot'

transliterate("ภาพยนตร์", engine="icu")
# output: 'p̣hāphyntr̒'

References

1

Nitaya Kanchanawan. (2006). Romanization, Transliteration, and Transcription for the Globalization of the Thai Language. The Journal of the Royal Institute of Thailand.