This project is concerned with a description of the Khmer script. It consists of a character specification and the character encoding, in particular the structure of the Khmer character encoding. The main components of this project are:
- Character Encoding Specification
- [Khmer Encoding Structure](encoding_structure/Khmer Encoding Structure.pdf)
- Supporting Software
- khnormal a Python program for normalising and testing strings according to the Khmer Encoding Structure
- khmer-normalizer a Javascript implementation of khnormal (external project)
This document seeks to describe how the Khmer script is used in the following languages:
- Khmer language,
- ethnic minority languages,
- Pali and
- Sanskrit.
There is always a gap between the developer communities and the linguistic communities when it comes to Khmer character specification or usages. The aim is to fill in the gap when it comes to the use of Khmer characters.
You are welcomed to contribute to this project so long as you abide by the license below. You can either create issues for us to fix or a PR for us to review.
In order to properly view ths document, it is recommended to use "Khmer Mondulkiri" or "Khmer OS" font family. Chrome 81.0.4044.129 does not render the below subscript followed by the first vowel correctly as opposed to Firefox 76.0 (i.e. try this ស្មាធិ៍).
Khmer Character Specification/Usages by SIL is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
- Markdown in Github does not render inline style correctly, i.e.
- text highlight
- font color
- text indentation in reference section
To help with this rendering issue, a PDF version - size A5 of this document was generated from an exported HTML using MarkdownViewer++ and Microsoft Print to PDF on Firefox 76.0. Khmer Mondulkiri font is used.