unicode
: functions module for shellfire
This module provides a simple framework for encoding unicode
with a shellfire application. An example user is the jsonreader module, which uses it to convert unicode code points to UTF-8 strings.
- Tag
release_2015.0117.1750-1
is compatible with shellfire releaserelease_2015.0117.1750-1
.
To encode the ellipsis … code point as UTF-8 (0xE280A6), use the following code:-
# U+2026
unicode_utf8_encodeCodePoint 8230
# Or, if using hexadecimal
unicode_utf8_encodeCodePoint 0x2026
To import this module, add a git submodule to your repository. From the root of your git repository in the terminal, type:-
mkdir -p lib/shellfire
cd lib/shellfire
git submodule add "https://github.com/shellfire-dev/unicode.git"
cd -
git submodule init --update
You may need to change the url https://github.com/shellfire-dev/unicode.git
above if using a fork.
You will also need to add paths - include the module paths.d.
This currently exposes no functions.
This namespace exposes functions to convert a code point to a UTF-8 sequence.
If calling from another shellfire module, add to your shell code the line
core_usesIn unicode utf8
in the global scope (ie outside of any functions). A good convention is to put it above any function that depends on functions in this module. If using it directly in a program, put this line inside the _program()
function:-
_program()
{
core_usesIn unicode utf8
…
}
Parameter | Value | Optional |
---|---|---|
codePoint |
A decimal, octal (preceeded by 0 ) or hexadecimal value (preceeded by 0x ). |
No |
Writes the UTF-8 sequence of bytes for the code point to standard out. Exits with an error for an incorrect value.
This namespace exposes functions to convert a code point to a UTF-8 sequence.
If calling from another shellfire module, add to your shell code the line
core_usesIn unicode utf8
in the global scope (ie outside of any functions). A good convention is to put it above any function that depends on functions in this module. If using it directly in a program, put this line inside the _program()
function:-
_program()
{
core_usesIn unicode utf8
…
}
Parameter | Value | Optional |
---|---|---|
codePoint |
A decimal, octal (preceeded by 0 ) or hexadecimal value (preceeded by 0x ). |
No |
Writes the UTF-16 sequence of bytes for the code point to standard out. Exits with an error for an incorrect value. Writes correct surrogate pairs.