Bi-directional (BiDi) layout implementation in pure python
The algorithm starts with a single entry point bidi.algorithm.get_display.
Required arguments:
unicode_or_str
: The original unicode or string (i.e.: storage). If it's a string use the optional argumentencoding
to specify it's encoding.
Optional arguments:
encoding
: If unicode_or_str is a string, specifies the encoding. The algorithm uses unicodedata which requires unicode. This encoding will be used to decode and encode back to string before returning (default: "utf-8").upper_is_rtl
: True to treat upper case chars as strong 'R' for debugging (default: False).base_dir
: 'L' or 'R', override the calculated base_level.debug
: True to display (using sys.stderr) the steps taken with the algorithm (default: False).
Returns the display layout, either as unicode or encoding
encoded string
(depending on the type of unicode_or_str'
).
Example:
>>> from bidi.algorithm import get_display >>> get_display(u'car is THE CAR in arabic', upper_is_rtl=True) u'car is RAC EHT in arabic'
pybidi
is a command line utility (calling bidi.main
) for running the
bidi algorithm. the script can get a string as a parameter or read text from
stdin. Usage:
$ pybidi -h Usage: pybidi [options] Options: -h, --help show this help message and exit -e ENCODING, --encoding=ENCODING Text encoding (default: utf-8) -u, --upper-is-rtl treat upper case chars as strong 'R' for debugging (default: False). -d, --debug Output to stderr steps taken with the algorithm -b BASE_DIR, --base-dir=BASE_DIR Override base direction [L|R]
Examples:
$ pybidi -u 'car is THE CAR in arabic' car is RAC EHT in arabic $ cat ~/Documents/example.txt | pybidi ...
See docs/INSTALL.rst
To run the tests:
python setup.py test
Some explicit tests are failing right now (see TODO)