• Stars
    star
    18
  • Rank 1,208,065 (Top 24 %)
  • Language
    Python
  • License
    Other
  • Created about 4 years ago
  • Updated 2 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Python code for training an LSTM model for word segmentation in Thai, Burmese, and similar languages.

More Repositories

1

icu

The home of the ICU project source code.
C++
2,111
star
2

icu4x

Solving i18n for client-side and resource-constrained environments.
Rust
1,078
star
3

cldr

The home of the Unicode Common Locale Data Repository
Java
828
star
4

last-resort-font

Last Resort Font
673
star
5

cldr-json

JSON Data from the Unicode CLDR Project
Shell
447
star
6

message-format-wg

Developing a standard for localizable message strings
231
star
7

text-rendering-tests

Unicode’s test suite for text rendering engines
HTML
166
star
8

unilex

Lexical data at Unicode
Clojure
63
star
9

unicodetools

home of unicodetools and https://util.unicode.org JSPs
HTML
51
star
10

icu-data

ICU Data Repository
Java
32
star
11

icu-docs

Docs (API, Userguide) for ICU
HTML
25
star
12

cldr-staging

Proposed production data for CLDR data
HTML
25
star
13

cjk-symbols

CJK Symbols
PostScript
22
star
14

icu-demos

sample apps for ICU (formerly icuapps)
Java
20
star
15

unihan-database

For review of draft Unihan database changes, removals, and additions by experts.
18
star
16

uk-source-ideographs

UK-Source Ideographs
11
star
17

jira-github-pr-check

Checks GitHub pull requests for valid and accepted Jira tickets. Used for ICU and CLDR
JavaScript
11
star
18

cldr-implementers-guide

Implementer's Guide for CLDR
9
star
19

uli-docs

ULI has been Archived, see https://unicode.org/uli
8
star
20

ml-confusables-generator

Generates confusables for Han script using ML techniques
Jupyter Notebook
8
star
21

rust-discuss

OmnICU-SC: For discussion of i18n in Rust.
7
star
22

unicode-org.github.io

top level index.html for https://unicode-org.github.io/
HTML
7
star
23

icu-docker

Dockerfiles for ICU development
Shell
6
star
24

icu4jni

New home of the (archived) ICU4JNI project.
Java
5
star
25

uli

ULI has been Archived, see https://unicode.org/uli
Python
4
star
26

icu-jira-safari

Note: GitHub provides this directly now.
JavaScript
4
star
27

icu-trac-tools

ICU’s trac plugins
Python
3
star
28

icu4x-docs

ICU4X Docs
HTML
3
star
29

conformance

Unicode & CLDR Data Driven Testing
Python
3
star
30

icu-trac2jira

ICU and CLDR’s Trac to JIRA conversion tool. Archived, not under active maintenance.
JavaScript
3
star
31

cldr-apps-webdriver

CLDR Survey Tool WebDriver Test Framework
Java
2
star
32

icu-remunge-svndump

munger for svndump — to be used for pre-combining ICU svn trees
Perl
2
star
33

icu-perf

ICU performance test results. Maintained by ICU-TC
JavaScript
2
star
34

template-repo

Template Repository for Unicode projects
1
star