Vee Satayamas (@veer66)

Top repositories

1

wordcut

Thai word breaker for Node.js
JavaScript
139
star
2

PhlongTaIam

PHP Thai word breaker
PHP
34
star
3

chamkho

Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
Rust
34
star
4

Yaitron

Yaitron English-Thai and Thai-English dictionary
Python
28
star
5

mapkha

Thai word segmentation program in Go
Go
27
star
6

thailang4r

Thai language utility for Ruby
Ruby
25
star
7

wordcutpy

A simple word breaker written in Python
Python
18
star
8

wordcut-engine

Word segmentation library in Rust
Rust
9
star
9

thaiwordseg

Thai word segmenter written in C
C
8
star
10

pdf2txt_th

Thai PDF-to-text script
Ruby
7
star
11

bhasati

Bhasati is a Mastodon client for desktop written in Ruby using GTK+
Ruby
5
star
12

corenlptut

Python
4
star
13

wordcut-clj

A word segmentation tool for ASEAN languages written in Clojure
Clojure
4
star
14

cl-wordcut

Word segmentation tools for ASEAN languages written in Common Lisp
Common Lisp
3
star
15

chamkho-pg

Rust
3
star
16

word-freq

Word frequency counter written in Rust
Rust
3
star
17

lao-dictionary

Automatically exported from code.google.com/p/lao-dictionary
3
star
18

cl-rocksdb

RocksDB binding for Common Lisp
Common Lisp
3
star
19

simple-compojure-api-buddy-example

Simple compojure-api + buddy example
Clojure
2
star
20

thai-wordnet-db

Awk
2
star
21

learn_awk

2
star
22

admichat

A simple web chat for talking to a web admin
Rust
2
star
23

thaidix

Free English-Thai dictionary for machine translation
Ruby
2
star
24

vrocket

A hello world example of Rocket.rs with a run.sh that auto-reloads the server
Rust
2
star
25

khatson

Attacut port to Rust
Rust
2
star
26

wordcut-json-rpc-server

Wordcut JSON-RPC server
Ruby
2
star
27

tha-eng-wn

tha-eng-wn is a Thai-English bidix generator from WordNet
Ruby
2
star
28

switch

NodeMCU based switch controller server
JavaScript
2
star
29

utf8-input-stream

A UTF-8 string input stream over a binary stream for Common Lisp
Common Lisp
2
star
30

prolog-sheet

Prolog
1
star
31

evbcorpus

Automatically exported from code.google.com/p/evbcorpus
1
star
32

wordcut-server.js

Wordcut server
JavaScript
1
star
33

stream-par-procs

Stream parallel processors for Common Lisp
Common Lisp
1
star
34

disp_amphi

JavaScript
1
star
35

mgawika

mgawika is a PostgreSQL extension that enables full-text searching on almost every known human language.
Rust
1
star
36

asean-word-freq

ASEAN word occurrence counter from HTML files
Ruby
1
star
37

prefixtree

prefixtree is a simple prefix tree based HashMap
Rust
1
star
38

rum3

rum3 is an example of figwheel + rum + ring + bidi usage.
Clojure
1
star
39

mapkha-cli

mapkha-cli is a command line tool for Mapkha - Thai word segmentation (wordcut; word boundary identification; ตัดคำ) program in Go (golang)
Go
1
star
40

lemma_srv

Python
1
star
41

libre-thai-chat-logs

1
star
42

parallel_corpus_tool

A tool for loading parallel corpus
Rust
1
star
43

moses-smt-docker

Dockerfile
1
star
44

thai_romanize

Ruby
1
star
45

thai-pos

Thai word breaker and part-of-speech tagger
Clojure
1
star
46

wordcut-guile

Word segmentation tool written in Scheme (GNU Guile)
E
1
star
47

wordcut.rb

ASEAN word tokenizer written in Ruby
Ruby
1
star
48

wordcut-x

A word segmentation tool for ASEAN languages written in Java
Java
1
star
49

wordlist-collector

Shell
1
star
50

reinarb

A toolset for Apertium written in Ruby
Ruby
1
star
51

wordcutw

A C-interface wrapper for Wordcut - a Lao/Thai word segmentation/breaking library
Rust
1
star
52

entity-gen

An Emacs Lisp script for generating simple JPA entity/class code from a PostgreSQL table
Emacs Lisp
1
star