• Stars
    star
    1
  • Language
    Ruby
  • Created about 6 years ago
  • Updated about 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

ASEAN word occurrence counter from HTML files

More Repositories

1

wordcut

Thai word breaker for Node.js
JavaScript
139
star
2

PhlongTaIam

PHP Thai word breaker
PHP
34
star
3

chamkho

Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
Rust
34
star
4

Yaitron

Yaitron English-Thai and Thai-English dictionary
Python
28
star
5

mapkha

Thai word segmentation program in Go
Go
27
star
6

thailang4r

Thai language utility for Ruby
Ruby
25
star
7

wordcutpy

A simple word breaker written in Python
Python
18
star
8

wordcut-engine

Word segmentation library in Rust
Rust
9
star
9

thaiwordseg

Thai word segmenter written in C
C
8
star
10

pdf2txt_th

Thai pdf to text script
Ruby
7
star
11

bhasati

Bhasati is a Mastodon client for desktop written in Ruby using GTK+
Ruby
5
star
12

corenlptut

Python
4
star
13

wordcut-clj

A word segmentation tool for ASEAN languages written in Clojure
Clojure
4
star
14

cl-wordcut

Word segmentation tools for ASEAN languages written in Common Lisp
Common Lisp
3
star
15

chamkho-pg

Rust
3
star
16

word-freq

Word frequency counter written in Rust
Rust
3
star
17

lao-dictionary

Automatically exported from code.google.com/p/lao-dictionary
3
star
18

cl-rocksdb

RocksDB binding for Common Lisp
Common Lisp
3
star
19

simple-compojure-api-buddy-example

Simple compojure-api + buddy example
Clojure
2
star
20

thai-wordnet-db

Awk
2
star
21

learn_awk

2
star
22

admichat

A simple web chat for talking web admin
Rust
2
star
23

thaidix

Free English-Thai dictionary for machine translation
Ruby
2
star
24

vrocket

A hello world example of Rocket.rs with run.sh that auto reload the server
Rust
2
star
25

khatson

Attacut port to Rust
Rust
2
star
26

wordcut-json-rpc-server

Wordcut JSON-RPC server
Ruby
2
star
27

tha-eng-wn

tha-eng-wn is Thai-English bidix generator from Wordnet
Ruby
2
star
28

switch

NodeMCU based switch controller server
JavaScript
2
star
29

utf8-input-stream

A UTF-8 string input stream over a binary stream for Common Lisp
Common Lisp
2
star
30

prolog-sheet

Prolog
1
star
31

evbcorpus

Automatically exported from code.google.com/p/evbcorpus
1
star
32

wordcut-server.js

Wordcut server
JavaScript
1
star
33

stream-par-procs

Stream parallel processors for Common Lisp
Common Lisp
1
star
34

disp_amphi

JavaScript
1
star
35

mgawika

mgawika is a PostgreSQL extension that enables full-text searching on almost every known human language.
Rust
1
star
36

prefixtree

prefixtree is a simple prefix tree based HashMap
Rust
1
star
37

rum3

rum3 is an example figwheel + rum + ring + bidi usages.
Clojure
1
star
38

mapkha-cli

mapkha-cli is a command line tool for Mapkha - Thai word segmentation (wordcut; word boundary identification; ตัดคำ) program in Go (golang)
Go
1
star
39

lemma_srv

Python
1
star
40

libre-thai-chat-logs

1
star
41

parallel_corpus_tool

A tool for loading parallel corpus
Rust
1
star
42

moses-smt-docker

Dockerfile
1
star
43

thai_romanize

Ruby
1
star
44

thai-pos

Thai word breaker and part-of-speech tagger
Clojure
1
star
45

wordcut-guile

Word segmentaton tool written in Scheme (GNU Guile)
E
1
star
46

wordcut.rb

ASEAN word tokenizer written in Ruby
Ruby
1
star
47

wordcut-x

A word segmentation tool for ASEAN languages written in Java
Java
1
star
48

wordlist-collector

Shell
1
star
49

reinarb

A toolset for Apertium written in Ruby
Ruby
1
star
50

wordcutw

A C-interface wrapper for Wordcut - a Lao/Thai word segmentation/breaking library
Rust
1
star
51

entity-gen

A Emacs Lisp script for generating simple JPA entity/class code from a PostgreSQL table
Emacs Lisp
1
star