• Stars
    star
    208
  • Rank 189,015 (Top 4 %)
  • Language
    Common Lisp
  • License
    Other
  • Created almost 12 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Common Lisp NLP toolset

Build Status Documentation Status

CL-NLP -- a Lisp NLP toolkit

Brief description

Eventually, CL-NLP will provide a comprehensive and extensible set of tools to solve natural language processing problems in Common Lisp.

The goals of the project include the following:

  • support for constructing arbitrary NLP pipelines on top of it
  • support for easy and fast experimentation and development of new models and approaches
  • serve as a good framework for teaching NLP concepts

It comprises of a number of utility/horizontal and end-user/vertical modules that implement the basic functions and provide a way to add own extensions and models.

The utility layer includes:

  • tools for transforming raw natural language text, as well as various corpora into a form suitable for further processing
  • basic support for language modelling
  • support for a number of linguistic concepts
  • support for working with machine learning models and a number of training algorithms

The end-user layer will provide:

  • POS taggers
  • constituency parsers
  • dependency parsers
  • other stuff (will be added step-by-step, suggestions are welcome)

How to start working with CL-NLP

The project has already reached a stage of usefulness for the primary author: for instance, it supports my current language modelling experiments by providing easy access to treebanks and other utilities.

Yet, it is far from being production-ready. So, if you want to use it for production tasks, expect to bleed on the bleeding edge.

Otherwise, if you want to contribute to developing the toolkit, you're very welcome. Here are a few write-ups to give you the sense of the project and to help get started:

You'll also, probably, need to track the latest version of RUTILS from git.

For CL-NLP to reach v.0.1 that may be considered suitable for limited use by non-contributors, the following things should be finished (work-in-progress):

  • implement a comprehensive test-suite and fix all bugs encountered in the process
  • describe available models and their quality metrics

Technical notes

Dependencies

For development:

License

The license of CL-NLP is Apache 2.0.

Specific models may have different license due to the limitations of the dataset they are built with. Please see a <model>.license file accompanying each model for details.

(c) 2013-2014, Vsevolod Dyomkin [email protected]

More Repositories

1

rutils

Radical Utilities for Common Lisp
Common Lisp
234
star
2

cl-redis

Redis client for Common Lisp
Common Lisp
180
star
3

kpi-spos

Учебный курс "Операционные системы"
Common Lisp
75
star
4

progalgs-code

Code for the book "Programming Algorithms"
Common Lisp
48
star
5

should-test

A minimal yet feature-rich Common Lisp test framework
Common Lisp
42
star
6

wiki-lang-detect

Text language identification using Wikipedia data
Common Lisp
29
star
7

crawlik

Lisp web crawler and scrapper
Common Lisp
25
star
8

cl-agraph

CL-AGRAPH — A minimal portable Lisp client for AllegroGraph
Common Lisp
24
star
9

cl-parsec

Parser combinators in Common Lisp
Common Lisp
21
star
10

prj-algo

Курс по алгоритмике в Projector
C#
21
star
11

prj-nlp-2020

Home of Projector's "Data Science. Natural Language Processing" 2020 Edition
Jupyter Notebook
18
star
12

lisp-crash-ru

Креш-курс по Lisp
Common Lisp
16
star
13

prj-algo2

Java
14
star
14

flight-recorder

Robust REPL logging facility
Common Lisp
13
star
15

const-table

Minimal Perfect Hash-Table CL Implementation
Common Lisp
12
star
16

cl-who

Forked version of Edi Weitz's HTML generator with support for macro definition
Common Lisp
10
star
17

prj-algo3

Common Lisp
9
star
18

nuts

Non-Unit Test Suite -- a test library for Common Lisp
Common Lisp
8
star
19

plain-text

A tool to extract plain text from HTML pages
Common Lisp
8
star
20

date-utils

Date utilities for Common Lisp, that use an internal plist representation of dates
Common Lisp
5
star
21

cl-git-store

Use git as storage from CL
Common Lisp
4
star
22

arnesi

Fork of Marco Barringer's arnesi (http://common-lisp.net/project/bese/arnesi.html)
Common Lisp
4
star
23

lang-uk

Ukrainian language processing using CL-NLP
Common Lisp
4
star
24

asdfx

Some potential ASDF extensions
Common Lisp
2
star
25

sc-dsalg

Data Structures & Algorithms Course @ SoftConstruct
Common Lisp
2
star
26

biochat2

Matching bioinformatics dataset for fun and profit
Common Lisp
2
star
27

vseloved.github.io

My personal website
HTML
1
star
28

asdf2-versions

Broader support for versions in ASDF2
Common Lisp
1
star
29

scripts

Misc shell & other scripts
Emacs Lisp
1
star