• Stars
    star
    122
  • Rank 282,976 (Top 6 %)
  • Language
    HTML
  • License
    GNU Affero Genera...
  • Created almost 6 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A text analysis application for performing common NLP tasks through a web dashboard interface and an API

NLPBuddy - Open Source Text Analysis Tool

About the project

NLPBuddy is a text analysis application for performing common NLP tasks through a web dashboard interface and an API.

It leverages Spacy for the NLP tasks plus Gensim's implementation of the TextRank algorithm for text summarization.

It supports texts in the following languages: Greek, English, German, Spanish, Portoguese, French, Italian and Dutch. Language identification is performed automatically through langid

Tasks include:

  1. Text tokenization
  2. Sentence splitting (lemmatized sentences too)
  3. Part of Speech tags identification (verbs, nouns etc)
  4. Named Entity Recognition (Location, Person, Organisation etc)
  5. Text summarization (using TextRank algorithm, implemented by Gensim)
  6. Keywords extraction
  7. Language identification
  8. For the Greek language, Categorization of text

Text can either be provided or imported after specifying a url - we use library python readability for this plus BeautifulSoup4

The Greek classifier is built with FastText and is trained in 20.000 articles labeled in these categories.

Demo

A working demo can be found on http://www.nlpbuddy.io/

Usage

Enter text and hit 'Analyze it',

alt text

API Usage

https://github.com/eellak/text-analysis/wiki/API-usage

Installation

Find development and deployment instructions here: https://github.com/eellak/text-analysis/wiki/Install

License

The code is provided under the GNU AGPL v3.0 License.

More Repositories

1

gsoc2018-spacy

[GSOC] Greek language support for spacy.io python NLP software
Python
95
star
2

glossAPI

Ground work for a Greek Open Source LLM -- Εργασίες θεμελίωσης ενός Ελληνικού LLM Ανοιχτού Κώδικα
Python
79
star
3

fossbot

40
star
4

gsoc2018-3gm

💫 Automated codification of Greek Legislation with NLP
Python
40
star
5

gsoc2021-audio-annotation-tool

Creation of a multi user audio first annotation tool - GSoC 2021
HTML
27
star
6

gsoc2019-greek-morpho

Greek open source Morphological dictionary and application of it to Greek spelling tools
Python
25
star
7

gsoc2019-diyrobot

A DIY robot kit for educators
Jupyter Notebook
21
star
8

gsoc2019-sphinx

Creation of an online Greek mail dictation system, using Sphinx and personalized acoustic/language model training
Python
19
star
9

build-recorder

C
18
star
10

gsoc2019-tms

Web-app of a Thesis management system
TypeScript
17
star
11

panoptis2016

Τεκμηρίωση για τα επεισόδια της Άσκησης Κυβερνοάμυνας Πανόπτης 2016
Shell
17
star
12

gsoc2018-librecust

LibreOffice customization and creation of legal Templates
Python
13
star
13

gsoc17-diavgeia

Diavgeia Redefined: Redefined functionality of Diavgeia using RDF and Blockchain (GSOC 2017 GFOSS Project).
JavaScript
13
star
14

business-plan-tool

Business Plan Tool, Computer Science Department of Thessaloniki
Vue
11
star
15

gsoc2019-text-extraction

GSoC 2019: Development of a Tool for Extracting Quantitative Text Profiles
JavaScript
11
star
16

Greek-Street-Names-Directory

Greek Street Names Directory
10
star
17

Greek-perfectures-municipalities-settlements-name-directory

Λίστα Νομών - Επαρχιών - Οικισμών της Ελλάδας με ιεραρχική κωδικοποίηση & επαγγέλματα
10
star
18

woocommerce-alphabank-payment-gateway

Wordpress Woocommerce Alphabank Greece payment gateway
PHP
10
star
19

scriptum

Εφαρμογή Ηλεκτρονικού Πρωτοκόλλου & Διαχείρισης Εργασιών
Java
9
star
20

gsoc2019-3gm

Greek Government Gazette Text Mining, Cross Linking and Codification
Python
8
star
21

gsoc17module-zeus

Repository of GSOC 2017 GFOSS Project for improving Zeus.
Python
8
star
22

gsoc2018-cantarell

Greek language support to the open source fonts of Cantarell
Python
8
star
23

sisyphos

Ο Σίσυφος είναι ένα ανοιχτού κώδικα, διαδικτυακό πληροφοριακό σύστημα για τα ελληνικά Σχολεία.
JavaScript
8
star
24

clio

Clio, a web-based system for maintaining (meta-)information on software components
Python
7
star
25

gsoc2019-anonymization

Anonymisation of Sensitive Data in Public Documents
Python
7
star
26

gsoc2021-HA-Auto-Node-RED

Python
7
star
27

gsoc2021-sastixcms

Java
7
star
28

gsoc2018-pypen

Penetration Testing library written in Python
Python
7
star
29

gsoc2022--Nodered-backend-

JavaScript
6
star
30

gsoc2018-arimamadurai

Greek glyphs for the open source font Arima Madurai (NDISCOVER)
6
star
31

ansible

Infra tasks for eellak servers
HTML
6
star
32

gsoc2018-GG-extraction

NER & Metadata Extraction of the Greek Government Gazette
Python
6
star
33

transmem

Translation Memory of Greek-English terms
5
star
34

fossbot-platform

TypeScript
5
star
35

greek-commiters

Scripts used to extract developers located in Greece from GITHUB
Shell
4
star
36

OpenLab_Website

Πρότυπο δικτυακού τόπου που μπορεί να προσαρμοστεί στις ανάγκες του φορέα που υιοθετεί OpenLab
PHP
4
star
37

gsoc17-Eczar

Repository of GSOC 2017 GFOSS Project for adding Greek Glyphs in eczar fonts.
Python
4
star
38

gsoc2022-Label-buddy

Python
4
star
39

diadikasies-wiki-to-BPMN

Transform processes/services from diadfikasies.gr (CPSV based or not) to BPMN
JavaScript
4
star
40

gsoc2019-qtcontrols

Port Qt Quick Controls Calendar widget to Qt Quick Controls 2 module
4
star
41

ccradio

source code of ccradio.ellak.gr
HTML
3
star
42

gsoc2019-CScout

CScout improvements
C
3
star
43

gsoc2019-git-issue

Extend git-issue with full import and export capabilities towards GitHub and GitLab
Shell
3
star
44

opengov_diavgeia

Opengov - Διαύγεια
PHP
3
star
45

greek-placenames-directory

Κατάλογος 87490 ελληνικών τοπονυμίων σε μορφή αρχείων shp και kml
3
star
46

mlmmj-archivist

A shell script for creating web archives for mlmmj mailing lists
Shell
3
star
47

opengov

OpenGov - Διαχείριση ηλεκτρονικών προσκλήσεων στελέχωσης μετακλητών θέσεων
PHP
2
star
48

commit-timeline

Create a timeline of commits by Greek committers to public repositories
Shell
2
star
49

rescriptum

Εξελιγμένη έκδοση του scriptum
Java
2
star
50

epinoo-installation-scripts

Epinoo Platform Installation scripts
Puppet
2
star
51

gsoc17-donationbox

Repository of GSOC 2017 GFOSS Project for extending the DonationBox project
PHP
2
star
52

mediawiki-wordpress-sso-extension

WPMW+ - Wordpress + MediaWiki bridge (fork)
PHP
2
star
53

pdf_from_html_demo

Demo εφαρμογής δημιουργίας PDF από φόρμα HTML
PHP
2
star
54

gsoc2022-apothesis

C++
2
star
55

gsoc2019-UMLGraph

UMLGraph - GSoC 2019 Contributions
Java
2
star
56

gimp-el-manual

Mετάφραση του εγχειριδίου GIMP
2
star
57

gsoc2019-ltsp

Designing and implementing the new LTSP
Shell
2
star
58

greek-commiters_wp-plugins

The Wordpress plugins developed to implement the Github Contributor page
PHP
2
star
59

opengov_agora

OpenGov Αγορά
PHP
2
star
60

conference-app

Ionic Conference Application
HTML
2
star
61

anonimos-amka

Αλγόριθμος ανωνυμοποίησης ΑΜΚΑ
Python
2
star
62

countries-cities_gr-municipalities

Catalogue with Countries, Greek Cities and Municipalities
2
star
63

greek-committers-php

Script to retrieve GitHub committers based on location and prepare the information for WordPress import
PHP
2
star
64

coronamap

Χάρτης εξάπλωσης COVID-19 - Eλλάδα
CSS
2
star
65

gsoc2021-sch-webapps

JavaScript
2
star
66

wp-gpchild-ellak-theme

GeneratePress WordPress child theme for ellak.gr
PHP
1
star
67

gsoc2019-univerSiS

JavaScript
1
star
68

gsoc2019-apidesign

1
star
69

innovathens-map

Interactive map with hubs, moke etc.; for Innovathens.
CSS
1
star
70

opengov_adeies

Πρότυπη Εφαρμογή - Διαχείρισης Αδειών Στελεχών Περιφέρειας
PHP
1
star
71

facebook-cbp-app

Facebook CBP App
PHP
1
star
72

ypodeigma

1
star
73

opengov_site

OpenGov Πρότυπος Δικτυακός Τόπος Φορέα
PHP
1
star
74

DMS

Σύστημα Διαχείρισης Εγγράφων και Ψηφιακής Υπογραφής
1
star
75

wp-moodle-lessons-api

WP Moodle Lessons API
PHP
1
star
76

CnC_test

Test repository for Code + Create project
1
star
77

SeLCont

Synchronized eLearning Content ToolKit
PHP
1
star
78

gsoc2018-wso2

WSO2 Identity Server Userstore using Web Services to get claims
Java
1
star
79

opengov_openepad

OpenGov Open ePad - πλατφόρμα καταγραφής προβλημάτων της Δημόσιας Διοίκησης
PHP
1
star
80

gpchild-ellak-openhardware

openhardware.ellak.gr theme
PHP
1
star
81

opengov_services

Εφαρμογή διαχείρισης καταλόγου Υπηρεσιών απο το diadikasies.gr
PHP
1
star
82

opendata-around-the-world

1
star
83

wp_ultimate_member_custom

wordpress ultimate member plugin with customizations on profile update
PHP
1
star
84

ansible-laptops

Διαχείριση υπολογιστών openlabs
1
star
85

OpenLab_Files

Distribution and Documentation files for OpenLabs Wordpress theme
1
star
86

fortos

Καταγραφή Διαδικασιών και Υπολογισμός Φόρτου Εργασίας
PHP
1
star
87

gsoc2018-clio

Clio, a web-based system for maintaining (meta-)information on software components
1
star
88

mlmmj-php-web-admin

Web admin interface for mlmmj list manager
PHP
1
star
89

panoptis2019

Rich Text Format
1
star
90

eellak_fortos_generic

Εφαρμογή Καταγραφής Διαδικασιών και Υπολογισμού Φόρτου Εργασίας
PHP
1
star
91

opengov_consultations

OpenGov Wordpress Theme for Opengov Consultations
PHP
1
star
92

public-opinion-questionnaire

Public Opinion Questionnaire - poq
PHP
1
star
93

fossbot-web-simulator

GDScript
1
star
94

opengov_website

Πρότυπος Ιστοχώρος Ανοικτής Διακυβέρνησης
PHP
1
star
95

open_elearn_gr

PHP
1
star
96

block_nuke_bot

Node command line bot for blocking Mediawiki spammers and the pages they created
JavaScript
1
star