• Stars
    star
    2,197
  • Rank 20,829 (Top 0.5 %)
  • Language
    Python
  • License
    Other
  • Created over 8 years ago
  • Updated 12 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

NLP, before and after spaCy

textacy: NLP, before and after spaCy

textacy is a Python library for performing a variety of natural language processing (NLP) tasks, built on the high-performance spaCy library. With the fundamentals --- tokenization, part-of-speech tagging, dependency parsing, etc. --- delegated to another library, textacy focuses primarily on the tasks that come before and follow after.

build status current release version pypi version conda version

features

  • Access and extend spaCy's core functionality for working with one or many documents through convenient methods and custom extensions
  • Load prepared datasets with both text content and metadata, from Congressional speeches to historical literature to Reddit comments
  • Clean, normalize, and explore raw text before processing it with spaCy
  • Extract structured information from processed documents, including n-grams, entities, acronyms, keyterms, and SVO triples
  • Compare strings and sequences using a variety of similarity metrics
  • Tokenize and vectorize documents then train, interpret, and visualize topic models
  • Compute text readability and lexical diversity statistics, including Flesch-Kincaid grade level, multilingual Flesch Reading Ease, and Type-Token Ratio

... and much more!

links

maintainer

Howdy, y'all. πŸ‘‹

More Repositories

1

jChartbeat

A simple and lightweight jQuery plugin for the Chartbeat API
JavaScript
34
star
2

bigboard

Always know what's up. A simple leader-board style visualization to show your best stories on your biggest screen.
CSS
21
star
3

wordpress

Chartbeat Wordpress plugin
PHP
16
star
4

ottla

An opinionated clojure framework for writing kafka machines
Clojure
15
star
5

asana-scrum-board

JavaScript
15
star
6

parselmouth

An object-oriented interface for abstracting away the ugly parts of ad server APIs
Python
14
star
7

puppet-mdadm

Puppet module for managing md raid arrays
Ruby
14
star
8

wade

We're All Database Engineers
Python
14
star
9

map

Knock, knock. Explore a map and watch your visitors drop in from around the world in full screen fashion.
JavaScript
13
star
10

ChartbeatNotifier

Mac OS X app for Chartbeat data
Objective-C
10
star
11

trepl

Generic Tiered Replication implementation.
Python
6
star
12

visualbigboard

Take on a "visual" big board
JavaScript
6
star
13

cljbeat-opentsdb

Clojure OpenTSDB client library
Clojure
5
star
14

lein-assembly

The missing assembly plugin for leiningen
Clojure
5
star
15

analyzecal

Analyze Google Calendar Data
Python
4
star
16

chartbeat-angular

JavaScript
4
star
17

gulp-slurpee

Run the same gulp task from multiple gulpfile's concurrently.
JavaScript
4
star
18

bigboardmosaic

Size matters: Pages are represented as blocks, the larger the block, the greater its share of traffic. Color denotes dominant traffic source (Newsbeat sites only).
JavaScript
4
star
19

pygoth2014

Bend Postgres to Your Python Will (PyGotham 2014 talk)
Python
3
star
20

puppet-varnish

Puppet Module to manage Varnish on Debian
Ruby
3
star
21

sublime-closure

Closure syntax checker for Sublime
Python
3
star
22

iflow

Watch your traffic fly from one page to another. See which pages are driving the most traffic, and which ones need some extra love.
JavaScript
3
star
23

puppet-confluent-kafka

Unofficial puppet module for managing confluent's kafka distribution
Puppet
2
star
24

kafkameta

Python library for interacting with Kafka's metadata via ZooKeeper
Python
2
star
25

chartbeat-ios-sdk

Objective-C
2
star
26

swailing

An opinionated logging library for applications that produce a debilitating amount of log output.
Python
2
star
27

chartbook

A plugin for combining chartbeat and facebook data. It shows images and share on chartbeat top pages.
JavaScript
2
star
28

statman

A stats gathering library for clojure based on jmx
Clojure
1
star
29

universe

Have you ever wondered what your site would look like in the sky? Wonder no more.
JavaScript
1
star
30

percentoftheinternet

JavaScript
1
star
31

cljbeat-options

Clojure command line option parsing tools
Clojure
1
star