Awesome Go Text Processing

  • align align 84
    star
    updated about 3 years ago MIT License

    A general purpose application and library for aligning text.

  • allot allot 56
    star
    updated over 1 year ago MIT License

    Parse placeholder and wildcard text commands

  • bafi bafi 47
    star
    updated almost 3 years ago MIT License

    Universal JSON, BSON, YAML, CSV, XML converter with templates

  • updated 4 months ago MIT License

    Converter from BBCode to HTML

  • updated almost 2 years ago MIT License

    The Go library that will drive you to AOP world!

  • updated 10 months ago Other

    Blackfriday: a markdown processor for Go

  • updated 8 months ago Other

    bluemonday: a fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer) to scrub user generated content of XSS

  • updated over 6 years ago GNU General Publi...

    Simple script for farm free books from PackPub.com

  • updated over 1 year ago BSD 2-Clause "Sim...

    CSS selector library in Go

  • cat cat 93
    star
    updated about 1 year ago The Unlicense

    Extract text from plaintext, .docx, .odt and .rtf files. Pure go.

  • updated about 5 years ago Other

    🌲 Parses indented code and returns a tree structure.

  • colly colly 23,245
    star
    updated 4 months ago Apache License 2.0

    Elegant Scraper and Crawler Framework for Golang

  • updated over 1 year ago MIT License

    🍫 A collection of common regular expressions for Go

  • updated over 3 years ago BSD 3-Clause "New...

    csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.

  • updated over 1 year ago BSD 3-Clause "New...

    Extract structured data from web sites. Web sites scraping.

  • did did 88
    star
    updated almost 2 years ago Apache License 2.0

    A golang package to work with Decentralized Identifiers (DIDs)

  • doi doi 7
    star
    updated about 2 years ago MIT License

    Parse and check doi objects in go.

  • updated 9 months ago MIT License

    Read edifact into your Go data structures.

  • updated 8 months ago MIT License

    EditorConfig Core written in Go

  • enca enca 16
    star
    updated over 8 years ago MIT License

    Minimal cgo bindings for libenca

  • updated about 5 years ago Apache License 2.0

    Go package provides a generic interface to encoders and decoders

  • updated over 1 year ago MIT License

    Go library for detecting and decoding various content front matter formats

  • genex genex 77
    star
    updated almost 5 years ago MIT License

    Genex package for Go

  • updated about 1 year ago MIT License

    GitHub Flavored Markdown renderer with fenced code block highlighting, clickable header anchor links.

  • updated about 5 years ago

    Stupid simple slide presenter or static site creator.

  • updated over 2 years ago MIT License

    📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

  • updated 6 months ago MIT License

    Go Mathematical Expression Toolkit. Run-time mathematical expression parser and evaluation engine.

  • updated almost 2 years ago BSD 3-Clause "New...

    go-fasttld is a high performance effective top level domains (eTLD) extraction module.

  • gofeed gofeed 2,264
    star
    updated about 1 year ago MIT License

    Parse RSS, Atom and JSON feeds in Go

  • updated 10 months ago MIT License

    Encoding and decoding for fixed-width formatted data

  • updated over 1 year ago MIT License

    A sanitization-based swear filter for Go.

  • updated almost 2 years ago Other

    Parses the Graphviz DOT language in golang

  • updated about 1 year ago Other

    Go Humans! (formatters for units to human friendly sizes)

  • gojq gojq 3,016
    star
    updated 9 months ago MIT License

    Pure Go implementation of jq

  • goldmark goldmark 3,586
    star
    updated 4 months ago MIT License

    🏆 A markdown parser written in Go. Easy to extend, standard(CommonMark) compliant, well structured.

  • updated over 5 years ago MIT License

    Takes a full name and splits it into individual name parts

  • updated 4 months ago MIT License

    A NMEA parser library in pure Go

  • goq goq 256
    star
    updated over 3 years ago MIT License

    A declarative struct-tag-based HTML unmarshaling or scraping package for Go built on top of the goquery library

  • updated over 1 year ago Apache License 2.0

    Go package for scraping website recipes

  • updated over 2 years ago Apache License 2.0

    randexp for Go.

  • updated over 1 year ago MIT License

    wcwidth for golang

  • updated over 4 years ago MIT License

    Pretty Slug.

  • updated over 3 years ago Apache License 2.0

    ⚡ Light weight Golang spider framework | 轻量的 Golang 爬虫框架

  • updated almost 4 years ago Apache License 2.0

    Gotabulate - Easily pretty-print your tabular data with Go

  • gotext gotext 430
    star
    updated 6 months ago Other

    Go (Golang) GNU gettext utilities package

  • gout gout 15
    star
    updated 9 months ago Mozilla Public Li...

    Output go objects in standard formats, such as YAML, JSON, etc

  • updated over 1 year ago MIT License

    A Go library to parse and format vCard

  • updated over 1 year ago Other

    🚀 Fast and light wildcard pattern matching.

  • updated over 4 years ago Apache License 2.0

    Zero-width character detection and removal for Go

  • updated about 7 years ago MIT License

    Guess the natural language of a text in Go

  • updated about 1 year ago MIT License

    Library and cli for extracting data from HTML via CSS selectors

  • updated 4 months ago MIT License

    ⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.

  • updated 7 months ago MIT License

    🐹 render YAML as HTML in Go

  • inject inject 1,393
    star
    updated almost 6 years ago MIT License

    Package inject provides a reflect based injector.

  • updated over 1 year ago MIT License

    Colorized JSON output for Go https://godoc.org/github.com/nwidger/jsoncolor

  • updated about 1 year ago MIT License

    JSONPath with dot notation generator for golang

  • kace kace 19
    star
    updated over 6 years ago MIT License

    Common case conversions covering common initialisms.

  • koazee koazee 515
    star
    updated over 1 year ago MIT License

    A StreamLike, Immutable, Lazy Loading and smart Golang Library to deal with slices.

  • ksuid ksuid 4,121
    star
    updated over 1 year ago MIT License

    K-Sortable Globally Unique IDs

  • updated 5 months ago MIT License

    Go implementation to calculate Levenshtein Distance.

  • updated over 5 years ago Apache License 2.0

    High performance LTSV (Labeled Tab Separeted Value) reader for Go.

  • micro micro 22,579
    star
    updated over 1 year ago MIT License

    A modern and intuitive terminal-based text editor

  • mxj mxj 619
    star
    updated 8 months ago MIT License

    Decode / encode XML to/from map[string]interface{} (or JSON); extract values with dot-notation paths and wildcards. Replaces x2j and j2x packages.

  • updated almost 2 years ago MIT License
  • odf odf 73
    star
    updated over 1 year ago Do What The F*ck ...

    Open Document Format (ODF) generator library for Go.

  • updated 5 months ago MIT License

    omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.

  • pagser pagser 103
    star
    updated about 1 year ago MIT License

    Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler

  • updated almost 4 years ago Apache License 2.0

    A simple library for formatting and laying out text in your terminal

  • updated almost 8 years ago MIT License

    A string argument parser that understands quotes and backslashes

  • parth parth 46
    star
    updated 7 months ago MIT License

    Path parsing for segment unmarshaling and slicing.

  • updated about 4 years ago MIT License

    iTunes and RSS 2.0 Podcast Generator in Golang

  • radix radix 185
    star
    updated over 1 year ago BSD 2-Clause "Sim...

    A fast string sorting algorithm (MSD radix sort)

  • updated 2 months ago BSD 2-Clause "Sim...

    Web content filter that runs as an HTTP proxy

  • updated over 5 years ago GNU Affero Genera...

    Recommendation engine for Go

  • updated 6 months ago Apache License 2.0

    Match regex group into go struct using struct tags and automatic parsing

  • updated about 1 year ago GNU Affero Genera...

    Draws a flowchart graph of any Visual Novel from Renpy .rpy files !

  • sdp sdp 113
    star
    updated over 4 years ago BSD 3-Clause "New...

    RFC 4566 SDP implementation in go

  • sh sh 7,112
    star
    updated 4 months ago BSD 3-Clause "New...

    A shell parser, formatter, and interpreter with bash support; includes shfmt

  • updated about 1 year ago MIT License

    Telegram bot constructor from command-line

  • slug slug 1,126
    star
    updated 9 months ago Mozilla Public Li...

    URL-friendly slugify with multiple languages support.

  • updated over 6 years ago MIT License

    A Go slugify application that handles string

  • updated over 1 year ago MIT License

    Golang metrics for calculating string similarity and other string utility functions

  • updated over 1 year ago MIT License

    A syndication feed parser for Atom 1.0 and RSS 2.0 in Go

  • updated 5 months ago Apache License 2.0

    Tagify produces a set of tags from a given source. Source can be either an HTML page, a Markdown document or a plain text. Supports English, Russian, Chinese, Hindi, Spanish, Arabic, Japanese, German, Hebrew, French and Korean languages.

  • updated over 5 years ago MIT License

    Port of Python's "textwrap" module to Go

  • updated about 2 months ago MIT License

    Tokenizer (lexer) for golang

  • toml toml 4,464
    star
    updated 6 months ago MIT License

    TOML parser for Golang with reflection.

  • TySug TySug 18
    star
    updated over 1 year ago MIT License

    A project around helping to prevent typing typos. TySug (Typo Suggestions) suggests alternative words with respect to keyboard layouts

  • xml xml 1
    star
    updated over 8 years ago

    Package feed implements a flexible, robust and efficient RSS and Atom parser

  • xquery xquery 156
    star
    updated over 6 years ago MIT License

    Extract data or evaluate value from HTML/XML documents using XPath

  • xurls xurls 1,175
    star
    updated 5 months ago BSD 3-Clause "New...

    Extract urls from text