• Stars
    star
    468
  • Rank 90,613 (Top 2 %)
  • Language
    Go
  • License
    MIT License
  • Created about 6 years ago
  • Updated about 1 month ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Blazing fast syslog parser

MIT License

A parser for Syslog messages and transports.

Blazing fast Syslog parsers

By @leodido.

To wrap up, this package provides:

This library provides the pieces to parse Syslog messages transported following various RFCs.

For example:

  • TLS with octet count (RFC5425)
  • TCP with non-transparent framing or with octet count (RFC 6587)
  • UDP carrying one message per packet (RFC5426)

Installation

go get github.com/influxdata/go-syslog/v3

Docs

Documentation

The docs directory contains .dot files representing the finite-state machines (FSMs) implementing the syslog parsers and transports.

Usage

Suppose you want to parse a given sequence of bytes as a RFC5424 message.

Notice that the same interface applies for RFC3164. But you can always take a look at the examples file.

i := []byte(`<165>4 2018-10-11T22:14:15.003Z mymach.it e - 1 [ex@32473 iut="3"] An application event log entry...`)
p := rfc5424.NewParser()
m, e := p.Parse(i)

This results in m being equal to:

// (*rfc5424.SyslogMessage)({
//  Base: (syslog.Base) {
//   Facility: (*uint8)(20),
//   Severity: (*uint8)(5),
//   Priority: (*uint8)(165),
//   Timestamp: (*time.Time)(2018-10-11 22:14:15.003 +0000 UTC),
//   Hostname: (*string)((len=9) "mymach.it"),
//   Appname: (*string)((len=1) "e"),
//   ProcID: (*string)(<nil>),
//   MsgID: (*string)((len=1) "1"),
//   Message: (*string)((len=33) "An application event log entry...")
//  },
//  Version: (uint16) 4,
//  StructuredData: (*map[string]map[string]string)((len=1) {
//   (string) (len=8) "ex@32473": (map[string]string) (len=1) {
//    (string) (len=3) "iut": (string) (len=1) "3"
//   }
//  })
// })

And e being equal to nil since the i byte slice contains a perfectly valid RFC5424 message.

Best effort mode

RFC5424 parser has the ability to perform partial matches (until it can).

With this mode enabled, when the parsing process errors out it returns the message collected until that position, and the error that caused the parser to stop.

Notice that in this modality the output is returned iff it represents a minimally valid message - ie., a message containing almost a priority field in [1,191] within angular brackets, followed by a version in ]0,999] (in the case of RFC5424).

Let's look at an example.

i := []byte("<1>1 A - - - - - -")
p := NewParser(WithBestEffort())
m, e := p.Parse(i)

This results in m being equal to the following SyslogMessage instance.

// (*rfc5424.SyslogMessage)({
//  Base: (syslog.Base) {
//   Facility: (*uint8)(0),
//   Severity: (*uint8)(1),
//   Priority: (*uint8)(1),
//   Timestamp: (*time.Time)(<nil>),
//   Hostname: (*string)(<nil>),
//   Appname: (*string)(<nil>),
//   ProcID: (*string)(<nil>),
//   MsgID: (*string)(<nil>),
//   Message: (*string)(<nil>)
//  },
//  Version: (uint16) 1,
//  StructuredData: (*map[string]map[string]string)(<nil>)
// })

And, at the same time, in e reporting the error that actually stopped the parser.

// expecting a RFC3339MICRO timestamp or a nil value [col 5]

Both m and e have a value since at the column the parser stopped it already was able to construct a minimally valid RFC5424 SyslogMessage.

Builder

This library also provides a builder to construct valid syslog messages.

Notice that its API ignores input values that does not match the grammar.

Let's have a look to an example.

msg := &rfc5424.SyslogMessage{}
msg.SetTimestamp("not a RFC3339MICRO timestamp")
msg.Valid() // Not yet a valid message (try msg.Valid())
msg.SetPriority(191)
msg.SetVersion(1)
msg.Valid() // Now it is minimally valid

Printing msg you will verify it contains a nil timestamp (since an invalid one has been given).

// (*rfc5424.SyslogMessage)({
//  Base: (syslog.Base) {
//   Facility: (*uint8)(23),
//   Severity: (*uint8)(7),
//   Priority: (*uint8)(191),
//   Timestamp: (*time.Time)(<nil>),
//   Hostname: (*string)(<nil>),
//   Appname: (*string)(<nil>),
//   ProcID: (*string)(<nil>),
//   MsgID: (*string)(<nil>),
//   Message: (*string)(<nil>)
//  },
//  Version: (uint16) 1,
//  StructuredData: (*map[string]map[string]string)(<nil>)
// })

Finally you can serialize the message into a string.

str, _ := msg.String()
// <191>1 - - - - - -

Message transfer

Excluding encapsulating one message for packet in packet protocols there are two ways to transfer syslog messages over streams.

The older - ie., the non-transparent framing - and the newer one - ie., the octet counting framing - which is reliable and has not been seen to cause problems noted with the non-transparent one.

This library provide stream parsers for both.

Octet counting

In short, RFC5425 and RFC6587, aside from the protocol considerations, describe a transparent framing technique for Syslog messages that uses the octect counting technique - ie., the message length of the incoming message.

Each Syslog message is sent with a prefix representing the number of bytes it is made of.

The octecounting package parses messages stream following such rule.

To quickly understand how to use it please have a look at the example file.

Non transparent

The RFC6587 also describes the non-transparent framing transport of syslog messages.

In such case the messages are separated by a trailer, usually a line feed.

The nontransparent package parses message stream following such technique.

To quickly understand how to use it please have a look at the example file.

Things we do not support:

  • trailers other than LF or NUL
  • trailers which length is greater than 1 byte
  • trailer change on a frame-by-frame basis

Performances

To run the benchmark execute the following command.

make bench

On my machine1 these are the results obtained paring RFC5424 syslog messages with best effort mode on.

[no]_empty_input__________________________________  4524100        274 ns/op      272 B/op        4 allocs/op
[no]_multiple_syslog_messages_on_multiple_lines___  3039513        361 ns/op      288 B/op        8 allocs/op
[no]_impossible_timestamp_________________________  1244562        951 ns/op      512 B/op       11 allocs/op
[no]_malformed_structured_data____________________  2389249        512 ns/op      512 B/op        9 allocs/op
[no]_with_duplicated_structured_data_id___________  1000000       1183 ns/op      712 B/op       17 allocs/op
[ok]_minimal______________________________________  6876235        178 ns/op      227 B/op        5 allocs/op
[ok]_average_message______________________________   730473       1653 ns/op     1520 B/op       24 allocs/op
[ok]_complicated_message__________________________   908776       1344 ns/op     1264 B/op       24 allocs/op
[ok]_very_long_message____________________________   392737       3114 ns/op     2448 B/op       25 allocs/op
[ok]_all_max_length_and_complete__________________   510740       2431 ns/op     1872 B/op       28 allocs/op
[ok]_all_max_length_except_structured_data_and_mes   755124       1593 ns/op      867 B/op       13 allocs/op
[ok]_minimal_with_message_containing_newline______  6142984        199 ns/op      230 B/op        6 allocs/op
[ok]_w/o_procid,_w/o_structured_data,_with_message  1670286        732 ns/op      348 B/op       10 allocs/op
[ok]_minimal_with_UTF-8_message___________________  3013480        407 ns/op      339 B/op        6 allocs/op
[ok]_minimal_with_UTF-8_message_starting_with_BOM_  2926410        423 ns/op      355 B/op        6 allocs/op
[ok]_with_structured_data_id,_w/o_structured_data_  1558971        814 ns/op      570 B/op       11 allocs/op
[ok]_with_multiple_structured_data________________  1000000       1243 ns/op     1205 B/op       16 allocs/op
[ok]_with_escaped_backslash_within_structured_data  1000000       1025 ns/op      896 B/op       17 allocs/op
[ok]_with_UTF-8_structured_data_param_value,_with_  1000000       1241 ns/op     1034 B/op       19 allocs/op

As you can see it takes:

  • ~250ns to parse the smallest legal message

  • less than 2µs to parse an average legal message

  • ~3µs to parse a very long legal message

Other RFC5424 implementations, like this one in Rust, spend 8µs to parse an average legal message.

TBD: comparison against other Go parsers.


  • [1]: Intel Core i7-8850H CPU @ 2.60GHz

More Repositories

1

influxdb

Scalable datastore for metrics, events, and real-time analytics
Rust
27,320
star
2

telegraf

The plugin-driven server agent for collecting & reporting metrics.
Go
13,778
star
3

kapacitor

Open source framework for processing, monitoring, and alerting on time series data
Go
2,279
star
4

influxdb_iox

Pronounced (influxdb eye-ox), short for iron oxide. This is the new core of InfluxDB written in Rust on top of Apache Arrow.
Rust
1,803
star
5

influxdb-python

Python client for InfluxDB
Python
1,678
star
6

chronograf

Open source monitoring and visualization UI for the TICK stack
TypeScript
1,477
star
7

influxdb-java

Java client for InfluxDB
Java
1,156
star
8

influxdb-relay

Service to replicate InfluxDB data for high availability
Python
830
star
9

flux

Flux is a lightweight scripting language for querying databases (like InfluxDB) and working with data. It's part of InfluxDB 1.7 and 2.0, but can be run independently of those.
FLUX
753
star
10

influxdb-client-python

InfluxDB 2.0 python client
Python
664
star
11

influxdb-client-go

InfluxDB 2 Go Client
Go
572
star
12

sandbox

A sandbox for the full TICK stack
Shell
475
star
13

influxdb-php

influxdb-php: A PHP Client for InfluxDB, a time series database
PHP
430
star
14

influxdb-client-java

InfluxDB 2 JVM Based Clients
Java
412
star
15

influxdb-client-csharp

InfluxDB 2.x C# Client
C#
337
star
16

community-templates

InfluxDB Community Templates: Quickly collect & analyze time series data from a range of sources: Kubernetes, MySQL, Postgres, AWS, Nginx, Jenkins, and more.
Python
332
star
17

influxdb-client-js

InfluxDB 2.0 JavaScript client
TypeScript
316
star
18

influxdata-docker

Official docker images for the influxdata stack
Shell
314
star
19

influxdb-comparisons

Code for comparison write ups of InfluxDB and other solutions
Go
306
star
20

rskafka

A minimal Rust client for Apache Kafka
Rust
282
star
21

docs.influxdata.com-ARCHIVE

ARCHIVE - 1.x docs for InfluxData
Less
253
star
22

helm-charts

Official Helm Chart Repository for InfluxData Applications
Mustache
212
star
23

influxdb-rails

Ruby on Rails bindings to automatically write metrics into InfluxDB
Ruby
205
star
24

influxdb-csharp

A .NET library for efficiently sending points to InfluxDB 1.x
C#
198
star
25

influxdb1-client

The old clientv2 for InfluxDB 1.x
Go
187
star
26

giraffe

A foundation for visualizations in the InfluxDB UI
TypeScript
178
star
27

influxql

Package influxql implements a parser for the InfluxDB query language.
Go
163
star
28

influxdb-client-php

InfluxDB (v2+) Client Library for PHP
PHP
140
star
29

tdigest

An implementation of Ted Dunning's t-digest in Go.
Go
126
star
30

influx-stress

New tool for generating artificial load on InfluxDB
Go
118
star
31

tick-charts

A repository for Helm Charts for the full TICK Stack
Smarty
90
star
32

ui

UI for InfluxDB
TypeScript
86
star
33

telegraf-operator

telegraf-operator helps monitor application on Kubernetes with Telegraf
Go
79
star
34

pbjson

Auto-generate serde implementations for prost types
Rust
79
star
35

inch

An InfluxDB benchmarking tool.
Go
78
star
36

influxdata-operator

A k8s operator for InfluxDB
Go
76
star
37

docs-v2

InfluxData Documentation that covers InfluxDB Cloud, InfluxDB OSS 2.x, InfluxDB OSS 1.x, InfluxDB Enterprise, Telegraf, Chronograf, Kapacitor, and Flux.
SCSS
66
star
38

wirey

Manage local wireguard interfaces in a distributed system
Go
62
star
39

influxdb-go

61
star
40

influx-cli

CLI for managing resources in InfluxDB v2
Go
58
star
41

terraform-aws-influx

Reusable infrastructure modules for running TICK stack on AWS
HCL
50
star
42

grade

Track Go benchmark performance over time by storing results in InfluxDB
Go
43
star
43

influxdb-r

R library for InfluxDB
R
43
star
44

influxdb-observability

Go
43
star
45

clockface

UI Kit for building Chronograf
TypeScript
43
star
46

nginx-influxdb-module

C
40
star
47

influxdb2-sample-data

Sample data for InfluxDB 2.0
JavaScript
40
star
48

influxdb-client-ruby

InfluxDB 2.0 Ruby Client
Ruby
40
star
49

tensorflow-influxdb

Jupyter Notebook
34
star
50

nifi-influxdb-bundle

InfluxDB Processors For Apache NiFi
Java
33
star
51

line-protocol

Go
33
star
52

whisper-migrator

A tool for migrating data from Graphite Whisper files to InfluxDB TSM files (version 0.10.0).
Go
33
star
53

iot-center-flutter

InlfuxDB 2.0 dart client flutter demo
Dart
31
star
54

kube-influxdb

Configuration to monitor Kubernetes with the TICK stack
Shell
30
star
55

k8s-kapacitor-autoscale

Demonstration of using Kapacitor to autoscale a k8s deployment
Go
30
star
56

terraform-aws-influxdb

Deploys InfluxDB Enterprise to AWS
HCL
29
star
57

catslack

Shell -> Slack the easy way
Go
28
star
58

influxdb-operator

The Kubernetes operator for InfluxDB and the TICK stack.
Go
27
star
59

flux-lsp

Implementation of Language Server Protocol for the flux language
Rust
26
star
60

influxdb-client-swift

InfluxDB (v2+) Client Library for Swift
Swift
26
star
61

flightsql-dbapi

DB API 2 interface for Flight SQL with SQLAlchemy extras.
Python
26
star
62

influxdb-c

C
25
star
63

influxdb-client-dart

InfluxDB (v2+) Client Library for Dart and Flutter
Dart
24
star
64

ansible-chrony

A role to manage chrony on Linux systems
Ruby
24
star
65

kapacitor-course

24
star
66

vsflux

Flux language extension for VSCode
TypeScript
24
star
67

grafana-flightsql-datasource

Grafana plugin for Flight SQL APIs.
TypeScript
24
star
68

influxdb-scala

Scala client for InfluxDB
Scala
22
star
69

cron

A fast, zero-allocation cron parser in ragel and golang
Go
21
star
70

influxdb-plugin-fluent

A buffered output plugin for Fluentd and InfluxDB 2
Ruby
21
star
71

terraform-google-influx

Reusable infrastructure modules for running TICK stack on GCP
Shell
20
star
72

influxdb3_core

InfluxData's core functionality for InfluxDB Edge and IOx
Rust
18
star
73

openapi

An OpenAPI specification for influx (cloud/oss) apis.
Shell
17
star
74

influxdb-university

InfluxDB University
Python
16
star
75

influxdb-client-r

InfluxDB (v2+) Client R Package
R
14
star
76

cd-gitops-reference-architecture

Details of the CD/GitOps architecture in use at InfluxData
Shell
13
star
77

kafka-connect-influxdb

InfluxDB 2 Connector for Kafka
Scala
13
star
78

oats

An OpenAPI to TypeScript generator.
TypeScript
12
star
79

awesome

SCSS
12
star
80

windows-packager

Create a windows installer
Shell
12
star
81

iot-api-ui

Common React UI for iot-api-<js, python, etc.> example apps designed for InfluxDB client library tutorials.
TypeScript
12
star
82

promql

Go
11
star
83

yarpc

Yet Another RPC for Go
Go
11
star
84

iot-api-python

Python
11
star
85

influxdb-gds-connector

Google Data Studio Connector for InfluxDB.
JavaScript
11
star
86

object_store_rs

Rust
10
star
87

ansible-influxdb-enterprise

Ansible role for deploying InfluxDB Enterprise.
10
star
88

influxdb-sample-data

Sample time series data used to test InfluxDB
9
star
89

ingen

ingen is a tool for directly generating TSM data
Go
8
star
90

ansible-kapacitor

Official Kapacitor Ansible Role for Linux
Jinja
7
star
91

wlog

Simple log level based Go logger.
Go
7
star
92

iot-api-js

An example IoT app built with NextJS (NodeJS + React) and the InfluxDB API client library for Javascript.
JavaScript
7
star
93

influxdb-iox-client-go

InfluxDB/IOx Client for Go
Go
7
star
94

k8s-jsonnet-libs

Jsonnet Libs repo - mostly generated with jsonnet-libs/k8s project
Jsonnet
7
star
95

google-deployment-manager-influxdb-enterprise

GCP Deployment Manager templates for InfluxDB Enterprise.
HTML
6
star
96

jaeger-influxdb

Go
6
star
97

influxdb-action

A GitHub action for setting up and configuring InfluxDB and the InfluxDB Cloud CLI
Shell
6
star
98

influxdb-fsharp

A F# client library for InfluxDB, a time series database http://influxdb.com
F#
6
star
99

qprof

A tool for profiling the performance of InfluxQL queries
Go
6
star
100

influxdb-nodejs

InfluxDB client library for NodeJS
5
star