• Stars
    star
    674
  • Rank 66,980 (Top 2 %)
  • Language jq
  • License
    MIT License
  • Created about 2 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

jq implementation of jq

jqjq

jq implementation of jq

Warning this project is mostly for learning, experimenting and fun.

Why? It started when I was researching how to write decoders directly in jq for fq which ended up involving some syntax tree rewriting and walking and then it grew from there.

But it's also a great way to promote and show that jq is a very expressive, capable and nice language! :)

You can try and play around with it at jqplay.org.

Use via jqjq wrapper

$ ./jqjq -n 'def f: 1,8; [f,f] | map(.+105) | implode'
"jqjq"

$ ./jqjq '.+. | map(.+105) | implode' <<< '[1,8]'
"jqjq"

# jqjq using jqjq to run above example
# eval concatenation of jqjq.jq as a string and example
$ ./jqjq "eval($(jq -Rs . jqjq.jq)+.)" <<< '"eval(\"def f: 1,8; [f,f] | map(.+105) | implode\")"'
"jqjq"

$ ./jqjq --repl
> 1,2,3 | .*2
2
4
6
> "jqjq" | explode | map(.-32) | implode
"JQJQ"
> "jqjq" | [eval("explode[] | .-32")] | implode
"JQJQ"
> ^D

# 01mf02 adaptation of itchyny's bf.jq running fib.bf
$ ./jqjq -n "\"$(cat fib.bf)\" | $(cat bf.jq)"
"1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233"

$ ./jqjq -h
jqjq - jq implementation of jq
Usage: jqjq [OPTIONS] [--] [EXPR]
  --jq PATH        jq implementation to run with
  --lex            Lex EXPR
  --no-builtins    No builtins
  --null-input,-n  Null input
  --parse          Lex and parse EXPR
  --repl           REPL
  --run-tests      Run jq tests from stdin
  --slurp,-s       Slurp inputs into an array

Use with jq

$ jq -n -L . 'include "jqjq"; eval("def f: 1,8; [f,f] | map(.+105) | implode")'
"jqjq"

$ jq -L . 'include "jqjq"; eval("(.+.) | map(.+105) | implode")' <<< '[1,8]'
"jqjq"

Run tests

make test

Progress

  • 123, .123, 1.23, 1.23e2, 1.23e+2, "abc", true, false, null Scalar literals
    • Unicode codepoint escape "\ud83d\ude03"
    • Handle surrogate pairs \ud800-\udfff, should translate to codepoint.
    • Control code and quote escape "\"\n\r\t\f\b\\\/"
  • {key: "value"} Object literal
    • {key}
    • {"key"}
    • {$key}
    • {(f): f}
    • {("a","b"): (1,2), c: 2} Multiple key/value outputs
    • {"\(f)"} String interpolation
    • {key: 1 | .} Multi value queries
  • [1,2,3] Array literal, collect
  • 1, 2 Comma operator
  • 1 | 2 Pipe operator
  • +, -, *, /, % Arithmetic operators
  • +123, -1 Unary operators
  • ==, !=, <, <=, >, >= Comparison operators
  • 123 as $a | ... Binding
    • (1,2,3) as $a | ... Binding per output
    • {a: [123]} as {a: [$v]} Destructuring binding
  • . Identity
  • .a, ."a", .[1], .[f] Index
  • .key[123]."key"[f] Suffix expressions
    • .a.b Multi index
    • .a.b? Optional index
    • .a[] Iterate index
    • .[]? Try iterate
  • .[] Iterate
  • .[start:stop], .[:stop], .[start:] Array slicing
    • .[{start: 123, stop: 123}] Slice using object
    • Slice and path tracking path(.[1:2]) -> [{"start":1,"end":2}]
  • try f, Shorthand for try f catch empty
  • f? Shorthand for try f catch empty
  • and, or operators
  • not operator
  • if f then 2 else 3 end Conditional
    • if f then 2 end Optional else
    • if f then 2 elif f then 3 end Else if clauses
    • if true,false then "a" else "b" end Multiple condition outputs
  • reduce f as $a (init; update) Reduce outputs from f into one output
  • foreach f as $a (init; update; extract) Foreach outputs of f update state and output extracted value
    • Optional extract
  • f = v Assignment
  • f |= v, f += Update assignment
  • +=, -=, *=, /=, %= Arithmetic update assignment
  • eval($expr) (jqjq specific)
  • path(f) Output paths for f
  • input, inputs
  • Builtins / standard library
    • del(f)
    • add
    • all, all(cond), all(gen; cond)
    • any, any(cond), any(gen; cond)
    • debug (passthrough)
    • delpaths($paths) (passthrough)
    • empty (passthrough)
    • endswith($s)
    • error($v) (passthrough)
    • error (passthrough)
    • explode (passthrough)
    • first(f)
    • first
    • flatten, flatten($depth)
    • from_entries
    • fromjson
    • getpath(path) (passthrough)
    • group, group_by(f)
    • has($key) (passthrough)
    • implode (passthrough)
    • isempty
    • join($s)
    • last(f)
    • last
    • length (passthrough)
    • limit($n; f)
    • map(f)
    • max, max_by(f)
    • min, min_by(f)
    • nth($n; f); nth($n)
    • range($to), range($from; $to), range($from; $to; $by)
    • recurse, recurse(f)
    • repeat
    • reverse
    • scalars
    • select(f)
    • setpath (passthrough)
    • sort, sort_by(f)
    • startswith($s)
    • to_entries
    • tojson
    • tonumber (passthrough)
    • tostring (passthrough)
    • match($regex; $flags) (passthrough)
    • match($val)
    • gsub($regex; f) (passthrough)
    • gsub($regex; f; $flags)
    • transpose
    • type (passthrough)
    • unique, unique_by(f)
    • until(cond; next)
    • while(cond; update)
    • with_entries
    • Math functions, sin/0, ... atan/2, ...
    • More...
  • def f: . Function declaration
    • def f(lambda): lambda Lambda argument
    • (def f: 123; f) | . Closure function
    • def f: def _f: 123; _f; f Local function
    • def f($binding): $binding Binding arguments
    • def f: f; Recursion
  • .. Recurse input, same as recurse
  • // Alternative operator
  • ?// Alternative destructuring operator
  • $ENV
  • @format "string" Format string
  • label $out | break $out Break out
  • include "f", import "f" Include
  • Run jqjq with jqjq
  • Bugs

jq's test suite

$ ./jqjq --run-tests < ../jq/tests/jq.test | grep passed
245 of 362 tests passed

Note that expected test values are based on stedolan's jq. If you run with a different jq implementation like gojq some tests might fail because of different error messages, support for arbitrary precision integers etc.

Design overview

jqjq has the common lex, parse, eval design.

Lex

Lexer gets a string and chews off parts from left to right producing an array of tokens [{<name>: ...}, ...]. Each chew is done by testing regex:s in a priority order to make sure to match longer prefixes first, ex: += is matched before +. For a match a lambda is evaluated, usually just . (identity), but in some cases like for quoted strings it is a bit more complicated.

You can use ./jqjq --lex '...' to lex and see the tokens.

Parse

Parser takes an array of tokens and uses a left-to-right (LR) parser with backtracking in combination with precedence climbing for infix operators to not end up in an infinite loop (ex parser rule E -> E + E). Backtracking is done by outputting empty for non-match and // to try the next rule, ex: a // b // ... // error where a and b are functions that try to match a rule. When a rule has matched it returns an array with the pair [<tokens left>, <ast>]. <ast> uses the same AST design as gojq.

You can use ./jqjq --parse '...' to lex and parse and see the AST tree.

Eval

Eval is done by traversing the AST tree and evaluates each AST node and also keeps track of the current path and environment.

Path is used in jq to keep track of current path to where you are in the input, this only works for simple indexing (ex: path(.a[1]), .b outputs ["a",1] and ["b"]). This is also used to implement assignment and some other operators.

Environment is an object with current functions and bindings. Functions have the key name <name>/<arity> and the value is a function AST. Bindings use the key name $<name>/0 and the value is {value: <value>} where value is normal jq value.

When evaluating the AST eval function get the current AST node, path and environment and will output zero, one or more arrays with the pair [<path>, <value>]. Path can be [null] if the evaluation produced a "new" value etc so that path tracking is not possible.

Problems, issues and unknowns

  • Better error messages.
  • The "environment" pass around is not very efficient and also it makes support recursion a bit awkward (called function is injected in the env at call time).
  • "," operator in jq (and gojq) is left associate but for the way jqjq parses it creates the correct parse tree when it's right associate. Don't know why.
  • Suffix with multiple [] outputs values in wrong order.
  • Non-associate operators like == should fail, ex: 1 == 2 == 3.
  • Object are parsed differently compared to gojq. gojq has a list of pipe queries, jqjq will only have one that might be pipe op.
  • Less "passthrough" piggyback on jq features:
    • reduce/foreach via recursive function? similar to if or {}-literal?
    • try/catch via some backtrack return value? change [path, value] to include an error somehow?
  • How to support label/break?
  • How to support delpaths (usd by del etc). Have to keep paths the same while deleting a group of paths? use sentinel value? work with paths instead?
  • Rewrite AST before eval, currently if and some others do rewrite (optional parts etc) while evaluating.
  • Rethink invalid path handling, current [null] is used as sentinel value.
  • {a:123} | .a |= empty should remove the key.

Useful references

Tools and tricks

  • jq -n --debug-dump-disasm '...' show jq byte code
  • jq -n --debug-trace=all '...' show jq byte code run trace
  • jq -n '{a: "hello"} | debug' 2> >(jq -R 'gsub("\u001b\\[.*?m";"") | fromjson' >&2) pretty print debug messages
  • GOJQ_DEBUG=1 go run -tags gojq_debug cmd/gojq/main.go -n '...' run gojq in debug mode
  • fq -n '".a.b" | _query_fromstring' gojq parse tree for string
  • fq -n '{...} | _query_tostring' jq expression string for gojq parse tree
  • For a convenient jq development experience:

Thanks to

  • stedolan for jq and got me interested in generator/backtracking based languages.
  • pkoppstein for writing about jq and PEG parsing.
  • itchyny for jqjq fixes and gojq from which is learned a lot and is also from where most of jqjq's AST design comes from. Sharing AST design made it easier to compare parser output (ex via fq's _query_fromstring). gojq also fixes some confusing jq bugs and has better error messages which saves a lot of time.
  • Michael FΓ€rber @01mf02 for jaq and where I also learned about precedence climbing.

License

Copyright (c) 2022 Mattias Wadman

jqjq is distributed under the terms of the MIT License.

See the LICENSE file for license details.

More Repositories

1

fq

jq for binary formats - tool, language and decoders for working with binary and text formats
Go
9,728
star
2

static-ffmpeg

Multi-arch docker image with ffmpeg/ffprobe binaries built as hardened static PIE binaries with no external dependencies
Dockerfile
247
star
3

ydls

youtube-dl HTTP download and transcode service
Go
185
star
4

postfix-relay

Postfix SMTP relay docker image
Shell
122
star
5

ansisvg

Convert ANSI to SVG
Go
86
star
6

goutubedl

Go wrapper for youtube-dl and yt-dlp
Go
84
star
7

bump

A generic version tracking and update tool
Go
66
star
8

jq-lsp

jq language server
JSONiq
59
star
9

gormstore

GORM backend for gorilla sessions
Go
57
star
10

catgolf

cat(1) golf
42
star
11

rgen

Resource code generator for iOS inspired by Android resource handling
Objective-C
40
star
12

inkmake

Makefile inspired export from SVG files using Inkscape as backend with some added smartness.
Ruby
24
star
13

php-tftpserver

TFTP server written in PHP
PHP
22
star
14

ffcat

Preview media files in the shell
Go
17
star
15

vscode-jq

jq extension for VSCode
TypeScript
16
star
16

chromesthesia

Find out what song is playing in a Chrome tab
JavaScript
16
star
17

aes-arraybuffer

JavaScript AES and CBC implementation using ArrayBuffer
JavaScript
13
star
18

docker-webdav

WebDAV and web file browser server
Go
12
star
19

flac.tcl

Probably the slowest FLAC decoder in the world
Tcl
11
star
20

disable_sendfile_vbox_linux

Go VirtualBox vboxsf sendfile bug workaround
Go
11
star
21

crxreload

Reload Chrome extension automatically when files are changed
JavaScript
10
star
22

libfa

C automata library to build, determinize, minimize, translate regexp etc
C
10
star
23

ios-misc

Misc iOS related classes
Objective-C
10
star
24

respect

Resource inspector and lint tool to help find issues with resources in Xcode projects
Objective-C
10
star
25

gtktetris

Simple GTK tetris game
C
5
star
26

appsnax

Generate wireless app distribution mainifests for iOS directly from IPA files
PHP
5
star
27

static-gm

Image with graphicsmagick binary built as hardened static PIE binaries with no external dependencies
Dockerfile
5
star
28

filtertransport

Filtering go http transport and proxy handler
Go
4
star
29

static-shaka-packager

Image with shaka-packager binary built as hardened static PIE binaries with no external dependencies
Dockerfile
4
star
30

jq-dash-docset

Tools to generate jq dash docset
HTML
4
star
31

fultracker

Erlang bittorrent tracker experiment (Inactive project)
Erlang
3
star
32

maggie

Maggie, a GNU GameBoy Emulator (Inactive project)
C
3
star
33

textfiend

TCL implementation of the hexfiend binary template API
Tcl
3
star
34

htmx-dash-docset

</> htmx dash docset
HTML
3
star
35

slippy

iOS puzzle game featuring the hungry non-swimmer penguin Slippy that needs your help to catch fish
Objective-C
2
star
36

homebrew-tap

wader homebrew taps
Ruby
2
star
37

gameboy-games

Archive of GameBoy games I wrote many years ago
C
2
star
38

notify

Configurable inotify based file system monitor (Inactive project)
Python
2
star
39

objcheck

an ObjC port of the QuickCheck unit test framework
Objective-C
1
star
40

rails-misc

Misc ruby on rails goodies
Shell
1
star
41

citris

Circular Tetris (Inactive project)
C
1
star
42

fuse-misc

Misc FUSE file systems, ccxfuse, fusememfs, ircfs, pipefs, tagfs and erlfuse (Inactive projects)
C
1
star
43

gae-misc

Misc Google App Engine snippets
Python
1
star
44

chrome-search-spotify

Search Spotify Chrome extension
JavaScript
1
star
45

compose-hook

git hook for running docker-compose
Shell
1
star
46

lurker

Audio silence splitter (Inactive project)
C
1
star
47

vscode-go-debug-test

Test project for custom vscode-go debug command
Go
1
star
48

app-client-macos

Experimental appsocket support for macOS
Go
1
star
49

mo-mw-test

Testing
1
star
50

logutils

Various utils to be used with the go log package etc
Go
1
star
51

osleaktest

Checks for leaked fds, child processes and temp files
Go
1
star
52

firefox-all-spaces

Ugly hack to have a separate Firefox window visible on all Mac OS X spaces (workaround because afloat etc can't be used with Firefox)
Shell
1
star
53

language-jq

jq language support in Atom
1
star
54

hugo-dropbox

Docker image that serves a hugo site from dropbox
Nginx
1
star
55

json5.jq

JSON5 implementation for jq
jq
1
star