pseudo-python
A restricted Python to idiomatic JavaScript / Ruby / Go / C# translator
Pseudo is a framework for high level code generation: it is used by this compiler to translate a subset of Python to all Pseudo-supported languages
If you are using Python3.5 and you experience problems with an already installed version of pseudo-python, please upgrade it to 0.2.34
(pip3 install pseudo-python --upgrade
)
git history
Sorry: I lied, manipulating the git history in a way that it appears most of the work is after 1 march: I was finishing my work in a company in february and I wanted to hide the fact i've worked on pseudo in that time: I actually started more active work somewhere around 20-th february if I remember correctly, somewhere around the second half of february. I am not sure if i have that original git history anymore: sorry to Clevertech(the company)/any observers, this was a really ugly thing to do.
Supported subset
Pseudo-Python compiles to pseudo ast
.
Pseudo defines a language-independent AST model and an unified standard library. It can map its own standard library to target language libraries and concepts automatically and it tries to generate readable and idiomatic code.
Pseudo-Python translates a subset of Python to Pseudo AST and then it receives the JS/Ruby/C#/Go backends for free. (+ at least 4-5 backends in the future)
Pseudo was inspired by the need to generate algorithms/code in different languages or portint tools/libraries to a new environment
That's why it can be mapped to a well defined subset of a language
It is meant as a framework consuming ast from parser generators / compilers / various tools and generating snippets / codebases in different target languages
Plan
Pseudo supports
-
basic types and collections and standard library methods for them
-
integer, float, string, boolean
-
lists
-
dicts
-
sets
-
tuples/structs(fixed length heterogeneous lists)
-
fixed size arrays
-
regular expressions
-
functions with normal parameters (no default/keyword/vararg parameters)
-
classes
- single inheritance
- polymorphism
- no dynamic instance variables
- basically a constructor + a collection of instance methods, no fancy metaprogramming etc supported
-
exception-based error handling with support for custom exceptions (target languages support return-based error handling too)
-
io: print/input, file read/write, system and subprocess commands
-
iteration (for-in-range / for-each / iterating over several collections / while)
-
conditionals (if / else if / else)
-
standard math/logical operations
Installation
pip install pseudo-python
Usage
pseudo-python <filename.py> ruby
pseudo-python <filename.py> csharp
etc for all the supported pseudo targets (javascript, c#, go, ruby and python)
examples
Each example contains a detailed README and working translations to Python, JS, Ruby, Go and C#, generated by Pseudo
a football results processing command line tool
a verbal expressions-like library ported to all the target languages
Error messages
A lot of work has been put into making pseudo-python error messages as clear and helpful as possible: they show the offending snippet of code and often they offer suggestions, list possible fixes or right/wrong ways to write something
Beware, pseudo and especially pseudo-python are still in early stage, so if there is anything weird going on, don't hesitate to submit an issue
Type inference
pseudo-python checks if your program is using a valid pseudo-translatable subset of Python, type checks it according to pseudo type rules and then generates a pseudo ast and passes it to pseudo for code generation.
The rules are relatively simple: currently pseudo-python infers everything from the usage of functions/classes, so has sufficient information when the program is calling/initializing all of its functions/classes (except for no-arg functions)
Often you don't really need to do that for all of them, you just need to do it in a way that can create call graphs covering all of them (e.g. often you'll have a
calling b
calling x
and you only need to have an a
invocation in your source)
You can also use type annotations. We are trying to respect existing Python3 type annotation conventions and currently pseudo-python recognizes int
, float
, str
, bool
, List[<type>]
,
Dict[<key-type>, <value-type>]
, Tuple[<type>..]
, Set[<type>]
and Callable[[<type>..], <type>]
Beware, you can't just annotate one param, if you provide type annotations for a function/method, pseudo-python expects type hints for all params and a return type
Variables can't change their types, the equivalents for builtin types are
list : List[@element_type] # generic
dict: Dictionary[@key_type @value_type] # generic
set: Set[@element_type] # generic
tuple: Array[@element_type] # for homogeneous tuples
Tuple[@element0_type, @element1_type..] # for heterogeneous tuples
int: Int
float: Float
int/float: Number
str: String
bool: Boolean
There are several limitations which will probably be fixed in v0.3
If you initialize a variable/do first call to a function with a collection literal, it should have at least one element(that limitation will be until v0.3)
All attributes used in a class should be initialized in its __init__
Other pseudo-tips:
-
Homogeneous tuples are converted to
pseudo
fixed length arrays and heterogeneous topseudo
tuples. Pseudo analyzes the tuples usage in the code and sometimes it translates them to classes/structs with meaningful names if the target language isC#
C++
orGo
-
Attributes that aren't called from other classes are translated as
private
, the other ones aspublic
. The rule for methods is different:_name
ones are only translated asprivate
. That can be added as config option in the future -
Multiple returns values are supported, but they are converted to
array
/tuple
-
Single inheritance is supported,
pseudo-python
supports polymorphism but methods in children should accept the same types as their equivalents in the hierarchy (except__init__
)
The easiest way to play with the type system is to just try several programs: pseudo-python
errors should be enough to guide you, if not,
you can always open an issue
How does Pseudo work?
The implementation goal is to make the definitions of new supported languages really clear and simple.
If you dive in, you'll find out a lot of the code/api transformations are defined using a declarative dsl with rare ocassions of edge case handling helpers.
That has a lot of advantages:
-
Less bugs: the core transformation code is really generalized, it's reused as a dsl and its results are well tested
-
Easy to comprehend: it almost looks like a config file
-
Easy to add support for other languages: I was planning to support just python and c# in the initial version but it is so easy to add support for a language similar to the current supported ones, that I added support for 4 more.
-
Easy to test: there is a simple test dsl too which helps all language tests to share input examples like that
However language translation is related to a lot of details and a lot of little gotchas, tuning and refining some of them took days. Pseudo uses different abstractions to streamline the process and to reuse logic across languages.
PSEUDO AST:
NORMAL CODE PSEUDO STANDARD LIBRARY INVOCATIONS
|| ||
|| ||
|| API TRANSLATOR
|| ||
|| ||
|| \/
|| IDIOMATIC TARGET LANGUAGE
|| STANDARD LIBRARY INVOCATIONS
|| ||
\/ \/
STANDARD OR LANGUAGE-SPECIFIC MIDDLEWARES
e.g.
name camel_case/snake_case middleware
convert-tuples-to-classes middleware
convert-exception-based errors handling
to return-based error handling middleware
etc
||
||
||
||
TARGET LANGUAGE CODE GENERATOR
defined with a dsl aware
that handles formatting
automatically
||
||
||
\/
OUTPUT
What's the difference between Pseudo and Haxe?
They might seem comparable at a first glance, but they have completely different goals.
Pseudo wants to generate readable code, ideally something that looks like a human wrote it/ported it
Pseudo doesn't use a target language runtime, it uses the target language standard library for everything (except for JS, but even there is uses lodash
which is pretty popular and standard)
Pseudo's goal is to help with automated translation for cases like algorithm generation, parser generation, refactoring, porting codebases etc. The fact that you can write compilers targetting Pseudo and receiver translation to many languages for free is just a happy accident
License
Copyright © 2015 2016 Alexander Ivanov
Distributed under the MIT License.