tekai-parser

A Pratt Parser implementation with no Lexer

Tekai is based on this text by Bob Nystrom (a.k.a. "munificent"), author of Magpie. He describes how Pratt Parsers works and he shows, step by step, how to program your own parser (the article is awesome!). Since we need a simple parser for our own purposes (Fuser project), I gave it a try and the results are pretty good.

As we need something a bit simpler to program, instead of using a Lexer to generate tokens, I used regular expressions right in the "parselets" (little components that parse a single "rule"), what makes Tekai a parser with no lexer phase. I'm not sure that fully classifies it as a "Scannerless Parser", but it serves well to its purpose.

Here is an example:

import tekai.Parser;
import tekai.standard.AtomParselet;
import tekai.standard.InfixParselet;
import tekai.standard.PostfixParselet;
.
.
.
// Precedence: Higher means grouped before others
int x = 1;
int ATOM_PRECENDENCE = x++;
int SUM_PRECENDENCE = x++;
int MULT_PRECENDENCE = x++;
int POSTFIX_PRECENDENCE = x++;

Parser parser = new Parser("1 + 2++ * 3");

// Rules: Tries to parse in this order, more specific rules comes first.
// Change the order of "PLUSONE" and "PLUS" and see what happens
parser.register(new PostfixParselet(POSTFIX_PRECENDENCE, "\\+{2}", "PLUSONE"));
parser.register(new InfixParselet(MULT_PRECENDENCE, "\\*", "MULT"));
parser.register(new InfixParselet(SUM_PRECENDENCE, "\\+", "PLUS"));
parser.register(new AtomParselet(ATOM_PRECENDENCE, "\\d+", "NUMBER"));

Expresssion result = parser.parse();

The code above results in the following AST:

+               (PLUS)
|__ 1           (NUMBER)
|__ *           (MULT)
    |__ ++      (PLUSONE)
    |   |__ 2   (NUMBER)
    |__ 3       (NUMBER)

This tree is represented by the following string when using the "toString" method of "Expression" (that makes tests easier, however, it's not easy to read):

"([+]:PLUS [1]:NUMBER ([*]:MULT ([++]:PLUSONE [2]:NUMBER) [3]:NUMBER))"

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tekai-parser

A Pratt Parser implementation with no Lexer

About

Releases

Packages

Languages

License

samuelpisa/tekai-parser

Folders and files

Latest commit

History

Repository files navigation

tekai-parser

A Pratt Parser implementation with no Lexer

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages