Go module to handle Augmented Backus-Naur Form (ABNF), providing a large API. It implements RFC 5234 and 7405, with Errata 2968 and 3076.
Capabilities:
- parse ABNF (to manipulable datastructure ; with cycle detection)
- compile ABNF to regex
- create a minimal set of tests that covers the full grammar
- generate a visual representation of the ABNF grammar provided (mermaid)
- create an ABNF fuzzer for your modules (version >= Go1.18beta1)
Under the hood, go-abnf
is a dependency-free brute-force parser. It enumerates all possibilities for a given grammar and an input, and returns all possible paths. Those then have to be lexed in order to produce a new grammar.
As this implementation is not adhesive to the ABNF grammar of the ABNF grammar as defined in RFC 5234, updated by RFC 7405 and fixed by Erratum 2968 and 3076, it enables genericity. This imply that for any valid grammar in ABNF that is properly lexed, if you can write a lexer for this grammar, you can parse new input with the original grammar. To init this loop, we had to hardcode the manual decomposition of the ABNF grammar, reviewed multiple times.
Examples can be found in the examples directory
As go-abnf revolves around grammars, you can use a random walk to traverse its graph and efficiently generate valid inputs according to a given grammar.
This is particularly powerful when you want to fuzz Go implementations that require a very specific input format that the Go's fuzzing engine can't produce.
You can use go-abnf to efficiently produce test cases, as follows.
package example
import (
_ "embed"
"testing"
goabnf "github.com/pandatix/go-abnf"
)
//go:embed my-grammar.abnf
var myGrammar []byte
func FuzzFunction(f *testing.F) {
g, err := goabnf.ParseABNF(myGrammar)
if err != nil {
f.Fatal(err)
}
f.Fuzz(func(t *testing.T, seed int64) {
// Generate a random test case based on the seed
b, _ := g.Generate(seed, "a",
goabnf.WithRepMax(15), // Limit repetitions to 15
goabnf.WithThreshold(1024), // Stop ASAP input generation if reached 1024 bytes
)
Function(b)
})
}
Q: My ABNF grammar does not work. Do you have any idea why ?
R: There could be many reasons to this. First make sure your grammar ends up by a newline (LF), and especially that the input content has a CR LF. As those appear the same, it is often a source of error.
Q: Is there a difference between pap and bap ?
R: Yes, first of all the language (i.e. Go) enables more portability thus integration in workflows. But the real difference between pap and bap is the way they work: pap is built on an opportunity to challenge bap whether bap is built to generate meaningful errors to the end user. Out of this, no, pap and bap are very similar as they are ABNF parsers/validators.