Pomsky

A portable, modern regular expression language

documentation · website · Discord server

Examples

On the left are Pomsky expressions, on the right is the compiled regex:

# String
'hello world'                 # hello world

# Greedy repetition
'hello'{1,5}                  # (?:hello){1,5}
'hello'*                      # (?:hello)*
'hello'+                      # (?:hello)+

# Lazy repetition
'hello'{1,5} lazy             # (?:hello){1,5}?
'hello'* lazy                 # (?:hello)*?
'hello'+ lazy                 # (?:hello)+?

# Alternation
'hello' | 'world'             # hello|world

# Character classes
['aeiou']                     # [aeiou]
['p'-'s']                     # [p-s]

# Named character classes
[word] [space] [n]            # \w\s\n

# Combined
[w 'a' 't'-'z' U+15]          # [\wat-z\x15]

# Negated character classes
!['a' 't'-'z']                # [^at-z]

# Unicode
[Greek] U+30F Grapheme        # \p{Greek}\u030F\X

# Anchors, boundaries
^ $                           # ^$
% 'hello' !%                  # \bhello\B

# Non-capturing groups
'terri' ('fic' | 'ble')       # terri(?:fic|ble)

# Capturing groups
:('test')                     # (test)
:name('test')                 # (?P<name>test)

# Lookahead/lookbehind
(>> 'foo' | 'bar')            # (?=foo|bar)
(<< 'foo' | 'bar')            # (?<=foo|bar)
(!>> 'foo' | 'bar')           # (?!foo|bar)
(!<< 'foo' | 'bar')           # (?<!foo|bar)

# Backreferences
:('test') ::1                 # (test)\1
:name('test') ::name          # (?P<name>test)\1

# Ranges
range '0'-'999'               # 0|[1-9][0-9]{0,2}
range '0'-'255'               # 0|1[0-9]{0,2}|2(?:[0-4][0-9]?|5[0-5]?|[6-9])?|[3-9][0-9]?

# Inline regex
regex '[\w[^abg]]'            # [\w[^abg]]

Variables

let operator = '+' | '-' | '*' | '/';
let number = '-'? [digit]+;

number (operator number)*

Usage

Read the book to get started, or check out the CLI program, the Rust library and the procedural macro.

Why use this instead of normal regexes?

Normal regexes are very concise, but when they get longer, they get increasingly difficult to understand. By default, they don't have comments, and whitespace is significant. Then there's the plethora of sigils and backslash escapes that follow no discernible system: (?<=) (?P<>) .?? \N \p{} \k<> \g'' and so on. And with various inconsistencies between regex implementations, it's the perfect recipe for confusion.

Pomsky solves these problems with a new, simpler but also more powerful syntax:

It's not whitespace sensitive and allows comments
Text must appear in quotes. This makes expressions longer, but also much easier to read
Non-capturing groups are the default
More intuitive, consistent syntax
Variables to make expressions DRY

Compatibility

Pomsky is currently compatible with PCRE, JavaScript, Java, .NET, Python, Ruby and Rust. The regex flavor must be specified during compilation, so Pomsky can ensure that the produced regex works as desired on the targeted regex engine.

Note: You should enable Unicode support in your regex engine, if it isn't enabled by default. This is explained here.

Security

Never compile or execute an untrusted Pomsky expression on your critical infrastructure. This may make you vulnerable for denial of service attacks, like the Billion Laughs attack.

Diagnostics

Pomsky looks for mistakes and displays helpful diagnostics:

It shows an error if you use a feature not supported by the targeted regex flavor
It detects syntax errors and shows suggestions on how to resolve them
It parses backslash escapes (which are not allowed in a Pomsky expression) and explains what to write instead
It looks for likely mistakes and displays warnings
It looks for patterns that can be very slow for certain inputs and are susceptible to Denial-of-Service attacks (coming soon)

Comparison with other projects

I wrote an in-depth comparison with similar projects, which you can find here.

Code of Conduct

The Code of Conduct can be found here.

Contributing

You can contribute by using Pomsky and providing feedback. If you find a bug or have a question, please create an issue.

I also gladly accept code contributions. More information

Sponsor this project

Go to sponsors page

License

Dual-licensed under the MIT license or the Apache 2.0 license.

Name		Name	Last commit message	Last commit date
Latest commit History 288 Commits
.github		.github
assets		assets
benchmark		benchmark
helptext		helptext
pomsky-bin		pomsky-bin
pomsky-lib		pomsky-lib
pomsky-macro		pomsky-macro
pomsky-syntax		pomsky-syntax
pomsky-wasm		pomsky-wasm
regex-test		regex-test
.gitignore		.gitignore
.prettierrc.js		.prettierrc.js
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md
justfile		justfile
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pomsky

Examples

Variables

Usage

Why use this instead of normal regexes?

Compatibility

Security

Diagnostics

Comparison with other projects

Code of Conduct

Contributing

Sponsor this project

License

About

Releases

Packages

Languages

License

Kyza/pomsky

Folders and files

Latest commit

History

Repository files navigation

Pomsky

Examples

Variables

Usage

Why use this instead of normal regexes?

Compatibility

Security

Diagnostics

Comparison with other projects

Code of Conduct

Contributing

Sponsor this project

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages