The Whitespace Trick
The Oxylisp lexer uses a simple approach: add spaces around delimiters, then split on whitespace. Why?
```rust
expression
    .replace('(', " ( ")
    .replace(')', " ) ")
    .split_whitespace()
```
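As a runnable sketch of the trick (the function name `tokenize` is my own; Oxylisp's actual API may differ):

```rust
// Whitespace-trick tokenizer: pad delimiters with spaces, then split.
fn tokenize(expression: &str) -> Vec<String> {
    expression
        .replace('(', " ( ")
        .replace(')', " ) ")
        .split_whitespace()
        .map(String::from)
        .collect()
}

fn main() {
    println!("{:?}", tokenize("(+ 1 (* 2 3))"));
    // ["(", "+", "1", "(", "*", "2", "3", ")", ")"]
}
```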
Pros:
- Dead simple to understand
- No complex state machine
- Works for 90% of Lisp code
- Easy to debug
Cons:
- Breaks with strings containing spaces
- Can’t handle edge cases like comments or escape sequences
- Not the “proper” way
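The first con is easy to see concretely. Run the same padding-and-splitting over a string literal containing a space (a sketch; not how Oxylisp itself handles strings):

```rust
// The whitespace trick has no notion of string literals, so the
// space inside "hello world" splits the literal into two tokens.
fn tokenize(expression: &str) -> Vec<String> {
    expression
        .replace('(', " ( ")
        .replace(')', " ) ")
        .split_whitespace()
        .map(String::from)
        .collect()
}

fn main() {
    println!("{:?}", tokenize(r#"(print "hello world")"#));
    // ["(", "print", "\"hello", "world\"", ")"]  -- the string is broken in two
}
```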
When Simple Isn’t Enough
You’d need a real lexer when:
- Supporting strings with spaces
- Adding line/column numbers for errors
- Optimizing for performance
- Supporting more complex syntax
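For the first of those, a character-by-character lexer with a small amount of state is enough. This is a sketch of the idea, not Oxylisp's code, and it skips escape sequences:

```rust
// A minimal stateful lexer: track whether we're inside a string
// literal so that spaces inside quotes don't split tokens.
fn lex(input: &str) -> Vec<String> {
    let mut tokens = Vec::new();
    let mut current = String::new();
    let mut in_string = false;
    for c in input.chars() {
        match c {
            '"' => {
                current.push(c);
                if in_string {
                    // Closing quote: emit the whole string as one token.
                    tokens.push(current.clone());
                    current.clear();
                }
                in_string = !in_string;
            }
            '(' | ')' if !in_string => {
                if !current.is_empty() {
                    tokens.push(current.clone());
                    current.clear();
                }
                tokens.push(c.to_string());
            }
            c if c.is_whitespace() && !in_string => {
                if !current.is_empty() {
                    tokens.push(current.clone());
                    current.clear();
                }
            }
            _ => current.push(c),
        }
    }
    if !current.is_empty() {
        tokens.push(current);
    }
    tokens
}

fn main() {
    println!("{:?}", lex(r#"(print "hello world")"#));
    // ["(", "print", "\"hello world\"", ")"]  -- string stays intact
}
```

Note how much longer this is than the four-line whitespace trick, and it still doesn't track line numbers or handle escapes.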
The Trade-off
For learning interpreters, clarity beats cleverness. A 50-line lexer you understand is better than a 500-line one you don’t.
The goal isn’t to build the next Clojure. It’s to understand how interpreters work.
Error Handling Evolution
Version 1 used `panic!`:

```rust
_ => panic!("invalid token")
```
Version 2 uses `Result`:

```rust
_ => Err(anyhow!("Unrecognized token: '{}'", token))
```
This isn’t just “better practice” - it’s about learning. When your lexer fails, you want to know why, not just crash.
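A self-contained sketch of that shape, using a plain `String` error instead of `anyhow` and a made-up `Token` enum, looks like this:

```rust
#[derive(Debug, PartialEq)]
enum Token {
    LParen,
    RParen,
    Number(i64),
}

// Classify one token, reporting *which* input failed instead of panicking.
fn classify(token: &str) -> Result<Token, String> {
    match token {
        "(" => Ok(Token::LParen),
        ")" => Ok(Token::RParen),
        t if t.parse::<i64>().is_ok() => Ok(Token::Number(t.parse().unwrap())),
        _ => Err(format!("Unrecognized token: '{}'", token)),
    }
}

fn main() {
    println!("{:?}", classify("42")); // Ok(Number(42))
    println!("{:?}", classify("@")); // Err("Unrecognized token: '@'")
}
```

The caller sees the offending token in the error message, which is exactly the debugging information a `panic!` throws away.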
Regex Patterns
The patterns evolved from:

```rust
r"[A-Za-z+*=-]" // matches a single character
```

To:

```rust
r"^[A-Za-z+*/=<>!-][A-Za-z0-9+*/=<>!-]*$" // matches a full token
```

The anchors (`^` and `$`) prevent matching substrings. The extended character set handles real Lisp operators. One subtlety: `-` goes last in each character class so it reads as a literal hyphen rather than a range.
Why These Choices Matter
Every decision optimizes for learning:
- Simple over sophisticated
- Clear over clever
- Working over perfect
The best code for learning isn’t the best code for production. And that’s okay.