# Benchmarks
Greph ships a benchmark harness under `bin/` and `benchmarks/`. The harness measures every search mode against real corpora (WordPress, Laravel) and synthetic datasets, captures structured reports, and lets you diff two reports against each other.
The published numbers in the README are sourced from GitHub Actions runs, never from local machines. Local runs are useful for iteration; CI runs are the source of truth.
## Running benchmarks

```sh
# Full benchmark suite
./bin/bench

# Specific category
./bin/bench --category text
./bin/bench --category ast

# Compare against external tools when available
./bin/bench --compare rg,grep
./bin/bench --compare sg

# Choose a corpus
./bin/bench --corpus wordpress
./bin/bench --corpus laravel
./bin/bench --corpus synthetic

# Aggregate multiple runs
./bin/bench-series 5 1                    # 5 measured runs, 1 warmup
./bin/bench-aggregate reports/run-*.json
```

The harness writes a JSON report per run. `bin/bench-compare` diffs two reports and prints a summary, which is useful when iterating on a hot path.
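The report-diff step can be sketched as follows. This is an illustration only: the flat `{benchmark_name: mean_seconds}` schema is an assumption, not Greph's actual report format.

```python
# Sketch: diff two benchmark reports and print percent deltas.
# ASSUMPTION: reports are flat {benchmark_name: mean_seconds} maps;
# Greph's real JSON schema may differ.
def diff_reports(base: dict, new: dict) -> dict:
    deltas = {}
    for name, base_time in base.items():
        if name in new:
            deltas[name] = (new[name] - base_time) / base_time * 100.0
    return deltas

base = {"literal": 0.80, "regex_new_instance": 1.50}
new = {"literal": 0.60, "regex_new_instance": 1.65}
for name, pct in diff_reports(base, new).items():
    print(f"{name}: {pct:+.1f}%")  # literal: -25.0%, regex_new_instance: +10.0%
```

Percent deltas against a committed baseline make regressions easy to spot in review.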
## Categories
The harness covers the benchmark categories below, each measuring a specific layer of the engine.
### Scan Mode: Text
Native text search against literal, regex, and combined patterns. The published baseline runs on the WordPress corpus and includes:
- Literal `function`
- Literal, case insensitive
- Literal, whole word
- Regex, new instance
- Regex, array call
- Regex, prefix literal
- Regex, suffix literal
- Regex, exact line literal
- Regex, literal collapse
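Several of these workloads (prefix literal, suffix literal, literal collapse) exist because a regex containing a required literal can be prefiltered cheaply before the full engine runs. A minimal sketch of that general technique, not Greph's implementation:

```python
import re

# Sketch: prefilter lines on a required literal before running the
# full regex. ASSUMPTION: this illustrates the general technique the
# regex-with-literal workloads measure, not Greph's actual code.
def search(lines, literal, pattern):
    regex = re.compile(pattern)
    hits = []
    for line in lines:
        # Cheap substring check first; the regex runs only on candidates.
        if literal in line and regex.search(line):
            hits.append(line)
    return hits

lines = ["new Foo()", "old Foo()", "new Bar(1)"]
print(search(lines, "new ", r"new \w+\("))  # ['new Foo()', 'new Bar(1)']
```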
### Scan Mode: Traversal
File walking only, no search. Measures the cost of the gitignore-aware walker on its own.
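The walker can be pictured as a pruned directory walk. A toy sketch, assuming only simple glob patterns from one ignore list; real `.gitignore` semantics (negation, anchoring, nested ignore files) are much richer:

```python
import fnmatch
import os

# Toy sketch of an ignore-aware file walk. ASSUMPTION: only simple
# glob patterns from a single list; real .gitignore semantics are
# far richer than this.
def walk(root, ignore_patterns):
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune ignored directories in place so os.walk skips them.
        dirnames[:] = [d for d in dirnames
                       if not any(fnmatch.fnmatch(d, p) for p in ignore_patterns)]
        for name in filenames:
            if not any(fnmatch.fnmatch(name, p) for p in ignore_patterns):
                yield os.path.join(dirpath, name)
```

Pruning directories before descending is what keeps the walk cheap; ignored subtrees are never opened at all.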
### Scan Mode: Parallel Text
Same text-search workloads with 1, 2, and 4 workers, to measure pcntl scaling overhead.
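These runs are usually read as speedup and parallel efficiency per worker count. A quick calculation with made-up timings:

```python
# Sketch: speedup and parallel efficiency from per-worker-count wall
# times. The timings below are hypothetical, for illustration only.
def efficiency(times: dict) -> dict:
    base = times[1]  # single-worker wall time is the baseline
    return {w: (base / t) / w for w, t in times.items()}

times = {1: 8.0, 2: 4.4, 4: 2.6}  # seconds, made up
for workers, eff in efficiency(times).items():
    print(f"{workers} workers: {eff:.0%} efficient")
```

Efficiency below 100% is the pcntl overhead this category exists to expose.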
### Scan Mode: AST

Native AST search against representative patterns:

```
new $CLASS()
array($$$ITEMS)
```
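In these patterns, `$CLASS` is a metavariable matching a single node and `$$$ITEMS` matches any number of sibling nodes. A toy matcher over tuple-shaped nodes illustrates the idea; Greph's real matcher operates on PHP ASTs, so everything below is illustrative only:

```python
# Toy structural matcher: nodes are (kind, child, ...) tuples.
# "$NAME" matches exactly one node; "$$$NAME" in tail position matches
# any number of remaining siblings. Illustration only.
def matches(pattern, node):
    if isinstance(pattern, str) and pattern.startswith("$") and not pattern.startswith("$$$"):
        return True  # single-node metavariable
    if isinstance(pattern, tuple) and isinstance(node, tuple):
        pkind, *pkids = pattern
        nkind, *nkids = node
        if pkind != nkind:
            return False
        if pkids and isinstance(pkids[-1], str) and pkids[-1].startswith("$$$"):
            fixed = pkids[:-1]  # variadic tail absorbs the rest
            return len(nkids) >= len(fixed) and all(
                matches(p, n) for p, n in zip(fixed, nkids))
        return len(pkids) == len(nkids) and all(
            matches(p, n) for p, n in zip(pkids, nkids))
    return pattern == node

# new $CLASS() against `new Foo()`:
print(matches(("new", "$CLASS"), ("new", ("name", "Foo"))))
# array($$$ITEMS) against `array(1, 2, 3)`:
print(matches(("array", "$$$ITEMS"), ("array", "1", "2", "3")))
```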
### Indexed Text Mode
Warmed indexed text search against the same patterns as the scan mode. Includes literal, case-insensitive, whole-word, short-token, and regex queries.
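Warmed indexed search here means the trigram index (whose build cost appears under Build Costs below): postings map each trigram to the files containing it, and a query intersects the postings of its own trigrams to get candidate files before any real matching. A minimal in-memory sketch; the actual on-disk layout will differ:

```python
from collections import defaultdict

# Minimal trigram index sketch. ASSUMPTION: in-memory postings
# (trigram -> set of file ids); Greph's real index layout may differ.
def trigrams(text):
    return {text[i:i + 3] for i in range(len(text) - 2)}

def build_index(files):
    postings = defaultdict(set)
    for file_id, content in files.items():
        for gram in trigrams(content):
            postings[gram].add(file_id)
    return postings

def candidates(postings, query):
    # A file can match only if it contains every trigram of the query.
    # Queries shorter than 3 chars yield no trigrams -> no candidates.
    sets = [postings.get(g, set()) for g in trigrams(query)]
    return set.intersection(*sets) if sets else set()

files = {"a.php": "function foo()", "b.php": "class Bar {}"}
postings = build_index(files)
print(candidates(postings, "function"))  # {'a.php'}
```

Short tokens are a separate workload above precisely because they produce few trigrams and therefore weak candidate filtering.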
### Indexed Summary Queries

Warmed indexed text search using count, files-with-matches, and files-without-matches outputs. These benefit most from the postings store because their per-file work is the cheapest.
### Indexed / Cached AST
Warmed AST fact search and cached AST search against the same patterns as the AST scan mode.
### Build Costs
Cold-build costs for the trigram index, the AST fact index, and the AST cache. Useful when deciding which mode to maintain for a given workload.
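One way to act on these numbers is to amortize a cold build over the queries it will serve. A worked example with hypothetical costs:

```python
# Sketch: break-even query count for maintaining an index.
# All numbers below are hypothetical, for illustration only.
def break_even(build_cost, scan_query, indexed_query):
    # Queries needed before the build cost is repaid by faster queries.
    return build_cost / (scan_query - indexed_query)

# e.g. 12 s cold build, 1.5 s per scan query, 0.5 s per indexed query
print(break_even(12.0, 1.5, 0.5))  # index pays off after 12 queries
```

Below the break-even point, plain scan mode is the cheaper choice; above it, maintaining the index wins.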
## Reading the published numbers
The README publishes a snapshot of the latest CI run for each baseline. Each section is sourced from a specific GitHub Actions run and labeled with the corpus, runner, PHP version, and number of measured runs and warmups. If you need to reproduce a number, use the same combination locally.
The comparison columns (rg, grep, sg) are gathered in the same job, on the same runner, against the same corpus. They are not extrapolated from external benchmarks.
## How to add a benchmark

- Add a benchmark definition under `benchmarks/`.
- Add a corresponding scenario to the regression suite if there is no equivalent yet, so the behavior under measurement is also verified.
- Run `./bin/bench --category <yours>` to validate the new benchmark.
- Capture a baseline with `./bin/bench-series 5 1` and commit the report under `benchmarks/baselines/`.
- Update the README table once the CI run that includes the new benchmark has landed on `main`.
## Performance philosophy
Greph is not trying to outrun ripgrep. Ripgrep is implemented in Rust with hand-tuned SIMD literal scanning, and a pure-PHP implementation cannot beat that. The goal is different:
- Be fast enough that interactive use and agent loops feel native.
- Beat `grep` on the workloads users care about.
- Beat `ast-grep` on indexed and cached AST workloads, where Greph's warm caches outperform ast-grep's cold parser.
- Make every performance improvement measurable and reproducible.
Performance claims that are not backed by a benchmark report are not landed.