Open source · autonomous

Explorbot

Autonomous exploratory testing

An AI agent that investigates your product like your most relentless QA engineer — and turns every discovery into a test you can keep. This is vibe-testing.

$ npm i explorbot

Read Docs GitHub

explorbot — start

A new layer of testing

Unit, end-to-end, and now exploratory

Exploratory tests can run on your CI pipeline just as unit and e2e tests do. Everything stays local, no external access from cloud services.

Unit Tests

Verify a single function or component in isolation.

Written by developers
Tiny, isolated, instant
Catch logic bugs early

End-to-End Tests

Drive the whole app through fixed user journeys.

Scripted by QA engineers
Real flows, same path each run
Catch broken journeys

New

Exploratory Tests

A curious AI agent that investigates with intent.

Driven by an AI agent
New ways, never the same twice
Catch what no one tested

Explorbot makes the third layer as routine as the first two.

Preconditions

Plug AI and Browser to get started

Point Explorbot at a running web app and whatch it explores the app same way as your users do

What you need

Requirements

Playwright and browsers

The Playwright library with its browsers. The same engine your end-to-end suite already runs on.
Fast AI models

Access to fast AI models from any provider. We recommend OpenRouter for the best speed and choice.

Nothing else. No GPUs, no virtualization and no special runners. If your CI runs Playwright, it runs Explorbot.

What makes it so fast?

What it costs

~ $1 / hour

in tokens (no extra charge)

One hour of intensive work, no pauses.
30 to 50 meaningful tests every hour.
Fast, cheap models keep the bill low.

Recommended AI models that run at 100+ tokens per second:

gpt-oss-20b gemma 4 gpt-5.6-luna gpt-nano

Works with any AI API platform

OpenRouter OpenAI Google Gemini Anthropic Mistral Groq OpenRouter OpenAI Google Gemini Anthropic Mistral Groq

See it in action

Watch Explorbot work, unscripted

A real session: the agent plans, drives the browser and writes tests on the fly — no human in the loop.

explorbot · autonomous session

Live

Explorbot is exploring…

Recorded live: Explorbot explores the shop, runs a product tour, then completes a full checkout, verifying each step as it goes.

Works with your suite

It won't replace your regression tests — it covers what they can't

Your Playwright and CodeceptJS suites re-run the same fixed steps every build. Explorbot re-explores the very same pages new ways — clicking new UI elements and paths your scripts never touch.

Regression suite

Same steps · every build

/login /cart /checkout /account same path, every run

in parallel

Explorbot

New ways & new UI · every run

/login /cart /checkout + new UI + edge states + new feature

Re-explore what you already test

Scripted tests repeat the exact same steps every build. Run Explorbot beside them and it re-explores the same pages new ways each time — clicking new UI elements and paths your script never touches — to surface regressions hiding just off the scripted route.

Runs in parallel with Playwright, Cypress or CodeceptJS
Revisits known pages, never the same way twice
Finds new UI elements and regressions scripts miss

Cover features before tests exist

A brand-new feature with zero automated coverage? Point Explorbot at it. From the live UI plus the specs and domain knowledge you give it, it works out the basic test cases and runs them straight away.

Reads the page, your specs and domain rules
Derives and runs the essential test cases
Gives new work real coverage from day one

Real app types

Put it to work on your SaaS, CRM or ERP

Point Explorbot at the app you already run. Here is how to set it up — and the kind of glitches it surfaces that scripted suites walk right past.

SaaS

Add credentials, then point it at its main tables, forms and settings.

Catches glitches like

Filters that return stale results
Create or delete that silently fails
Edge-case input accepted unchecked
Forms that fail without saying why
Settings that don’t save
Confusing or uninformative screens

Explorbot exploring a CRM sales pipeline

CRM

Add credentials, then point it at the deals pipeline and record pages.

Catches glitches like

Stage changes the board doesn’t reflect
Edits that don’t persist after save or reload
Required fields that save empty — no validation
Filters and saved views returning stale results
Bulk actions that skip or half-apply records
Duplicate entries created on re-submit

Explorbot exploring an ERP orders and inventory module

ERP

Add credentials, then point it at the orders and inventory modules.

Catches glitches like

Filters that return the wrong rows
Records that won’t save or delete
Edge-case values accepted unchecked
Numbers that don’t update after an edit
Bulk creation that drops rows
Forms that lose data between steps

How it sees

Every page becomes a UI Map

Explorbot never reads your source. It scans the live page, finds every interactive element and indexes it as a locator.

A CodeceptJS test-management screen as a person sees it

Raw page

The same screen annotated by Explorbot, with an indexed box on every interactive element

UI Map

Explorbot expands dropdowns and collapsable elements to discover what is hidden.

How a session works

Reasonable Live Testing Flow

Give Explorbot a goal and a URL — a team of specialised agents takes it from there, with no scripts and no human in the loop.

01 · Researcher
Research

Maps the UI into sections and indexes every element — no docs or source needed.
02 · Planner
Plan

Drafts test scenarios across normal, curious and edge styles — state changes first.
03 · Tester & Pilot
Execute

Drives the browser step by step; the Pilot breaks dead loops and adapts as the app changes.
04 · Analyst
Verify

Verifies each outcome before passing, clusters findings by root cause and captures evidence.
05 · Historian
Keep

Saves passing flows as Playwright tests with reports and screencasts — and learns for next run.

Under the hood

A crew of specialised agents

Cheap, fast workers do the clicking and reading; smart managers make the calls — so a full session costs cents, not dollars.

Researcher

Maps each page into sections, locators and an element index.

Planner

Drafts scenarios and cycles testing styles: happy path vs. negative cases.

Tester

Drives the browser one real step at a time to run each scenario.

Pilot

Supervises the Tester and breaks it out of dead loops.

Captain

Orchestrates the session and takes your commands in real time.

Navigator

Executes browser actions with resilient locator fallbacks.

Analyst

Writes the run report, clustering defects by root cause.

Historian

Saves successful tests as markdown or Playwright or CodeceptJS code.

Fisherman

Seeds data via REST API using OpenAPI spec or learns from intercepted XHR requests.

The takeaway

Impactful reports

Every run is summarized into a report your whole team can read: a clear pass and fail breakdown, the pages and features it covered, and a written analysis of what works and what broke.

An Explorbot run report in Testomat.io: a pass and fail donut chart beside a written session analysis with coverage and findings

The same Explorbot run report in Testomat.io, dark theme

Shown here in Testomat.io. Self-hosted runs also produce standalone HTML and Markdown reports you can open or commit anywhere.

Also produces

Runnable Tests

Playwright or CodeceptJS specs for every flow.

Videos

Step-by-step screencasts of every run as evidence.

Experience

Knowledge it reuses to test smarter next run.

Not only for web testing

Explorbot can do more

ApiBot and DocBot bring the same autonomy to your APIs and your documentation.

API testing

ApiBot

ApiBot autonomously tests your API endpoints — hunting for security breaches, edge cases, and the request combinations a scripted suite would never try.

Probes endpoints for security breaches
Hammers edge cases and bad inputs
Chains API call combinations on its own

$ explorbot api explore /users

Living documentation

DocBot

DocBot walks through your app and builds documentation straight from the live UI, so it is never outdated. It is not a spec — it captures how things actually look and behave right now, even when no spec exists or the UI has grown past it. It documents the real, current state of your app.

Built from the live UI, not a spec
Never outdated — reflects today’s app
Documents reality, even beyond the spec

$ explorbot docs collect /dashboard

Get started

Try it now!

Set up Explorbot manually or make an agent do the job for you.

Install with npm

Add Explorbot to your project, then scaffold a config and a .env file with a single command.

$ npm i explorbot

Initialize project:

$ npx explorbot init

Read the Explorbot readme

Install with a skill

Add the skills to your agent, then run explorbot-setup. It installs Explorbot, writes the config, and proves it can reach your app before handing back to you.

$ npx skills add testomatio/skills

Run setup in your agent:

$ claude /explorbot-setup

Browse all QA skills

Open source · Elastic License 2.0

Explorbot is open source

Install it, read the full documentation, and run it on your own app — it's all on GitHub.

Star on GitHub Read docs

Questions & answers

Frequently asked questions

Your data, the models, and how Explorbot fits next to the tools you already use.

Is my data safe?

Yes. Explorbot runs locally and keeps everything in files on your own machine. It never ships your data to third-party services, with a single exception: the AI provider you configure. If you trust that provider, you can trust Explorbot. Optionally, it can send traces to Langfuse for logs and traceability, but you can run Langfuse inside your own infrastructure. Explorbot is built to run within your own setup, including CI, where it reaches internal services with no external connections beyond the AI endpoint.

Can I run it against production?

We recommend pointing Explorbot at a local, dev or staging environment rather than production. It can create, change or delete records through the UI, so it belongs in an isolated workspace seeded with disposable data, signed in as a user with limited permissions and no real secrets in reach. You can still run it on production for regression testing, as long as that account is scoped to a single project or workspace and cannot touch anything critical.

How does Explorbot use AI?

Explorbot drives fast, cheap models over pay-per-token APIs such as OpenRouter or Groq. You cannot run it on a coding-assistant subscription like Codex, Claude, Gemini, or Copilot. Those plans only expose models that are slow and costly, which are not suited to the way Explorbot reads large amounts of HTML on every step. Pay-per-token open models keep a full run at roughly a dollar an hour.

Can we use OpenAI models?

Yes, through the OpenAI Platform. Add an API key from platform.openai.com and point Explorbot at gpt-nano for reading pages, and gpt-luna for agentic decisions and vision. This is API access billed per token. Pricing should be as low as 1.5$ per hour of extensive work.

Can we use Google Gemini models?

Yes, through Google AI Studio. Add an API key from aistudio.google.com and point Explorbot at gemini-3.1-flash-lite for reading pages and vision, and gemini-3.5-flash for agentic decisions. Flash-lite is the cheapest tier for the token-heavy reading, the full flash is stronger where the decisions are made. This is API access billed per token.

How is Explorbot different from Cursor or Claude Code with Playwright MCP?

Cursor, Codex, and Claude Code are coding agents. They help a developer write tests interactively. Explorbot is built to run on its own for hours at a time, on CI, with no human in the loop. It is designed for continuous testing, not pair programming with a developer.

Can we use our own models?

Yes. If you host your own models with vLLM, Ollama, AWS Bedrock, Azure, you can point Explorbot at any inference endpoint through the Vercel AI SDK. Speed and pricing then depend entirely on your infrastructure. For most teams, outsourcing to OpenRouter or Groq is simpler and more cost-effective than maintaining dedicated GPU capacity.

What are the prerequisites for the web app under test?

Explorbot fits CRUD-heavy web apps best: SaaS platforms, ecommerce, ERP, admin panels and internal tools. It is not a good match for landing pages, blogs or static sites. It works best when every screen has its own URL and uses titles and headings consistently, so it can tell one state from another. Point it at an isolated workspace or staging environment seeded with non-critical sample data, using an account with limited permissions and no real secrets.

Read the full prerequisites

Is Explorbot open source and free to use?

Explorbot is source-available under the Elastic License 2.0 (ELv2), the same license used by Elastic and Grafana. It is free for commercial use: you can test any application, including commercial products, modify the code, self-host it and build it into your workflow at no cost. The one restriction is that you may not resell Explorbot itself as a hosted or managed service.

Explorbot

A new layer of testing

Unit, end-to-end, and now exploratory

Unit Tests

End-to-End Tests

Exploratory Tests

Preconditions

Plug AI and Browser to get started

What you need

Requirements

Playwright and browsers

Fast AI models

What it costs

See it in action

Watch Explorbot work, unscripted

Works with your suite

It won't replace your regression tests — it covers what they can't

Re-explore what you already test

Cover features before tests exist

Real app types

Put it to work on your SaaS, CRM or ERP

SaaS

CRM

ERP

How it sees

Every page becomes a UI Map

How a session works

Reasonable Live Testing Flow

Research

Plan

Execute

Verify

Keep

Under the hood

A crew of specialised agents

Researcher

Planner

Tester

Pilot

Captain

Navigator

Analyst

Historian

Fisherman

The takeaway

Impactful reports

Runnable Tests

Videos

Experience

Not only for web testing

Explorbot can do more

API testing

ApiBot

Living documentation

DocBot

Get started

Try it now!

Install with npm

Install with a skill

Open source · Elastic License 2.0

Explorbot is open source

Questions & answers

Frequently asked questions