Trusted by ML/AI Engineers from

  • OpenAI
  • Meta
  • DeepMind
  • Cornell University
  • J.P. Morgan
  • Standard Chartered

Get Started in Seconds

pipx install weco && weco setup claude-code

Then ask Claude Code: "Does it make sense to apply /weco in my codebase?"

How Does It Work?

Weco autonomously generates and tests candidate solutions, keeping improvements and discarding failed attempts.
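To make the keep-or-discard idea concrete, here is a minimal sketch of a greedy loop in Python. It is a simplification for illustration, not Weco's actual search algorithm: `optimize`, `toy_propose`, and `toy_evaluate` are hypothetical names, the "candidate" is a single number rather than code, and a failed attempt is modeled as an evaluation that returns `None`.

```python
import random
from typing import Callable, Optional


def optimize(
    baseline: float,
    propose: Callable[[float, random.Random], float],
    evaluate: Callable[[float], Optional[float]],
    steps: int,
    seed: int = 0,
) -> tuple[float, float, int]:
    """Greedy search: keep a candidate only if its metric beats the best so far."""
    rng = random.Random(seed)
    best = baseline
    best_score = evaluate(best)
    if best_score is None:
        raise ValueError("baseline must evaluate successfully")
    kept = 0
    for _ in range(steps):
        candidate = propose(best, rng)
        score = evaluate(candidate)
        if score is None:
            # Failed attempt (e.g. a crash or invalid output): discard it.
            continue
        if score > best_score:
            # Improvement: it becomes the new starting point.
            best, best_score = candidate, score
            kept += 1
    return best, best_score, kept


def toy_evaluate(x: float) -> Optional[float]:
    """Hypothetical metric: higher is better, peak at x = 3; negative x 'fails'."""
    if x < 0:
        return None
    return -((x - 3.0) ** 2)


def toy_propose(best: float, rng: random.Random) -> float:
    """Hypothetical proposal step: a small random change to the current best."""
    return best + rng.uniform(-1.0, 1.0)


best, best_score, kept = optimize(0.0, toy_propose, toy_evaluate, steps=200)
print(f"best={best:.3f} score={best_score:.4f} kept={kept}")
```

In a real run the candidate would be a code change and the score would come from your eval script. The control flow stays the same: evaluate, compare, then keep or discard.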

Built by Frontier AI Researchers

We're the team behind AIDE, which achieved ~4× the medal rate of the next best autonomous agent across 75 Kaggle competitions on OpenAI's MLE‑Bench. Independently validated by researchers at OpenAI, Meta, and Sakana AI.

Benchmark Performance (Medal Rate on MLE-Bench)

  • Next Best Agent: 4.4%
  • Weco's Algorithm (AIDE ML): 16.9% (~4×)
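As a quick check on the "~4×" figure, the ratio can be computed directly from the two medal rates quoted above (4.4% and 16.9%). The variable names are illustrative only.

```python
# Medal rates on MLE-Bench as quoted on this page, in percent.
next_best_agent = 4.4
aide_ml = 16.9

ratio = aide_ml / next_best_agent
print(f"{ratio:.2f}x")  # prints 3.84x, which rounds to roughly 4x
```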

Improvements You Can Actually Ship

Ship breakthroughs overnight

Weco can run for weeks without any human intervention.

Optimized for cost efficiency

Each candidate costs fractions of a cent. Find the non-obvious wins that manual iteration misses.

Your data never leaves your machine

Eval code runs locally where your data lives. Only metrics and diffs are sent. Review the open-source CLI to verify.

Works with any language

Python, C++, Rust, JS - if it prints a metric to stdout, Weco optimizes it.
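For example, an eval script can be as small as the Python sketch below, which scores some predictions and prints one metric line to stdout. The `name: value` line format, the metric name `accuracy`, and the helper functions are assumptions made for illustration; check the Weco documentation for the exact format it expects.

```python
def accuracy(predictions: list[int], labels: list[int]) -> float:
    """Fraction of predictions that match the labels."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def format_metric(name: str, value: float) -> str:
    """Render a metric as a single 'name: value' line (assumed format)."""
    return f"{name}: {value:.4f}"


if __name__ == "__main__":
    # Stand-in data; a real script would load a model and a held-out dataset.
    predictions = [1, 0, 1, 1]
    labels = [1, 0, 0, 1]
    print(format_metric("accuracy", accuracy(predictions, labels)))
```

The same pattern applies in C++, Rust, or JS: run the workload, compute a number, and print it.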

See every experiment in one tree

Each run produces a searchable tree of candidates. Compare any two nodes side-by-side.
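One way to picture such a tree is the small Python sketch below. It is not Weco's data model: the `Node` class, its fields, and the helpers `best_node` and `diff_nodes` are hypothetical, and the side-by-side comparison is approximated with a unified diff from the standard library.

```python
import difflib
from dataclasses import dataclass, field
from typing import Iterator, Optional


@dataclass
class Node:
    node_id: int
    code: str
    metric: Optional[float]  # None marks a failed attempt
    parent: Optional["Node"] = field(default=None, repr=False, compare=False)
    children: list["Node"] = field(default_factory=list, repr=False, compare=False)

    def add_child(self, node_id: int, code: str, metric: Optional[float]) -> "Node":
        """Attach a new candidate derived from this node."""
        child = Node(node_id, code, metric, parent=self)
        self.children.append(child)
        return child


def walk(node: Node) -> Iterator[Node]:
    """Yield every node in the tree, depth first."""
    yield node
    for child in node.children:
        yield from walk(child)


def best_node(root: Node) -> Node:
    """Return the highest-scoring node, ignoring failed attempts."""
    scored = [n for n in walk(root) if n.metric is not None]
    return max(scored, key=lambda n: n.metric)


def diff_nodes(a: Node, b: Node) -> str:
    """Compare the code of two nodes as a unified diff."""
    lines = difflib.unified_diff(
        a.code.splitlines(),
        b.code.splitlines(),
        fromfile=f"node-{a.node_id}",
        tofile=f"node-{b.node_id}",
        lineterm="",
    )
    return "\n".join(lines)


root = Node(0, "batch_size = 32\n", 0.50)
faster = root.add_child(1, "batch_size = 64\n", 0.70)
broken = root.add_child(2, "batch_size = -1\n", None)  # failed attempt

print(best_node(root).node_id)
print(diff_nodes(root, faster))
```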

Steer with natural language

Add constraints like "avoid unsafe memory access" or "prioritize readability" to guide the search.

Pricing

Try it on a real run

Free

  • 20 free credits ≈ 100 steps of Weco-hosted autoresearch.
  • Free access to Weco Observe: bring your own autoresearch agent and use the Weco dashboard for observability.

Bring your own keys

BYOK

  • Use your OpenAI, Anthropic, or Gemini keys.
  • Up to 10k experiments per month.

Managed inference at scale

Pay as you go

  • Everything in Free, plus:
  • Access to all models.
  • Priority support.

So amazing to see something built by this team that's substantially underpinning and influencing OpenAI's agentic roadmap.

Edward Grefenstette, Director of Research, Google DeepMind

Turn your codebase into a self-improving system

Point Weco at your eval script, run an optimization, and ship winning code.