Example Use Cases
Fraud Detection
Example: AUC 0.91 → 0.93
for end-to-end transaction-fraud ML pipelines
Prompt Engineering
Example: +32.4% accuracy
for vision-language chart-to-CSV extraction
Inference Optimization
Example: +47.7% speedup
for causal self-attention in GPT-style LLMs
Scientific ML
Example: −91.3% RMSLE
for molecular property prediction in materials science
Trusted by ML/AI Engineers from
Get Started in Seconds
pipx install weco && weco setup claude-code
Then ask Claude Code: Does it make sense to apply /weco in my codebase?
How Does It Work?
Weco autonomously generates and tests candidate solutions, keeping improvements and discarding failed attempts.
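The keep-or-discard loop described above can be sketched in a few lines. This is a simplified greedy illustration, not Weco's actual search: `evaluate` and `propose` are hypothetical stand-ins for running an eval script and generating a new candidate, and the "solution" here is just a single number.

```python
import random


def evaluate(candidate: float) -> float:
    """Hypothetical metric (higher is better); peaks at candidate == 3.0."""
    return -(candidate - 3.0) ** 2


def propose(parent: float, rng: random.Random) -> float:
    """Hypothetical stand-in for generating a new candidate from a parent."""
    return parent + rng.uniform(-1.0, 1.0)


def optimize(start: float, steps: int, seed: int = 0):
    """Greedy loop: keep a candidate only if it beats the best score so far."""
    rng = random.Random(seed)
    best, best_score = start, evaluate(start)
    history = [best_score]  # best score after each step
    for _ in range(steps):
        candidate = propose(best, rng)
        score = evaluate(candidate)
        if score > best_score:
            # Improvement: the candidate becomes the new baseline.
            best, best_score = candidate, score
        # Otherwise the candidate is discarded and the baseline is unchanged.
        history.append(best_score)
    return best, best_score, history
```

Because failed attempts never replace the baseline, the best score in `history` can only stay flat or improve from one step to the next.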
Built by Frontier AI Researchers
We're the team behind AIDE, which achieved ~4× the medal rate of the next best autonomous agent across 75 Kaggle competitions on OpenAI's MLE‑Bench. Independently validated by researchers at OpenAI, Meta, and Sakana AI.
Benchmark Performance (Medal Rate on MLE-Bench)
Academia and Industry Recognition
Improvements You Can Actually Ship
Ship breakthroughs overnight
Weco can run for weeks without any human intervention.
Optimized for cost efficiency
Each candidate costs fractions of a cent. Find the non-obvious wins that manual iteration misses.
Your data never leaves your machine
Eval code runs locally where your data lives. Only metrics and diffs are sent; review the open-source CLI to verify.
Works with any language
Python, C++, Rust, JS: if it prints a metric to stdout, Weco optimizes it.
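As a rough illustration of the "prints a metric to stdout" contract, the sketch below runs an eval command and pulls a named metric out of its output. The `name: value` line format, the helper names `run_eval` and `read_metric`, and the inline `EVAL_SNIPPET` are all assumptions made for this example; they are not taken from the Weco CLI.

```python
import re
import subprocess
import sys
from typing import List, Optional

# Hypothetical eval program: prints some log output, then one metric line.
EVAL_SNIPPET = 'print("loading data..."); print("accuracy: 0.8725")'


def run_eval(command: List[str]) -> str:
    """Run an eval command in any language and return what it wrote to stdout."""
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout


def read_metric(stdout: str, name: str) -> Optional[float]:
    """Return the last 'name: value' or 'name = value' line as a float, if any."""
    pattern = re.compile(
        rf"^{re.escape(name)}\s*[:=]\s*([-+]?\d*\.?\d+(?:[eE][-+]?\d+)?)\s*$",
        re.MULTILINE,
    )
    matches = pattern.findall(stdout)
    return float(matches[-1]) if matches else None
```

For example, `read_metric(run_eval([sys.executable, "-c", EVAL_SNIPPET]), "accuracy")` reads the metric from the inline snippet. The same pattern applies to a compiled C++ or Rust binary, since only its stdout is inspected.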
See every experiment in one tree
Each run produces a searchable tree of candidates. Compare any two nodes side-by-side.
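One way to picture a tree of candidates is a small node structure where each child is a modification of its parent. The sketch below is illustrative only: the `Node` class, its fields, and the `best_node` and `compare` helpers are hypothetical and do not reflect Weco's internal data model or dashboard.

```python
import difflib
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


@dataclass
class Node:
    """One candidate solution: its code, its metric, and its children."""
    node_id: str
    code: str
    metric: Optional[float] = None  # None if the candidate failed to run
    children: List["Node"] = field(default_factory=list)

    def add_child(self, child: "Node") -> "Node":
        self.children.append(child)
        return child

    def walk(self) -> Iterator["Node"]:
        """Yield this node and all descendants, depth-first."""
        yield self
        for child in self.children:
            yield from child.walk()


def best_node(root: Node) -> Node:
    """Return the scored node with the highest metric (higher is better here)."""
    scored = [node for node in root.walk() if node.metric is not None]
    return max(scored, key=lambda node: node.metric)


def compare(a: Node, b: Node) -> str:
    """Return a unified diff between the code of two nodes."""
    return "".join(
        difflib.unified_diff(
            a.code.splitlines(keepends=True),
            b.code.splitlines(keepends=True),
            fromfile=a.node_id,
            tofile=b.node_id,
        )
    )
```

Storing failed candidates with `metric=None` keeps them visible in the tree while excluding them from the best-node search.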
Steer with natural language
Add constraints like "avoid unsafe memory access" or "prioritize readability" to guide the search.
Pricing
Try it on a real run
Free
- 20 free credits ≈ 100 steps of Weco-hosted autoresearch.
- Free access to Weco Observe: bring your own autoresearch agent and use the Weco dashboard for observability.
Bring your own keys
BYOK
- Use your OpenAI, Anthropic, or Gemini keys.
- Up to 10k experiments per month.
Managed inference at scale
Pay as you go
- Everything in Free, plus:
- Access to all models.
- Priority support.
Frequently Asked Questions

“So amazing to see something built by this team that's substantially underpinning and influencing OpenAI's agentic roadmap.”
Turn your codebase into a self-improving system
Point Weco at your eval script, run an optimization, and ship winning code.