Roadmap

Recently Completed ✓

  • Multi-Agent V3 Features (Mobile-Agent-v3 inspired):
      • Exception Handling: Automatic popup, ad, and error recovery
      • Reflection: Self-correction when stuck or looping
      • Task Decomposition: Break complex objectives into subtasks
      • Cross-App Memory: Remember facts across app switches
      • Enhanced Recording: Pre/post conditions for robust replay
      • Progress Reporting: Real-time MCP events for IDE integrations
  • Trail/Blaze Architecture: “Blaze once, trail forever” workflow
  • Standardized YAML Format: Express tests that mix natural-language and static steps
  • Recording: Save agent actions and replay them deterministically to save costs

Upcoming Features

  • Benchmark Integration: Run AndroidWorld/OSWorld benchmarks against Trailblaze
  • LLM Call Proxying (optional): Proxy LLM traffic so that no API key is required on the device
  • Self-Evolving Data: Auto-generate trail files from successful benchmark runs

Longer Term Vision

  • iOS testing support
  • Web testing support
  • Host mode execution: Execute tests from a host machine connected to a device or emulator, for more flexibility

Overall Vision

  • Create an open platform for authoring and executing tests using agentic AI
  • Enable teams to ship faster by lowering the barrier to test execution
  • Achieve competitive performance on standard benchmarks (targets: AndroidWorld 70%+, OSWorld 35%+)