Handling LLM Rate Limits
Rate limiting restricts the number of requests a user or application can send to an LLM API within a given timeframe. LLM providers enforce these limits to manage resources and prevent abuse.
Because Goose sends requests rapidly while working on your tasks, you may need to manage the rate limits imposed by your provider. If you frequently hit rate limits, consider upgrading your LLM plan for higher-tier limits or using OpenRouter.
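Rate-limit errors typically surface as HTTP 429 responses from the provider's API. The snippet below is a generic illustration of the usual client-side mitigation, retrying with exponential backoff; it is not part of Goose itself, and the endpoint, key, and function names are placeholders.

```python
import time
import requests

API_URL = "https://api.example.com/v1/chat/completions"  # placeholder endpoint
API_KEY = "sk-..."  # placeholder key

def call_llm_with_backoff(payload, max_retries=5):
    """Retry a request with exponential backoff when the provider returns HTTP 429."""
    delay = 1.0
    for attempt in range(max_retries):
        response = requests.post(
            API_URL,
            headers={"Authorization": f"Bearer {API_KEY}"},
            json=payload,
            timeout=60,
        )
        if response.status_code != 429:
            response.raise_for_status()
            return response.json()
        # Honor the provider's Retry-After header if present, otherwise back off exponentially.
        wait = float(response.headers.get("Retry-After", delay))
        time.sleep(wait)
        delay *= 2
    raise RuntimeError("Still rate limited after retries")
```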
Using OpenRouter
OpenRouter provides a unified interface for LLMs that allows you to select and switch between different providers automatically - all under a single billing plan. With OpenRouter, you can utilize free models or purchase credits for paid models.
- Go to openrouter.ai and create an account.
- Once verified, create your API key.
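Once you have the key, you can optionally sanity-check it outside of Goose. This sketch assumes OpenRouter's OpenAI-compatible chat completions endpoint, that the key is exported as an environment variable named OPENROUTER_API_KEY, and uses an example model name; adjust these to match your account and OpenRouter's current documentation.

```python
import os
import requests

# Assumes the key is exported as OPENROUTER_API_KEY; the model name is just an example.
response = requests.post(
    "https://openrouter.ai/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
    json={
        "model": "openai/gpt-4o-mini",
        "messages": [{"role": "user", "content": "Say hello"}],
    },
    timeout=60,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```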
Goose CLI
- Run the Goose configuration command:
goose configure
- Select Configure Providers from the menu.
- Follow the prompts to choose OpenRouter as your provider and enter your OpenRouter API key when prompted.
Goose Desktop
- Click on the three dots in the top-right corner.
- Select Settings from the menu.
- Click on "Browse" in the Models section.
- Click on Configure.
- Select OpenRouter from the list of available providers.
- Enter your OpenRouter API key in the dialog that appears.
Now Goose will send your requests through OpenRouter, which will automatically switch models when necessary to avoid interruptions due to rate limiting.
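For context, OpenRouter exposes this behavior in its API as a fallback list of models that are tried in order. The request below is a hedged sketch of that mechanism; the parameter names and model identifiers are assumptions that should be checked against OpenRouter's current documentation, and Goose handles this routing for you once configured.

```python
import os
import requests

# Hedged sketch: OpenRouter documents a fallback mechanism where you pass a list of
# models and it falls back to the next one if the first is unavailable or rate limited.
# Parameter names and model identifiers here should be verified against the current docs.
response = requests.post(
    "https://openrouter.ai/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
    json={
        "models": ["anthropic/claude-3.5-sonnet", "openai/gpt-4o"],  # tried in order
        "messages": [{"role": "user", "content": "Summarize this repository."}],
    },
    timeout=60,
)
response.raise_for_status()
print(response.json()["model"])  # shows which model actually served the request
```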