Control the rate of requests to your Neurons to prevent abuse and manage resource usage effectively.
Rate Limiting settings in the Neuron Settings page
Metric | Description |
---|---|
Requests | The number of requests made to the Neuron. |
Tokens | The number of tokens processed by the Neuron, across all AI provider calls. |
Scope | Description |
---|---|
Total | The chosen Metric is counted as a single total for the Neuron. |
Per IP | The chosen Metric is counted separately for each IP address. |
Per User | The chosen Metric is counted separately for each user. |
Per User is only enforced if you are using JWT Authentication with a username property path defined; otherwise it is ignored. See Access Control for more information.
When a rate limit is exceeded, the request is rejected with a 429 HTTP status code and an error message. Make sure to handle this error in your application.
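One common way to handle a 429 on the client side is to retry with exponential backoff. The sketch below is a minimal illustration, not part of the Neuron API: `call_with_backoff`, the `send` callable, and the `(status, retry_after, body)` response shape are all hypothetical names chosen for this example. It also assumes the response may carry a standard `Retry-After` value, which this page does not confirm; when none is present, the sketch falls back to doubling delays.

```python
import time
from typing import Callable, Optional, Tuple

# Hypothetical response shape for this sketch:
# (HTTP status code, Retry-After in seconds or None, response body)
Response = Tuple[int, Optional[float], str]


def call_with_backoff(
    send: Callable[[], Response],
    max_attempts: int = 5,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call send() and retry while it reports HTTP 429.

    `send` is a placeholder for whatever function performs the actual
    request to your Neuron. At most `max_attempts` calls are made; the
    last response is returned even if it is still a 429.
    """
    response = send()
    for attempt in range(1, max_attempts):
        status, retry_after, _body = response
        if status != 429:
            return response
        # Assumption: use a server-provided wait if one exists,
        # otherwise back off exponentially (0.5s, 1s, 2s, ...).
        if retry_after is not None:
            delay = retry_after
        else:
            delay = base_delay * 2 ** (attempt - 1)
        sleep(delay)
        response = send()
    return response
```

In an application, `send` would wrap your HTTP client call and map its response into the tuple above. If the final response is still a 429, surface the error message to the caller rather than retrying indefinitely.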
You can monitor rate limit hits and usage patterns through the execution logs to fine-tune your rate limiting rules.