Step-by-step guide to benchmark Bifrost in your own environment using the official benchmarking tool.
π‘ Open Source: The benchmarking tool is completely open source! Feel free to submit pull requests if you think anything is missing or could be improved.
benchmark
executable (or benchmark.exe
on Windows).
Flag | Required | Description | Default |
---|---|---|---|
-provider <name> | β | Provider name (e.g., bifrost , litellm ) | None |
-port <number> | β | Port number of your Bifrost instance | None |
-endpoint <path> | β | API endpoint path | v1/chat/completions |
-rate <number> | β | Requests per second | 500 |
-duration <seconds> | β | Test duration in seconds | 10 |
-output <filename> | β | Results output file | results.json |
Flag | Description | Default |
---|---|---|
-include-provider-in-request | Include provider name in request payload | false |
-big-payload | Use larger, more complex request payloads | false |
Target RPS | Memory Usage | Recommended Instance | Notes |
---|---|---|---|
< 1,000 | < 1GB | t3.small | Cost-effective for light loads |
1,000 - 3,000 | 1-2GB | t3.medium | Balanced performance/cost |
3,000 - 5,000 | 2-4GB | t3.large | High-performance production |
5,000+ | 3-6GB | t3.xlarge+ | Enterprise/mission-critical |
initial_pool_size
buffer_size
initial_pool_size
buffer_size
.env
file at root.