This page introduces Bifrost’s performance capabilities and explains how to choose the right instance size for your workload.
| Metric | t3.medium | t3.xlarge | Improvement |
|---|---|---|---|
| Success Rate @ 5k RPS | 100% | 100% | No failed requests |
| Bifrost Overhead | 59 µs | 11 µs | -81% |
| Average Latency | 2.12 s | 1.61 s | -24% |
| Queue Wait Time | 47.13 µs | 1.67 µs | -96% |
| JSON Marshaling | 63.47 µs | 26.80 µs | -58% |
| Response Parsing | 11.30 ms | 2.11 ms | -81% |
| Peak Memory Usage | 1,312.79 MB | 3,340.44 MB | +155% |
Note: The t3.xlarge tests used significantly larger response payloads (~10 KB vs ~1 KB), yet every timing metric in the table still improved.
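The Improvement column is a relative change with t3.medium as the baseline. A minimal sketch of that arithmetic follows, using the values from the table; the helper name `percent_change` and the dictionary keys are illustrative and not part of Bifrost. The results agree with the published column to within rounding.

```python
def percent_change(baseline: float, candidate: float) -> float:
    """Relative change from baseline to candidate, in percent."""
    return (candidate - baseline) / baseline * 100.0


# (t3.medium, t3.xlarge) pairs copied from the table above.
# Units match within each pair, so the ratio is unit-free.
measurements = {
    "bifrost_overhead_us": (59.0, 11.0),
    "average_latency_s": (2.12, 1.61),
    "queue_wait_us": (47.13, 1.67),
    "json_marshaling_us": (63.47, 26.80),
    "response_parsing_ms": (11.30, 2.11),
    "peak_memory_mb": (1312.79, 3340.44),
}

changes = {name: percent_change(a, b) for name, (a, b) in measurements.items()}

for name, change in changes.items():
    print(f"{name}: {change:+.1f}%")
```

A negative value means the t3.xlarge figure is lower than the t3.medium one, which is the desired direction for every row except peak memory.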
| Configuration Parameter | Effect |
|---|---|
| initial_pool_size | Higher values give faster performance at the cost of more memory |
| buffer_size & concurrency | Control queue depth and maximum parallel workers (per provider) |
| retry & timeout | Tune aggressiveness for each provider to meet your SLOs |
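To make the queue-depth and worker-count settings concrete, here is a generic sketch of a bounded queue drained by a fixed worker pool. It is not Bifrost's implementation: the function `run_pool`, the doubling "work", and the choice to block the producer when the queue is full are all assumptions made for illustration. Only the parameter names `buffer_size` and `concurrency` come from the table.

```python
import queue
import threading


def run_pool(jobs, buffer_size: int, concurrency: int):
    """Process jobs through a bounded queue drained by a fixed number of workers."""
    pending = queue.Queue(maxsize=buffer_size)  # buffer_size caps queue depth
    results = []
    lock = threading.Lock()

    def worker():
        while True:
            job = pending.get()
            if job is None:  # sentinel: no more work for this worker
                return
            out = job * 2  # stand-in for a real provider call
            with lock:
                results.append(out)

    # concurrency caps how many jobs are processed in parallel
    workers = [threading.Thread(target=worker) for _ in range(concurrency)]
    for w in workers:
        w.start()

    for job in jobs:
        pending.put(job)  # blocks while the queue is full (assumed backpressure policy)
    for _ in workers:
        pending.put(None)  # one sentinel per worker
    for w in workers:
        w.join()
    return sorted(results)


doubled = run_pool(range(10), buffer_size=4, concurrency=2)
print(doubled)
```

In this model a larger `buffer_size` absorbs bursts at the cost of memory and queue wait time, while a larger `concurrency` drains the queue faster as long as the upstream provider can keep up.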