Detailed performance metrics and analysis for Bifrost running on AWS t3.xlarge instances (4 vCPUs, 16GB RAM).
Metric | Value | Notes |
---|---|---|
Success Rate | 100.00% | Perfect reliability under high load |
Average Request Size | 0.13 KB | Lightweight request payload |
Average Response Size | 10.32 KB | Large response payload testing |
Average Latency | 1.61s | Total end-to-end response time |
Peak Memory Usage | 3,340.44 MB | ~21% of available 16GB RAM |
Note: t3.xlarge tests used significantly larger response payloads (~10 KB vs ~1 KB on t3.medium) to stress-test performance with realistic production data sizes.
Operation | Latency | Performance Notes |
---|---|---|
Queue Wait Time | 1.67 µs | 96% faster than t3.medium |
Key Selection Time | 10 ns | 37% faster weighted API key selection |
Message Formatting | 2.11 µs | Consistent with t3.medium performance |
Params Preparation | 417 ns | Slight improvement over t3.medium |
Request Body Preparation | 2.36 µs | 11% faster request assembly |
JSON Marshaling | 26.80 µs | 58% faster serialization |
Request Setup | 7.17 µs | Comparable to t3.medium |
HTTP Request | 1.50s | 4% faster provider API calls |
Error Handling | 162 ns | 14% faster error processing |
Response Parsing | 2.11 ms | 81% faster despite 7.5x larger payloads |
initial_pool_size
to 18,000-20,000buffer_size
to 25,000-30,000initial_pool_size
up to 25,000buffer_size
to 35,000+Metric | t3.medium | t3.xlarge | Improvement |
---|---|---|---|
Bifrost Overhead | 59 µs | 11 µs | -81% |
Average Latency | 2.12s | 1.61s | -24% |
Queue Wait Time | 47.13 µs | 1.67 µs | -96% |
JSON Marshaling | 63.47 µs | 26.80 µs | -58% |
Response Parsing | 11.30 ms | 2.11 ms | -81% |
Response Size Handled | 1.37 KB | 10.32 KB | +7.5x |
Peak Memory Usage | 1,312.79 MB | 3,340.44 MB | +155% |
Memory Utilization | 33% | 21% | -36% |