vincent
|
0894a86af8
|
ADR-006 v2.1: final revision, NVIDIA provider keys, reply to 徐聪
v2.1 changes from 2nd-round review:
+ Emergency channel RPM: max(1, max_rpm * 0.1)
+ Queue 503: add Retry-After: 30 header
+ sidecar_backup_success Prometheus metric
+ Startup crypto.py key validation on boot
+ SQLite size limits: 100MB practical, 500MB WAL
+ RPM flow: per-request counting, not token-based
+ SSE streaming: TTFT for avg_latency_ms
+ Merge proxy/retry.py into core/cooldown.py
Added sidecar-v2-nvidia-providers.yaml (11 keys)
Co-authored-by: multica-agent <github@multica.ai>
|
2026-06-25 15:19:21 +08:00 |
|