Model Cascade / Fallback Chain

ML Infrastructure

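Each query goes first to a small, fast, inexpensive model. If that model's confidence falls below a threshold, the query is automatically escalated to a larger model, balancing cost efficiency against response quality.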

agent, system
Why OSOP

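Model cascading is a key cost-optimization strategy. OSOP defines the model selection flow with conditional logic and fallback edges, so you can adjust thresholds and model combinations easily, and it logs every routing decision for later analysis and tuning.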
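The routing idea can be sketched in plain Python. This is an illustration only, not OSOP syntax: `Model`, `RoutingDecision`, `Cascade`, `fake_fast`, and `fake_large` are hypothetical names, and the sketch assumes each model call returns a confidence score between 0 and 1 alongside its answer. The 0.8 default mirrors the threshold used in this workflow.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Assumption: a model is any callable returning (answer, confidence in [0, 1]).
Model = Callable[[str], Tuple[str, float]]


@dataclass
class RoutingDecision:
    """One logged routing decision, kept for later analysis."""
    query: str
    confidence: float
    escalated: bool


@dataclass
class Cascade:
    fast_model: Model
    large_model: Model
    threshold: float = 0.8  # adjustable without touching the routing logic
    decisions: List[RoutingDecision] = field(default_factory=list)

    def answer(self, query: str) -> str:
        # Always try the cheap model first.
        fast_answer, confidence = self.fast_model(query)
        escalated = confidence < self.threshold
        self.decisions.append(RoutingDecision(query, confidence, escalated))
        if escalated:
            # Fall back to the large model when confidence is too low.
            large_answer, _ = self.large_model(query)
            return large_answer
        return fast_answer

    def escalation_rate(self) -> float:
        """Fraction of logged queries that were sent to the large model."""
        if not self.decisions:
            return 0.0
        return sum(d.escalated for d in self.decisions) / len(self.decisions)


# Hypothetical stand-ins for real model calls.
def fake_fast(query: str) -> Tuple[str, float]:
    confidence = 0.95 if len(query) < 20 else 0.4
    return f"fast: {query}", confidence


def fake_large(query: str) -> Tuple[str, float]:
    return f"large: {query}", 0.99


cascade = Cascade(fake_fast, fake_large)
```

Calling `cascade.answer(...)` on a mix of queries and then reading `cascade.escalation_rate()` gives a rough signal for tuning: a rate near 1.0 suggests the threshold is too strict for the cascade to save much.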

Workflow Steps (5)

#  Step                 Type
1  Receive Query        event
2  Fast Model (Haiku)   agent
3  Confidence Check     system
4  Large Model (Opus)   agent
5  Return Response      api

Connections (5)

From                 To                   Type         Condition
Receive Query        Fast Model (Haiku)   sequential
Fast Model (Haiku)   Confidence Check     sequential
Confidence Check     Return Response      conditional  confidence >= 0.8
Confidence Check     Large Model (Opus)   conditional  confidence < 0.8
Large Model (Opus)   Return Response      sequential
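To see how these connections resolve for a given confidence score, the edges can be written as data and walked in order. This is a minimal Python sketch, not the OSOP file format: `EDGES`, `next_step`, and `trace` are hypothetical names, and sequential edges are modeled as edges with no condition.

```python
from typing import Callable, List, Optional, Tuple

# A condition takes the confidence score; None marks a sequential edge.
Condition = Optional[Callable[[float], bool]]

THRESHOLD = 0.8

# (from, to, condition), in the order the connections are listed.
EDGES: List[Tuple[str, str, Condition]] = [
    ("Receive Query", "Fast Model (Haiku)", None),
    ("Fast Model (Haiku)", "Confidence Check", None),
    ("Confidence Check", "Return Response", lambda c: c >= THRESHOLD),
    ("Confidence Check", "Large Model (Opus)", lambda c: c < THRESHOLD),
    ("Large Model (Opus)", "Return Response", None),
]


def next_step(current: str, confidence: float) -> Optional[str]:
    """Return the first outgoing edge whose condition holds, if any."""
    for source, target, condition in EDGES:
        if source == current and (condition is None or condition(confidence)):
            return target
    return None


def trace(confidence: float, start: str = "Receive Query") -> List[str]:
    """Follow edges from the start node until no outgoing edge matches."""
    path = [start]
    while (nxt := next_step(path[-1], confidence)) is not None:
        path.append(nxt)
    return path
```

Because the two conditional edges use `>=` and `<` on the same threshold, exactly one of them matches for any score, so a confidence of exactly 0.8 returns the fast model's answer directly.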
5 Steps, 5 Connections, 4 Node Types