
Optimize for 2% stability on Llama-3.1-70B (8xGPU)! #47

Open
BOSS10130206 wants to merge 1 commit into NVIDIA:main from BOSS10130206:patch-1

@BOSS10130206

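Tune mem-fraction and the concurrency parameters so that the system does not crash under a heavy load of 400 concurrent requests, achieving a stable 2% gain in output. The emphasis is on stability: that is what distinguishes this change from the 3% code that crashes the system.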
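
The description does not include the diff or the benchmark procedure. Purely as an illustration, the sketch below shows one way the "no crash at 400 concurrent requests" claim could be checked: cap the number of in-flight requests at 400 and count failures. The names `fake_request`, `run_load`, and `load_summary` are hypothetical, and the request is a local stub rather than a call to any real serving API.

```python
import asyncio
import random


async def fake_request(i: int, fail_rate: float = 0.0) -> bool:
    """Hypothetical stand-in for one inference request; True means success."""
    await asyncio.sleep(0)  # yield to the event loop; no real network I/O
    return random.random() >= fail_rate


async def run_load(total: int, concurrency: int, fail_rate: float = 0.0) -> dict:
    """Issue `total` requests with at most `concurrency` in flight at once."""
    sem = asyncio.Semaphore(concurrency)
    in_flight = 0
    peak = 0

    async def one(i: int) -> bool:
        nonlocal in_flight, peak
        async with sem:  # blocks once `concurrency` requests are in flight
            in_flight += 1
            peak = max(peak, in_flight)
            try:
                return await fake_request(i, fail_rate)
            finally:
                in_flight -= 1

    results = await asyncio.gather(*(one(i) for i in range(total)))
    ok = sum(results)
    return {
        "total": total,
        "ok": ok,
        "failed": total - ok,
        "peak_in_flight": peak,
    }


def load_summary(total: int = 2000, concurrency: int = 400,
                 fail_rate: float = 0.0) -> dict:
    """Synchronous wrapper so the check can be run from a plain script."""
    return asyncio.run(run_load(total, concurrency, fail_rate))


if __name__ == "__main__":
    print(load_summary())
```

In a real run, `fake_request` would be replaced by a call to the deployed endpoint, and a non-zero `failed` count at the 400-request cap would indicate that the configuration is not stable.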
