If a user downloads ggml-medium.bin today, they are likely using a "legacy" version of llama.cpp. Modern implementations now use files named like llama-2-7b-chat.Q4_K_M.gguf.
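A file's name is only a hint, so one way to tell the formats apart is to check how the file begins. The sketch below assumes only that a GGUF file starts with the four ASCII bytes GGUF. The function name looks_like_gguf is hypothetical, and a file that fails the check is reported simply as "not GGUF", with no attempt to identify which older format it is.

```python
GGUF_MAGIC = b"GGUF"  # assumption: GGUF files begin with these four ASCII bytes


def looks_like_gguf(path):
    """Return True if the file at `path` begins with the GGUF magic bytes.

    Hypothetical helper for illustration; it reads only the first four bytes
    and does not validate the rest of the file.
    """
    with open(path, "rb") as f:
        return f.read(4) == GGUF_MAGIC
```

Calling looks_like_gguf("llama-2-7b-chat.Q4_K_M.gguf") on a real GGUF download should return True, while a file in an older format returns False regardless of its extension.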

and is often recommended as the "sweet spot" for users who need reliable transcription without the massive hardware requirements of the "large" models.

Common Uses
