Use the newer, more space-efficient Llama 3.2 model and avoid maxing out the context window by default (!84)

Adam Reichold requested to merge revamp-llama into main Oct 05, 2024

The context window comes with sensible defaults, and we should not set it to the maximum without good reason, because doing so significantly increases memory consumption.
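To make the memory argument concrete, here is a rough back-of-the-envelope sketch of how the key/value cache grows with the context length. The layer count, KV head count, head dimension and 16-bit cache values are assumptions chosen for illustration, not values taken from this merge request, and the helper `kv_cache_bytes` is hypothetical. Real runtimes add further overhead on top of this.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, bytes_per_value=2):
    """Estimate the KV cache size for a single sequence.

    Keys and values are both cached for every layer, hence the factor of 2.
    """
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_value


# Assumed hyperparameters for a small Llama-style model (illustrative only).
ASSUMED_MODEL = dict(n_layers=28, n_kv_heads=8, head_dim=128)

if __name__ == "__main__":
    # Compare a modest window, a larger one, and a 128K-token maximum.
    for context_len in (2048, 8192, 131072):
        size = kv_cache_bytes(context_len=context_len, **ASSUMED_MODEL)
        print(f"{context_len:>7} tokens: {size / 2**30:.2f} GiB")
```

Under these assumptions the cache grows linearly with the context length, so going from a 2048-token window to the 131072-token maximum multiplies its size by 64.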

