Which GGUF is right for me? (Opinionated)

Good question! I am collecting human data on how quantization affects outputs. See here for more information: ggml-org/llama.cpp#5962

In the meantime, use the largest quantization that fits entirely in your GPU's memory. If Q4_K_S fits comfortably with room to spare, try a model with more parameters instead.
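The rule of thumb above can be sketched as a small selection helper. This is only an illustration: the function name `pick_largest_fitting`, the `headroom_gib` reserve, and the file sizes are all assumptions made for the example, not figures from llama.cpp or from any real model.

```python
from typing import Optional


def pick_largest_fitting(
    quant_sizes_gib: dict[str, float],
    vram_gib: float,
    headroom_gib: float = 1.0,
) -> Optional[str]:
    """Return the name of the largest quantization that fits, or None."""
    # Assumption: keep some VRAM free for the KV cache and runtime buffers.
    budget = vram_gib - headroom_gib
    # Keep only the quantizations whose file size is within the budget.
    fitting = {name: size for name, size in quant_sizes_gib.items() if size <= budget}
    if not fitting:
        return None
    # Among those that fit, the largest file is the least aggressively quantized.
    return max(fitting, key=fitting.get)


# Hypothetical file sizes in GiB, for illustration only.
sizes = {"Q4_K_S": 4.0, "Q5_K_M": 5.0, "Q8_0": 8.0}
print(pick_largest_fitting(sizes, vram_gib=8.0))
```

With these made-up sizes and an 8 GiB card, the helper picks Q5_K_M, because Q8_0 would leave no headroom. If nothing fits it returns `None`, which would be the cue to try a smaller model.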

llama.cpp feature matrix

See the wiki upstream: https://github.com/ggerganov/llama.cpp/wiki/Feature-matrix

lxdlam / README.md

Created April 1, 2026 21:11 — forked from emschwartz/README.md

The Most Popular Blogs of Hacker News in 2025

This is an OPML version of the HN Popularity Contest results for 2025, for importing into RSS feed readers.

Plug: if you want to find content related to your interests from thousands of obscure blogs and noisy sources like HN Newest, check out Scour. It's a free, personalized content feed I work on where you define your interests in your own words and it ranks content based on how closely related it is to those topics.

lxdlam / game-networking-links.md

Created November 18, 2024 08:26 — forked from shmolyneaux/game-networking-links.md

Game Network Programming Resources

lxdlam / links.md

Created March 7, 2024 06:02 — forked from Leandros/links.md

Writing a Modern Rendering Engine

Designing a Modern GPU Interface by @BrookeHodgman
Optimizing the Graphics Pipeline with Compute by @gwihlidal
GPU Driven Rendering Pipelines by @SebAaltonen
Destiny’s Multi-threaded Renderer Architecture by @Mirror2Mask
Stingray Renderer Walkthrough by @tobias_persson
