docs: split Memory-efficient Inference and GGUF+SageAttention into sub READMEs

#19
by gkalstn0 - opened
Motif Technologies org

Move detailed guides to docs/ for better README readability.

  • docs/memory-efficient-inference.md β€” CPU offload + FP8 guide
  • docs/gguf-sageattention.md β€” GGUF code + SageAttention + benchmark
  • README section headings and anchors preserved for existing references
  • README reduced by ~200 lines
gkalstn0 changed discussion status to closed

Sign up or log in to comment