generativeai / inference / README.md References Github - vllm Github - sglang Github - zml Infrastructure - Skypilot Github - ggml-org: llama.cpp Showcase - smolvlm-realtime-webcam