Llama-4-Scout-17B-16E by GitHub Models
Free prototyping for all GitHub users. 45+ models. Per-request limits (8K in / 4K out).
Access
Free API tier
Free limits
15 RPM, 150 RPD
Modality
image, text
Credit card
Not required
Commercial use
Unclear, verify
Context window
512,000 tokens
Model ID
meta/Llama-4-Scout-17B-16E
Base URL
https://models.github.ai/inference
Last verified
June 2026
How to use Llama-4-Scout-17B-16E
Free prototyping for all GitHub users. 45+ models. Per-request limits (8K in / 4K out).
Quickstart
curl https://models.github.ai/inference/chat/completions \
-H "Authorization: Bearer $KEY" \
-d '{"model":"meta/Llama-4-Scout-17B-16E","messages":[{"role":"user","content":"hi"}]}'Frequently asked
Is Llama-4-Scout-17B-16E free?
Yes. GitHub Models offers it as Free API tier with these limits: 15 RPM, 150 RPD. No credit card is required.
Can I use Llama-4-Scout-17B-16E commercially?
Commercial terms are not clearly documented. Check the provider's current terms before shipping.
How do I start using Llama-4-Scout-17B-16E?
Free prototyping for all GitHub users. 45+ models. Per-request limits (8K in / 4K out).
Related free models
- Kimi K2.6 (Ollama Cloud) · Free tier
- Gemini 2.5 Flash (Google AI Studio) · Generous · no card
- GLM 4.6 (Z.ai) · Self-host free
- @cf/openai/gpt-oss-120b (Cloudflare Workers AI) · 10K neurons/day (shared)
- bytedance-seed/dola-seed-2.0-pro:free (Kilo Code) · ~200 req/hr
- @cf/deepseek-ai/deepseek-r1-distill-qwen-32b (Cloudflare Workers AI) · 10K neurons/day (shared)
Want this wired into your business?
We build production automations and agents on free and paid models, picked for your workload and budget.
Book a build call