Inference API by Hugging Face
One free token unlocks thousands of open models (text, image, embeddings) via a single API.
Access
Free API tier
Free limits
Rate-limited · 1000s of models
Modality
text, image, embeddings
Credit card
Not required
Commercial use
Allowed
Last verified
June 2026
How to use Inference API
One free token unlocks thousands of open models (text, image, embeddings) via a single API.
Quickstart
curl https://api-inference.huggingface.co/models/meta-llama/Llama-3.3-70B-Instruct \
-H "Authorization: Bearer $HF_TOKEN" -d '{"inputs":"hi"}'Frequently asked
Is Inference API free?
Yes. Hugging Face offers it as Free API tier with these limits: Rate-limited · 1000s of models. No credit card is required.
Can I use Inference API commercially?
Yes, commercial use is allowed. Verify the current license or terms before shipping.
How do I start using Inference API?
One free token unlocks thousands of open models (text, image, embeddings) via a single API.
Related free models
- Kimi K2.6 (Ollama Cloud) · Free tier
- Gemini 2.5 Flash (Google AI Studio) · Generous · no card
- GLM 4.6 (Z.ai) · Self-host free
- @cf/openai/gpt-oss-120b (Cloudflare Workers AI) · 10K neurons/day (shared)
- bytedance-seed/dola-seed-2.0-pro:free (Kilo Code) · ~200 req/hr
- @cf/deepseek-ai/deepseek-r1-distill-qwen-32b (Cloudflare Workers AI) · 10K neurons/day (shared)
Want this wired into your business?
We build production automations and agents on free and paid models, picked for your workload and budget.
Book a build call