Llama 3.3 70B by Groq

OpenAI-compatible chat API on the fastest inference hardware (LPUs). Great default free LLM for chat + tool-calling.

Access
Free API tier
Free limits
1,000 RPD · 30 RPM
Modality
text, code
Credit card
Not required
Commercial use
Allowed
Model ID
llama-3.3-70b-versatile
Base URL
https://api.groq.com/openai/v1
Last verified
June 2026

How to use Llama 3.3 70B

OpenAI-compatible chat API on the fastest inference hardware (LPUs). Great default free LLM for chat + tool-calling.

Quickstart

curl https://api.groq.com/openai/v1/chat/completions \
  -H "Authorization: Bearer $GROQ_KEY" \
  -d '{"model":"llama-3.3-70b-versatile","messages":[{"role":"user","content":"hi"}]}'

Frequently asked

Is Llama 3.3 70B free?

Yes. Groq offers it as Free API tier with these limits: 1,000 RPD · 30 RPM. No credit card is required.

Can I use Llama 3.3 70B commercially?

Yes, commercial use is allowed. Verify the current license or terms before shipping.

How do I start using Llama 3.3 70B?

OpenAI-compatible chat API on the fastest inference hardware (LPUs). Great default free LLM for chat + tool-calling.

Related free models

Want this wired into your business?

We build production automations and agents on free and paid models, picked for your workload and budget.

Book a build call