Llama 4 - A new era of natively multimodal AI innovation
Llama 4
A new era of natively multimodal AI innovation
Screenshots

Hunter's comment
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding. Can't wait to try this out. We're experimenting with running models on-device for our product (desktop app) but haven't been able to get great results yet for the average laptop. Looking forward to see the reality of inference speeds for these models.
Llama 4 Scout:
• 17B x 16 experts
• Natively multi-modal
• 10M token context length
• Runs on a single GPU
• Highest performing small model
Llama 4 Maverick:
• 17B x 128 experts
• Natively multi-modal
• Beats GPT-4o and Gemini Flash 2
• Smaller and more efficient than DeepSeek, but still comparable on text, plus also multi-modal
• Runs on a single host
Llama 4 Behemoth:
• 2+ trillion parameters
• Highest performing base model
• Still training!
Link
https://www.producthunt.com/posts/llama-4-5

This is posted on Steemhunt - A place where you can dig products and earn STEEM.
View on Steemhunt.com
Upvoted! Thank you for supporting witness @jswit.
Congratulations!
We have upvoted your post for your contribution within our community.
Thanks again and look forward to seeing your next hunt!
Want to chat? Join us on: