IBM Llama 3.2 Vision Instruct

IBM Llama 3.2 Vision Instruct is a model built for image-in, text-out use cases such as document-level understanding, interpretation of charts and graphs, and captioning of images.

Item	Value	Description
Model name	IBM Llama 3.2 Vision Instruct	The name of the model.
Model category	Standard	The category of the model - standard or premium.
API model name	`ibm__llama__3_2_90b_vision_instruct`	The name of the model that is used in the Box AI API for model overrides. The user must provide this exact name for the API to work.
Hosting layer	IBM	The trusted organization that securely hosts the LLM.
Model provider	Meta	The organization that provides this model.
Release date	September 25th 2024	The release date for the model.
Knowledge cutoff date	December 2023	The date after which the model does not get any information updates.
Input context window	128k	The number of tokens supported by the input context window.
Maximum output tokens	Not specified	The number of tokens that can be generated by the model in a single request.
Empirical throughput	Not specified	The number of tokens the model can generate per second.
Open source	Yes	Specifies if the model's code is available for public use.
IP infringement protection	No	Use of this model does not come with any intellectual property rights assurances or protections from Box. Please consider any potential IP issues that might arise from using the model’s outputs.
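
As a rough illustration of how the API model name from the table might be supplied as a model override, the sketch below builds a JSON request body for a Box AI "ask" request. The field layout (`ai_agent`, `basic_text`, `model`), the `single_item_qa` mode, and the helper name `build_ask_payload` are assumptions made for this example and are not confirmed by this page; check the Box AI API reference before relying on them. No network call is made.

```python
import json

# Exact API model name, copied from the table above.
MODEL_NAME = "ibm__llama__3_2_90b_vision_instruct"


def build_ask_payload(file_id: str, prompt: str, model: str = MODEL_NAME) -> dict:
    """Build a request body that overrides the default model (hypothetical helper).

    Assumption: the override lives under ai_agent -> basic_text -> model.
    The agent section that applies to your content may differ.
    """
    return {
        "mode": "single_item_qa",  # assumed mode for a single-file question
        "prompt": prompt,
        "items": [{"type": "file", "id": file_id}],
        "ai_agent": {
            "type": "ai_agent_ask",
            "basic_text": {"model": model},
        },
    }


if __name__ == "__main__":
    # "1234567890" is a placeholder file ID, not a real item.
    payload = build_ask_payload("1234567890", "Describe the chart in this image.")
    print(json.dumps(payload, indent=2))
```

Keeping the model name in a single constant helps avoid typos, since the API expects the name exactly as listed.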

For additional information, see the official IBM Llama 3.2 Vision Instruct documentation.
