Box Developer Documentation

IBM Llama 3.2 Vision Instruct

IBM Llama 3.2 Vision Instruct is a model built for image-in, text-out use cases such as document-level understanding, interpretation of charts and graphs, and captioning of images.

Model details

Item | Value | Description
---- | ----- | -----------
Model name | IBM Llama 3.2 Vision Instruct | The name of the model.
API model name | ibm__llama__3_2_90b_vision_instruct | The name of the model used in the Box AI API for model overrides. You must provide this exact name for the API to work.
Hosting layer | IBM | The trusted organization that securely hosts the LLM.
Model provider | Meta | The organization that provides this model.
Release date | September 25, 2024 | The release date for the model.
Knowledge cutoff date | December 2023 | The date after which the model does not get any information updates.
Input context window | 128k tokens | The number of tokens supported by the input context window.
Maximum output tokens | Not specified | The number of tokens that can be generated by the model in a single request.
Empirical throughput | Not specified | The number of tokens the model can generate per second.
Open source | Yes | Specifies if the model's code is available for public use.
IP infringement protection | No | Use of this model does not come with any intellectual property rights assurances or protections from Box. Please consider any potential IP issues that might arise from using the model's outputs.
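
The API model name listed above is the value passed when overriding the default model in a Box AI request. The following is a minimal sketch of how such a request body might be built. The endpoint URL, the `ai_agent` and `basic_text` field names, the `single_item_qa` mode, and the file ID are assumptions for illustration and are not stated on this page; only the model name comes from the table. Check the Box AI API reference for the exact schema and for whether a given endpoint accepts this model.

```python
import json

# Exact API model name, copied from the "Model details" table.
MODEL_NAME = "ibm__llama__3_2_90b_vision_instruct"

# Assumed endpoint for illustration; not stated on this page.
ASK_URL = "https://api.box.com/2.0/ai/ask"


def build_ask_payload(file_id: str, prompt: str) -> dict:
    """Build a request body that overrides the default model.

    The field layout below is an assumed schema, not a confirmed one.
    """
    return {
        "mode": "single_item_qa",
        "prompt": prompt,
        "items": [{"type": "file", "id": file_id}],
        "ai_agent": {
            "type": "ai_agent_ask",
            # The override requires the exact API model name.
            "basic_text": {"model": MODEL_NAME},
        },
    }


# Hypothetical file ID and prompt, used only to show the resulting JSON.
payload = build_ask_payload("1234567890", "Describe the chart in this file.")
print(json.dumps(payload, indent=2))
```

The sketch only prints the JSON body. Sending it would require an authenticated POST to the endpoint with a valid access token, which is omitted here.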

Additional documentation

For additional information, see the official IBM Llama 3.2 Vision Instruct documentation.