IBM Llama 3.2 Vision Instruct
IBM Llama 3.2 Vision Instruct
IBM Llama 3.2 Vision Instruct is a model built for image-in, text-out use cases such as document-level understanding, interpretation of charts and graphs, and captioning of images.
Model details
Item | Value | Description |
---|---|---|
Model name | IBM Llama 3.2 Vision Instruct | The name of the model. |
API model name | ibm__llama__3_2_90b_vision_instruct | The name of the model that is used in the Box AI API for model overrides. The user must provide this exact name for the API to work. |
Hosting layer | IBM | The trusted organization that securely hosts LLM. |
Model provider | Meta | The organization that provides this model. |
Release date | September 25th 2024 | The release date for the model. |
Knowledge cutoff date | December 2023 | The date after which the model does not get any information updates. |
Input context window | 128k | The number of tokens supported by the input context window. |
Maximum output tokens | Not specified | The number of tokens that can be generated by the model in a single request. |
Empirical throughput | Not specified | The number of tokens the model can generate per second. |
Open source | Yes | Specifies if the model's code is available for public use. |
IP infringement protection | No | Use of this model does not come with any intellectual property rights assurances or protections from Box. Please consider any potential IP issues that might arise from using the model’s outputs. |
Additional documentation
For additional information, see official IBM Llama 3.2 Vision Instruct documentation.