Google Gemini 2.5 Flash
Google Gemini 2.5 Flash
Google Gemini 2.5 Flash is a multimodal model, offering well-rounded capabilities. It features thinking capabilities, which lets you see the thinking process the model goes through when generating its response.
Model details
| Item | Value | Description |
|---|---|---|
| Model name | Google Gemini 2.5 Flash | The name of the model. |
| Model category | Standard | The category of the model - standard or premium. |
| API model name | google__gemini_2_5_flash | The name of the model that is used in the Box AI API for model overrides. The user must provide this exact name for the API to work. |
| Hosting layer | The trusted organization that securely hosts the LLM. | |
| Model provider | The organization that provides this model. | |
| Release date | June 17th 2025 | The release date for the model. |
| Knowledge cutoff date | January 2025 | The date after which the model does not get any information updates. |
| Input context window | 1m tokens | The number of tokens supported by the input context window. |
| Maximum output tokens | 64k tokens | The number of tokens that can be generated by the model in a single request. |
| Empirical throughput | Estimated between 163 - 300 | The number of tokens the model can generate per second. |
| Open source | No | Specifies if the model's code is available for public use. |
Additional documentation
For additional information, see official Google Gemini 2.5 Flash documentation.