Starting April 29, 2025, Gemini 1.5 Pro and Gemini 1.5 Flash models are not available in projects that have no prior usage of these models, including new projects. For details, see Model versions and lifecycle.
Last updated 2025-06-27 UTC.

# Method: endpoints.predict

**Full name**: projects.locations.endpoints.predict

Perform an online prediction.

### Endpoint

POST `https://aiplatform.googleapis.com/v1beta1/{endpoint}:predict`

### Path parameters

`endpoint` `string`
Required. The name of the Endpoint requested to serve the prediction. Format: `projects/{project}/locations/{location}/endpoints/{endpoint}`

### Request body

The request body contains data with the following structure:

Fields

`instances[]` `value` ([Value](https://protobuf.dev/reference/protobuf/google.protobuf/#value) format)
Required. The instances that are the input to the prediction call. A DeployedModel may have an upper limit on the number of instances it supports per request. If that limit is exceeded, the prediction call returns an error for AutoML Models; for customer-created Models, the behavior is as documented by that Model. The schema of any single instance may be specified via Endpoint's DeployedModels' Model's PredictSchemata's `instanceSchemaUri`.

`parameters` `value` ([Value](https://protobuf.dev/reference/protobuf/google.protobuf/#value) format)
The parameters that govern the prediction. The schema of the parameters may be specified via Endpoint's DeployedModels' Model's PredictSchemata's `parametersSchemaUri`.

### Response body

If successful, the response body contains an instance of [PredictResponse](/vertex-ai/generative-ai/docs/reference/rest/v1beta1/PredictResponse).
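
### Example

The request described above can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the project, location, and endpoint ID, the instance fields (`feature_a`, `feature_b`), and the `confidence_threshold` parameter are all hypothetical placeholders, since the real shapes are defined by the deployed Model's `instanceSchemaUri` and `parametersSchemaUri`. Passing an OAuth 2.0 access token as a `Bearer` header is likewise an assumption not stated on this page. The sketch only builds the request and does not send it.

```python
import json
import urllib.request

# Host and API version as given in the Endpoint section of this page.
BASE_URL = "https://aiplatform.googleapis.com/v1beta1"


def build_predict_request(endpoint, instances, parameters=None, access_token=None):
    """Build (but do not send) a POST request for endpoints.predict.

    endpoint: "projects/{project}/locations/{location}/endpoints/{endpoint}"
    instances: list of JSON-compatible values (required)
    parameters: optional JSON-compatible value governing the prediction
    access_token: optional token; Bearer authentication is an assumption here
    """
    # `instances` is required; rejecting an empty list is a choice made by
    # this sketch, not a rule documented on this page.
    if not instances:
        raise ValueError("instances is required")

    body = {"instances": list(instances)}
    if parameters is not None:
        body["parameters"] = parameters

    headers = {"Content-Type": "application/json"}
    if access_token:
        headers["Authorization"] = f"Bearer {access_token}"

    return urllib.request.Request(
        url=f"{BASE_URL}/{endpoint}:predict",
        data=json.dumps(body).encode("utf-8"),
        headers=headers,
        method="POST",
    )


# Hypothetical resource name, instance fields, and parameter.
request = build_predict_request(
    endpoint="projects/my-project/locations/us-central1/endpoints/1234567890",
    instances=[{"feature_a": 1.0, "feature_b": "x"}],
    parameters={"confidence_threshold": 0.5},
)
print(request.full_url)
print(request.data.decode("utf-8"))
```

With valid credentials, `urllib.request.urlopen(request)` would send the call, and a successful response body could be parsed with `json.loads` as a PredictResponse. Because a DeployedModel may cap the number of instances per request, a caller with many instances may need to split them across several requests.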