Optional fields: GoogleVertexAITextInput
Maximum number of tokens to generate in the completion.
Model to use
Sampling temperature to use
Top-k changes how the model selects tokens for output.
A top-k of 1 means the selected token is the most probable among all tokens in the model’s vocabulary (also called greedy decoding), while a top-k of 3 means that the next token is selected from among the 3 most probable tokens (using temperature).
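The top-k behavior described above can be sketched as a filter over candidate tokens. This is only an illustration of the idea, not the model's implementation: the `TokenProb` type, the `topKFilter` helper, and the probabilities are hypothetical.

```typescript
// Hypothetical token/probability pair, used only for this sketch.
type TokenProb = { token: string; prob: number };

// Keep only the k most probable candidates; sampling (using temperature)
// would then pick the next token from this reduced set.
function topKFilter(candidates: TokenProb[], k: number): TokenProb[] {
  return [...candidates].sort((a, b) => b.prob - a.prob).slice(0, k);
}

// Made-up vocabulary for illustration.
const vocab: TokenProb[] = [
  { token: "A", prob: 0.4 },
  { token: "B", prob: 0.3 },
  { token: "C", prob: 0.2 },
  { token: "D", prob: 0.1 },
];

console.log(topKFilter(vocab, 1).map((c) => c.token)); // [ 'A' ] (greedy decoding)
console.log(topKFilter(vocab, 3).map((c) => c.token)); // [ 'A', 'B', 'C' ]
```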
Top-p changes how the model selects tokens for output.
Tokens are selected from most probable to least until the sum of their probabilities equals the top-p value.
For example, if tokens A, B, and C have a probability of .3, .2, and .1 and the top-p value is .5, then the model will select either A or B as the next token (using temperature).
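The worked example above can be reproduced with a small sketch of top-p filtering. The `TokenProb` type and the `topPFilter` helper are hypothetical names for illustration, and the small tolerance on the comparison is an assumption added to absorb floating-point rounding. It is not the model's actual implementation.

```typescript
// Hypothetical token/probability pair, used only for this sketch.
type TokenProb = { token: string; prob: number };

// Walk the candidates from most to least probable and keep them until
// the running sum of probabilities reaches the top-p value.
function topPFilter(candidates: TokenProb[], p: number): TokenProb[] {
  const sorted = [...candidates].sort((a, b) => b.prob - a.prob);
  const kept: TokenProb[] = [];
  let cumulative = 0;
  for (const candidate of sorted) {
    kept.push(candidate);
    cumulative += candidate.prob;
    // Small tolerance (an assumption of this sketch) for rounding error.
    if (cumulative >= p - 1e-12) break;
  }
  return kept;
}

// Probabilities taken from the example in the text.
const candidates: TokenProb[] = [
  { token: "A", prob: 0.3 },
  { token: "B", prob: 0.2 },
  { token: "C", prob: 0.1 },
];

console.log(topPFilter(candidates, 0.5).map((c) => c.token)); // [ 'A', 'B' ]
```

Sampling (using temperature) would then choose between the tokens that remain, here A and B.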
Protected connection
Protected streamed
Formats the input instance for the Google Vertex AI model based on the model type (text or code).
Prompt to be formatted as an instance.
A GoogleVertexAILLMInstance object representing the formatted instance.
Formats the input instance as a code instance for the Google Vertex AI model.
Prompt to be formatted as a code instance.
A GoogleVertexAILLMInstance object representing the formatted code instance.
Formats the input instance as a text instance for the Google Vertex AI model.
Prompt to be formatted as a text instance.
A GoogleVertexAILLMInstance object representing the formatted text instance.
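The dispatch between text and code formatting described above might look roughly like the following sketch. Everything here is an assumption made for illustration: the `InstanceSketch` type stands in for `GoogleVertexAILLMInstance`, whose fields are not shown on this page, and the `content` and `prefix` field names and the explicit `modelType` parameter are hypothetical.

```typescript
// Hypothetical stand-in for GoogleVertexAILLMInstance; the real fields may differ.
type InstanceSketch = { content?: string; prefix?: string };

// Assumed shape of a text instance: the prompt goes into a `content` field.
function formatInstanceText(prompt: string): InstanceSketch {
  return { content: prompt };
}

// Assumed shape of a code instance: the prompt goes into a `prefix` field.
function formatInstanceCode(prompt: string): InstanceSketch {
  return { prefix: prompt };
}

// Choose the formatter based on the model type (text or code).
function formatInstance(prompt: string, modelType: "text" | "code"): InstanceSketch {
  return modelType === "code"
    ? formatInstanceCode(prompt)
    : formatInstanceText(prompt);
}

console.log(formatInstance("Tell me a joke", "text"));
console.log(formatInstance("function add(a, b) {", "code"));
```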
Enables calls to Google Cloud's Vertex AI API to access large language models.
To use, you will need to have one of the following authentication methods in place:
The GOOGLE_APPLICATION_CREDENTIALS environment variable is set to the path of a credentials file for a service account permitted to the Google Cloud project using Vertex AI.
Example