Llama-3.2-90B-Vision-Instruct-Turbo
Model Overview
How to Make a Call
API Schema
Creates a chat completion using a language model, allowing interactive conversation by predicting the next response based on the given chat history. This is useful for AI-driven dialogue systems and virtual assistants.
Authorizations
AuthorizationstringRequired
Bearer key
Body
modelundefined · enumRequiredPossible values:
max_tokensnumber · min: 1OptionalDefault:
512stopany ofOptional
stringOptional
string[]Optional
any · nullableOptional
streambooleanOptionalDefault:
falseninteger · min: 1Optional
seedinteger · min: 1Optional
top_pnumber · min: 0.01 · max: 1Optional
top_knumberOptional
temperaturenumberOptional
repetition_penaltynumber · nullableOptional
logprobsboolean · nullableOptional
echobooleanOptional
min_pnumber · max: 1Optional
presence_penaltynumber · min: -2 · max: 2 · nullableOptional
frequency_penaltynumber · min: -2 · max: 2 · nullableOptional
tool_choiceany ofOptional
string · enumOptionalPossible values:
response_formatone ofOptional
or
or
Responses
201Success
post
/v1/chat/completions201Success
No content
Code Example (Python)
Last updated