Skip to content

Instantly share code, notes, and snippets.

@crmne
Last active April 3, 2025 16:26
Show Gist options
  • Save crmne/301be1d38ff193e7274a69833947139a to your computer and use it in GitHub Desktop.
Save crmne/301be1d38ff193e7274a69833947139a to your computer and use it in GitHub Desktop.
Proposed multi-provider capabilities and pricing API. Blog post: https://paolino.me/standard-api-llm-capabilities-pricing/
# This is a YAML file so I can have comments but the API should obviously return an array of models in JSON.
# Legend:
# Required: this is important to have in v1.
# Optional: this is still important but can wait for v2.
id: gpt-4.5-preview # Required, will match it with the OpenAI API
display_name: GPT-4.5 Preview # Required
provider: openai # Required
family: gpt45 # Optional, each model page is a family for OpenAI models
context_window: 128000 # Required
max_output_tokens: 16384 # Required
knowledge_cutoff: 20231001 # Optional
modalities:
text:
input: true # Required
output: true # Required
image:
input: true # Required
output: false # Required
audio:
input: false # Required
output: false # Required
pdf_input: false # Optional - from Anthropic and Google
embeddings_output: false # Required
moderation_output: false # Optional
capabilities:
streaming: true # Optional
function_calling: true # Required
structured_output: true # Required
predicted_outputs: false # Optional
distillation: false # Optional
fine_tuning: false # Optional
batch: true # Required
realtime: false # Optional
image_generation: false # Required
speech_generation: false # Required
transcription: false # Required
translation: false # Optional
citations: false # Optional - from Anthropic
reasoning: false # Optional - called Extended Thinking in Anthropic's lingo
pricing:
text_tokens:
standard:
input_per_million: 75.0 # Required
cached_input_per_million: 37.5 # Required
output_per_million: 150.0 # Required
reasoning_output_per_million: 0 # Optional
batch:
input_per_million: 37.5 # Required
output_per_million: 75.0 # Required
images:
standard:
input: 0.0 # Optional
output: 0.0 # Optional
batch:
input: 0.0 # Optional
output: 0.0 # Optional
audio_tokens:
standard:
input_per_million: 0.0 # Optional
output_per_million: 0.0 # Optional
batch:
input_per_million: 0.0 # Optional
output_per_million: 0.0 # Optional
embeddings:
standard:
input_per_million: 0.0 # Required
batch:
input_per_million: 0.0 # Required
@crmne
Copy link
Author

crmne commented Apr 2, 2025

Thank you so much for the info @MadBomber!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment