Compare Large Language Model Endpoint Prices

"Once upon a time, in a land not too far away, I ventured into the realm of Large Language Models without comparing the endpoint prices. Oh, the tales I could tell of the gold coins I could have saved, the unicorns I missed out on, and the mystical discounts that slipped through my fingers. Lesson learned: Don't be like me, always compare LLM Endpoint prices!"
- Someone, somewhere, probably

Find the Best Deals on Cutting-Edge Language Model Endpoints


Explore and compare prices of state-of-the-art Large Language Models (LLMs) from different providers. Find cost-effective and advanced models for your needs. Make an informed decision with our user-friendly interface.

Enter your email address in this form to receive notifications about significant price changes and the addition of new models!

Provider Model Type Input Price($)/1M Tokens Output Price($)/1M Tokens
OpenAI gpt-4-1106-preview chat 10 30
OpenAI gpt-4-1106-vision-preview chat 10 30
OpenAI gpt-4 chat 30 60
OpenAI gpt-4-32k chat 60 120
OpenAI gpt-3.5-turbo-1106 chat 1 2
OpenAI gpt-3.5-turbo-instruct chat 1.5 2
OpenAI ada v2 embeddings 0.1
OpenAI davinci-002 base 2
OpenAI babbage-002 base 0.4
Mistral mistral-tiny chat 0.1526 0.42
Mistral mistral-small chat 0.654 1.8
Mistral mistral-medium chat 2.725 7.5
Mistral mistral-medium embeddings 0.10900000000000001
Anyscale Mistral-7B-OpenOrca chat 0.15 0.15
Anyscale Mistral-7B-Instruct-v0.1 chat 0.15 0.15
Anyscale Zephyr-7b-beta chat 0.15 0.15
Anyscale Llama-Guard-7b chat 0.15 0.15
Anyscale Llama-2-7b-chat-hf chat 0.15 0.15
Anyscale Llama-2-13b-chat-hf chat 0.25 0.25
Anyscale Mixtral-8x7B-Instruct-v0.1 chat 0.5 0.5
Anyscale Llama-2-70b-chat-hf chat 1.0 1.0
Anyscale CodeLlama-34b-Instruct-hf chat 1.0 1.0
Anyscale thenlper-gte-large chat 0.05 0.05
Together up to 4B chat 0.1 0.1
Together 4.1B to 8B chat 0.2 0.2
Together 8B to 21B chat 0.3 0.3
Together 21B to 41B chat 0.8 0.8
Together 41B to 70B chat 0.9 0.9
Together llama-7b chat 0.2 0.2
Together llama-13b chat 0.225 0.225
Together llama-13b chat 0.9 0.9
Together mixture of experts 8X7B chat 0.6 0.6
DeepInfra Llama-2-7b-chat chat 0.13 0.13
DeepInfra Mistral-7B chat 0.13 0.13
DeepInfra OpenChat-3.5 chat 0.13 0.13
DeepInfra MistralLite chat 0.13 0.13
DeepInfra Yi-6B-200K chat 0.13 0.13
DeepInfra Llama-2-13b-chat chat 0.22 0.22
DeepInfra MythoMax-L2-13b chat 0.22 0.22
DeepInfra mixtral-8x7B-chat chat 0.27 0.27
DeepInfra Yi-34B-Chat chat 0.6 0.6
DeepInfra CodeLlama-34b-Instruct chat 0.6 0.6
DeepInfra Phind-CodeLlama-34B-v2 chat 0.6 0.6
DeepInfra Llama-2-70b-chat chat 0.7 0.9
DeepInfra Airoboros-70b chat 0.7 0.9
DeepInfra Lzlv-70b chat 0.7 0.9
MosaicML Instructor-Large chat 0.1 0.1
MosaicML Instructor-XL chat 0.2 0.2
MosaicML Llama-2-70b-Chat chat 2 2
MosaicML MPT-30B Instruct chat 1 1
MosaicML MPT-7B Instruct(fastest) chat 0.5 0.5