Orchestrate SLMs. Unlock agentic AI

Arcee Orchestra delivers agentic AI that you can customize to do exactly what you need, without compromising on speed or data security.

Frequently
Asked Questions

What exactly is the Arcee Model Engine, and how can I use it?

The Arcee Model Engine is a hosted inference service currently in public beta and available at models.arcee.ai. It provides access to Arcee AI’s suite of specialized Small Language Models (SLMs), designed to work with Arcee Orchestra for intelligent agentic orchestration.

These models are optimized for a wide range of tasks, from general-purpose activities to specialized use cases such as vision-language processing and advanced reasoning. You can easily integrate these models into your workflows or use them as standalone systems for maximum flexibility.

What are the models available on the Arcee Model Engine?

The available models include Virtuoso (Large, Medium, Small) for general-purpose tasks, Caller Large for executing complex workflows and system interactions, and Coder (Large, Small) for programming and development tasks across various languages. Additional specialized models are in the pipeline and will be automatically available to all subscribers.

How do I get started with Model Engine?

To get started, you can either chat directly through Arcee Model Engine–where you'll find a built-in chat interface for each model after signing up–or integrate our models into your own applications. Integration is straightforward since all Arcee AI models are compatible with OpenAI's inference format, requiring only a few environment variables to set up. For more information, please check out our announcement blog here.

How does Model Engine’s subscription and token billing work?

When you sign up for Model Engine, you'll be charged a $20 monthly subscription fee that goes into your wallet. This allows you to start chatting with Arcee Model Engine models. Upon your first use of a model, 2 million tokens (1M input + 1M output) are deducted from your wallet as an initial charge. Additional charges apply each time you exceed 1M tokens. API calls pause if your wallet hits zero until you add more funds.

What are the specific rates per token inside Model Engine?

Model
Input Cost (1/M Tokens)
Output Cost (1/M Tokens)
Virtuoso Large
$1.27
$1.50
Virtuoso Medium
$0.67
$0.82
Virtuoso Small
$0.40
$0.52
Coder
$0.67
$0.82
Coder Small
$0.40
$0.52
Caller
$0.67
$0.82
Maestro
$1.59
$1.88
Spotlight
$0.29
$0.40

Ready to see more?

Contact us for a custom demo.

Talk to us