Try our new intelligent model routing solution, Arcee Conductor. Sign up today and get a $200 credit (~400M free tokens).

Introducing Arcee Conductor

A new standard for intelligent model routing.

Conductor intelligently routes your prompt to the best model, to efficiently deliver precise results, for any task.

The right model for each prompt, every time.


Today’s AI is about more than the single best large language model (LLM)

With new AI models released daily, it's hard to keep up with which model is best for your business. Our pioneering work in small language models (SLMs) gives us unique insights into which model is the right one for your tasks or queries.

That's why we built Arcee Conductor, a model-agnostic platform that gives you access to a complete suite of top-performing SLMs, as well as other industry-leading LLMs.

Arcee Conductor intelligently routes your query to the optimal model based on factors like industry or specialty, complexity, efficiency, and cost, all in an easy-to-use interface that requires no technical expertise.

How Arcee Conductor works

Arcee Conductor takes intelligent model routing to a new level

Precise, efficient routing

Automatically routes your query based on its complexity, task type, and industry or domain, and on whether it involves tool calling, function calling, or other requirements.

World-class AI, cost-effective pricing

Cost-per-query reduced by 85% using Arcee AI SLMs, with live visibility into query costs, and suggestions for maximizing performance while minimizing costs.

Diverse selection of world-class models

Includes Arcee AI SLMs like Blitz, Virtuoso-Medium, and Virtuoso-Large, as well as leading LLMs such as Google Gemini 2.0 Flash, OpenAI GPT-4o, and Anthropic Claude Sonnet 3.7, with more to come soon.

Advanced query processing

Supports chain-of-thought reasoning for enhanced analytical capabilities, mixture-of-agents functionality for parallel processing across models, and automatic temperature and compute scaling for task-specific optimization.

One platform, diverse models

The only routing platform that utilizes open-source and closed-source models, including our proprietary SLMs and our distilled DeepSeek models.

Customizable model settings

User-defined preferences for routing parameters and model prioritization, with preset profiles for different tasks.

Try our industry-leading small language models

Arcee AI creates powerful, highly efficient small language models (SLMs). They contain significantly fewer parameters than large language models, which makes them cost-effective and fast. But because they're purpose-built for specific tasks, data, and requirements, they perform remarkably well.


Pricing

Arcee Conductor "Auto mode" automatically chooses the best models to use, within a defined list of available models, based on complexity and efficiency.
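To make the idea of Auto mode concrete, here is a minimal sketch of complexity-and-cost routing over a defined model list. It is not Conductor's actual routing logic, which this page does not describe. The `Model` class, the `estimate_complexity` heuristic, the `route` function, and every `max_complexity` tier value are hypothetical placeholders for illustration; only the prices come from the pricing table on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_price: float   # USD per million input tokens (from the pricing table)
    output_price: float  # USD per million output tokens (from the pricing table)
    max_complexity: int  # HYPOTHETICAL capability tier: 1 (simple) to 3 (hard)


# Prices are from this page; the tier numbers are illustrative placeholders,
# not a statement about what each model can actually do.
MODELS = [
    Model("Arcee-Blitz", 0.03, 0.05, 1),
    Model("Virtuoso-Medium", 0.09, 0.35, 2),
    Model("Virtuoso-Large", 0.12, 0.50, 3),
    Model("GPT-4o", 2.50, 10.00, 3),
]

# Assumed keywords that hint at a harder task (toy heuristic only).
HARD_KEYWORDS = {"prove", "derive", "analyze", "refactor"}


def estimate_complexity(prompt: str) -> int:
    """Toy heuristic: long prompts and reasoning keywords raise the score."""
    words = prompt.lower().split()
    score = 1
    if len(words) > 50:
        score += 1
    if any(word in HARD_KEYWORDS for word in words):
        score += 1
    return min(score, 3)


def route(prompt: str, allowed: list[Model]) -> Model:
    """Pick the cheapest allowed model whose tier covers the prompt."""
    needed = estimate_complexity(prompt)
    capable = [m for m in allowed if m.max_complexity >= needed]
    if not capable:
        raise ValueError("no allowed model can handle this prompt")
    return min(capable, key=lambda m: m.input_price + m.output_price)


# A short factual question scores 1, so the cheapest model in the list wins.
print(route("What is the capital of France?", MODELS).name)
```

A production router would presumably rely on a learned classifier rather than keyword matching, but the shape of the decision is the same: filter the allowed list by what the prompt needs, then choose the most efficient option that remains.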

With the Enterprise Tier, you can unlock additional Arcee Conductor options, such as volume discounts, custom model configuration, and dedicated SLAs.

Please contact sales to upgrade to the Enterprise Tier.

Provider | Model | Price per Million Tokens (Input / Output)
Arcee AI | Arcee-Blitz | $0.03 / $0.05
Arcee AI | Virtuoso-Medium | $0.09 / $0.35
Arcee AI | Virtuoso-Large | $0.12 / $0.50
Anthropic | Sonnet-3.7 | $3.00 / $15.00
OpenAI | GPT-4o | $2.50 / $10.00
Google | Gemini-2.0-Flash | $0.10 / $0.40

Frequently Asked Questions

Why is model routing important for your business?

Currently, companies face significant challenges:

  • Foundation model performance varies significantly and in unintuitive ways
  • High-margin AI products are creating financial pressure on traditional SaaS businesses
  • Teams cannot realistically track which model performs best for each specific prompt

While selecting the best model for your task improves response quality, that alone isn't enough to solve these challenges. This is why intelligent model routing is essential. It automatically selects the optimal model for each prompt, helping you reclaim your profit margins without sacrificing quality.

What exactly is Arcee Conductor?

Arcee Conductor is an intelligent model-agnostic platform that directs each input to its ideal AI model based on complexity, domain, cost, and other requirements. By dynamically routing between large language models (LLMs) and small language models (SLMs), Conductor maximizes cost efficiency without compromising performance. You get the right model for each prompt, every time. 

What models power Arcee Conductor?

In Arcee Conductor, your prompt is automatically routed to the most suitable model through an advanced routing mechanism.

Available models include:
Arcee AI SLMs: Virtuoso-Large, Virtuoso-Medium, Blitz
LLMs: Claude-Sonnet-3.7, GPT-4o, Gemini-2.0-Flash

What tiers are available on Arcee Conductor?

We offer two primary tiers:

  • Standard Tier: This is the default tier when you sign up. It lets you prompt directly in the interface (or use an API) while Arcee Conductor Auto mode intelligently selects the optimal models from our available model list, based on complexity and efficiency.
  • Enterprise Tier: This tier unlocks additional Arcee Conductor options, such as volume discounts, custom model configuration, and dedicated SLAs. Please contact sales to upgrade to the Enterprise Tier.

How do I get started with Arcee Conductor?

You can sign up here to begin using Arcee Conductor today. First-time users receive $200 in credits (equivalent to approximately 400 million tokens) toward their Conductor usage.

How does Arcee Conductor's billing work for the Standard Tier?

Arcee Conductor uses a usage-based pricing model. When you sign up for Conductor for the first time, you'll automatically receive $200 in credits, so you can start using the platform immediately.

After you've used your $200 in credits, charges are applied to your payment method based on your usage. Rates are charged per token and vary by model.

What are the specific rates per token in Arcee Conductor?

Model | Input Tokens (per million tokens) | Output Tokens (per million tokens)
Arcee-Blitz (General Purpose) | $0.03 | $0.05
Virtuoso-Medium (General Purpose) | $0.09 | $0.35
Virtuoso-Large (General Purpose) | $0.12 | $0.50
Claude Sonnet 3.7 | $3.00 | $15.00
Gemini 2.0 Flash | $0.10 | $0.40
GPT-4o | $2.50 | $10.00
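As a rough illustration of how usage-based billing adds up, the sketch below estimates the cost of a single request from the per-million-token rates listed above, and how many such requests a $200 credit would cover. The function names `request_cost` and `requests_per_credit` are hypothetical helpers, not part of any Arcee API, and the token counts are made-up examples. Actual billing may differ.

```python
# USD per million tokens as (input, output), copied from the rate table above.
RATES = {
    "Arcee-Blitz": (0.03, 0.05),
    "Virtuoso-Medium": (0.09, 0.35),
    "Virtuoso-Large": (0.12, 0.50),
    "Claude Sonnet 3.7": (3.00, 15.00),
    "Gemini 2.0 Flash": (0.10, 0.40),
    "GPT-4o": (2.50, 10.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-token rates."""
    input_rate, output_rate = RATES[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


def requests_per_credit(model: str, input_tokens: int, output_tokens: int,
                        credit: float = 200.0) -> int:
    """Whole number of identical requests that a credit balance would cover."""
    return int(credit // request_cost(model, input_tokens, output_tokens))


# Example: a request with 1,000 input tokens and 500 output tokens.
for name in ("Virtuoso-Large", "GPT-4o"):
    cost = request_cost(name, 1_000, 500)
    count = requests_per_credit(name, 1_000, 500)
    print(f"{name}: ${cost:.5f} per request, {count:,} requests per $200")
```

For reference, $200 spread over roughly 400 million tokens works out to about $0.50 per million tokens, so how far the credit actually goes depends on which models your prompts are routed to and on your mix of input and output tokens.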

Industry-leading SLMs & LLMs with unified API access, inference, and intelligent routing, all in one solution.

First-time users of Arcee Conductor can get started with a one-time $200 credit (~400 million tokens). Optimize output across models, reduce usage costs, and maximize performance with intelligent routing.
