Easily combine models with Arcee MergeKit to create a single, more effective model.

MergeKit is an industry-leading tool for merging pre-trained models into a single, more efficient model without additional training. The original capabilities of the input models are preserved while performance and versatility are enhanced.

Get started with the open source library.

Learn about MergeKit Enterprise license

The power of model merging

Merges can run entirely on CPU, or be accelerated on a GPU with as little as 8 GB of VRAM. The resulting model can outperform its input models, so you improve the performance and accuracy of your models in a cost-efficient and time-efficient way.

Here at Arcee AI, we use model merging as one of the key techniques in the world-class proprietary pipeline we developed to train the small language models (SLMs) that power Arcee Orchestra.

Training a model with MergeKit

Step 1

Start with a pre-trained open-source model, then train it on your data.

Step 2

Next, merge that model with another pre-trained open-source model.

Result

The output “merged” model has the same size and inference cost as a single input model, but has the “brains” of both models.
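To illustrate why the merged model stays the same size as one input, here is a minimal sketch of the simplest kind of merge, a weighted average of matching parameters. It uses plain Python lists as a stand-in for real tensors. The names `linear_merge`, `parameter_count`, and the toy models are hypothetical and are not part of the MergeKit API, which offers more sophisticated merge methods.

```python
from typing import Dict, List

# Toy stand-in for a model checkpoint: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def linear_merge(a: Weights, b: Weights, weight_a: float = 0.5) -> Weights:
    """Interpolate two models with identical architectures, parameter by parameter."""
    if a.keys() != b.keys():
        raise ValueError("models must have identical parameter names")
    merged: Weights = {}
    for name in a:
        if len(a[name]) != len(b[name]):
            raise ValueError(f"shape mismatch for {name}")
        merged[name] = [
            weight_a * x + (1.0 - weight_a) * y
            for x, y in zip(a[name], b[name])
        ]
    return merged


def parameter_count(model: Weights) -> int:
    """Total number of scalar parameters in the toy model."""
    return sum(len(values) for values in model.values())


# Hypothetical toy models: two "layers" with a handful of parameters each.
fine_tuned = {"layer0.weight": [1.0, 2.0, 3.0], "layer1.weight": [4.0, 6.0]}
open_model = {"layer0.weight": [3.0, 2.0, 1.0], "layer1.weight": [0.0, 2.0]}

merged = linear_merge(fine_tuned, open_model)
print(merged)
print(parameter_count(merged), parameter_count(fine_tuned))
```

Because every merged parameter is computed from the two matching input parameters, the output has exactly as many parameters as either input, which is why inference cost does not grow.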

When you merge models that have been trained on the same tasks

Better performance and improved out-of-domain generalization

When you merge models that have been trained on different tasks

Synergistic effects can boost task performance above the level of the input models
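One well-known way to combine models trained on different tasks is task arithmetic: each fine-tuned model's difference from a shared base is treated as a "task vector", and the vectors are added back onto the base. The sketch below assumes all models were fine-tuned from the same base and uses plain lists instead of tensors. The function name, the `scale` parameter, and the toy models are hypothetical, and this is not necessarily the method behind the results described above.

```python
from typing import Dict, List

# Toy stand-in for a model checkpoint: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def task_arithmetic_merge(
    base: Weights, fine_tuned_models: List[Weights], scale: float = 1.0
) -> Weights:
    """Add each model's delta from the base (its task vector) onto the base weights."""
    merged: Weights = {}
    for name, base_values in base.items():
        combined = list(base_values)  # start from a copy of the base parameters
        for model in fine_tuned_models:
            for i, (tuned, original) in enumerate(zip(model[name], base_values)):
                combined[i] += scale * (tuned - original)
        merged[name] = combined
    return merged


# Hypothetical example: two fine-tunes that changed different parameters.
base = {"w": [1.0, 1.0, 1.0]}
math_model = {"w": [2.0, 1.0, 1.0]}  # fine-tuning moved the first parameter
code_model = {"w": [1.0, 1.0, 3.0]}  # fine-tuning moved the last parameter

result = task_arithmetic_merge(base, [math_model, code_model])
print(result)
```

In this toy case the two fine-tunes touched different parameters, so the merged model keeps both changes. In real models the task vectors overlap, and the `scale` factor is one knob for limiting interference between them.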

MergeKit for Enterprises

With more than 5,000 stars on GitHub, MergeKit is the premier tool for merging pre-trained models, and offers unparalleled flexibility and efficiency in model integration.

While the library itself is a powerful resource, achieving optimal merging results requires deep expertise and precision, so we also offer a MergeKit Enterprise License (BSL) that includes direct access to our team, including the creator of MergeKit, Charles Goddard. Get in touch for licensing details.


Frequently Asked Questions

What are the benefits of using MergeKit?

MergeKit is a great tool for combining the strengths of multiple language models. With MergeKit, you can train a model on just the data that matters to you, then add the capabilities of cutting-edge open-source models without further training. This brings efficiency and cost savings, and also reduces complexity in your LLM pipeline.

What models work with MergeKit?

MergeKit supports most decoder-only language models implemented in the transformers library. MergeKit features first-class support for Llama, Mistral, Phi-2, StableLM, GPT-NeoX, Mixtral, and many more.

Will MergeKit remain open source?

Yes. MergeKit will remain open source, and we will keep extending it to be the most powerful toolkit in the world for model merging.

How do I incorporate MergeKit into my Arcee AI workflow?

MergeKit runs under the hood of the Arcee product, in our Adaptation and Alignment layers.

Can I see some examples of how to implement MergeKit?

Absolutely. The examples folder in the repo contains a variety of examples.

How do I get started using MergeKit?

The easiest way to get started with model merging and MergeKit is to follow the installation and getting-started guide in the open source repo.

Ready to see more?

Contact us for a custom demo.

Talk to Us