MergeKit changed the game for model merging, and today we're excited to bring you major updates in a release we're calling MergeKit v0.1. Starting today, you can get more out of model merging than ever, with enterprise hosting, premium features, and expert support.
It's been just over a year since Arcee AI acquired MergeKit and joined forces with its creator, Charles Goddard. Since then, we've had an incredible year of constant innovation, collaboration with the open-source community, and productizing model merging as we built out our world-class model training pipeline.
To mark the one-year anniversary of Arcee AI + MergeKit, we're bringing you the most significant updates to MergeKit to date. Check them out and let us know what you think and what you build. And as always, happy merging!
Meet MergeKit v0.1, which dramatically expands the range of models you can merge. You are no longer limited to the specific architectures that MergeKit explicitly supports. This release introduces two major improvements:
This release also introduces the public availability of Arcee Fusion, a sophisticated merging method previously used internally to develop our Supernova, Medius, and Virtuoso series models. Arcee Fusion takes a more intelligent approach to merging, focusing on the importance of differences between models rather than simply merging everything indiscriminately. Arcee Fusion works in three key stages:
Arcee Fusion avoids the pitfalls of over-updating that can occur with simple averaging, providing a more refined and controlled merging experience. You can activate this powerful new method by specifying merge_method: arcee_fusion in your merge configuration file.
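To make the configuration step concrete, here is a minimal sketch that generates such a merge configuration file as YAML text. Only the `merge_method: arcee_fusion` line comes from this post. The other fields (`base_model`, `models`, `dtype`) are assumed from MergeKit's usual configuration layout, and the model names and the helper `arcee_fusion_config` are hypothetical placeholders.

```python
def arcee_fusion_config(base_model: str, other_model: str,
                        dtype: str = "bfloat16") -> str:
    """Return YAML text for a two-model merge that selects Arcee Fusion.

    Assumption: apart from merge_method, the field names below follow
    MergeKit's usual config layout and are not taken from this post.
    """
    lines = [
        "merge_method: arcee_fusion",    # the setting named in this post
        f"base_model: {base_model}",     # assumed field: model being updated
        "models:",
        f"  - model: {other_model}",     # assumed field: model merged into the base
        f"dtype: {dtype}",               # assumed field: output precision
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical model names, for illustration only.
    print(arcee_fusion_config("example-org/base-model",
                              "example-org/finetuned-model"))
```

Writing the returned text to a file such as `fusion.yml` would give you a config to adapt. Check the field names against the MergeKit documentation before relying on them.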
MergeKit v0.1 introduces a new --parallel flag for multi-GPU execution. If you have access to a multi-GPU environment, this flag will unlock a near-linear speedup for your merge operations. The --parallel flag is compatible with all merge methods and model types, significantly reducing merge times and boosting your productivity.
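As a rough illustration, the sketch below assembles a merge command that includes the `--parallel` flag. The flag itself is from this post. The `mergekit-yaml` entry point and the order of its positional arguments (config file, then output directory) are assumptions, and the file names and the helper `merge_command` are hypothetical.

```python
import shlex


def merge_command(config_path: str, output_dir: str,
                  parallel: bool = True) -> list[str]:
    """Build an argument list for a merge run.

    Assumption: the CLI is invoked as `mergekit-yaml <config> <output dir>`;
    only the --parallel flag is taken from this post.
    """
    cmd = ["mergekit-yaml", config_path, output_dir]
    if parallel:
        cmd.append("--parallel")  # multi-GPU execution, per this release
    return cmd


if __name__ == "__main__":
    # Print the command as a single shell-safe string for inspection.
    print(shlex.join(merge_command("fusion.yml", "./merged-model")))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` on a machine with MergeKit installed. Building it as a list rather than a single string avoids shell-quoting problems with paths that contain spaces.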
Finally, we want to address a change to our licensing model. While we are committed to open access, we also need to ensure the long-term sustainability of MergeKit's development. Therefore, we are transitioning to a Business Source License (BSL).
Why are we doing this?
The techniques and methods we've developed are valuable and unique. We believe in the power of open source and want the community to benefit from our work. However, unrestricted commercial use by large entities could jeopardize our ability to continue developing and improving MergeKit. The BSL is a balanced approach that allows us to share our innovations while protecting our long-term viability.
What does this mean for you?
For the vast majority of users (personal, research, and non-commercial), nothing changes. You retain unrestricted access to MergeKit. Even most commercial users will likely be unaffected. The BSL primarily applies to large corporations and highly successful startups using MergeKit in a production setting. If this applies to you, we'll simply need to discuss a commercial license (which includes direct access to Charles Goddard and the MergeKit development team).
We believe this approach strikes the right balance between fostering open innovation and ensuring the continued growth and development of MergeKit. We want to make it clear: we want you to use MergeKit! This licensing change is about ensuring we can continue to provide you with the best possible tool for model merging. You can find MergeKit v0.1 here, and to learn more about the BSL, drop a note to our team at licensing@arcee.ai.