You can fine-tune language models like Llama 2 or image models like SDXL with your own data on Replicate. If you don't make any requests to your fine-tuned model for a while, it can take some time to start again. This is called a cold boot, and can be as slow as a few minutes for large models.
We've made some dramatic improvements to cold boots for fine-tuned models. They now boot in less than one second.
It works on these models:
For now, it's available only for new fine-tuned models created starting today. We're also working on a more cold boot improvements for all models. Stay tuned.
To get started, check out these guides:
Let's go. 🚀