CollectionLoRA tackles a quietly painful problem in customized image editing: as the number of effect LoRAs you want to ship grows, storing and dynamically loading them all turns into deployment overhead nobody planned for. Its answer is to fold up to 50 different effect LoRAs — plus few-step generation — into a single LoRA.
## Three ideas, one adapter
The framework uses multi-teacher on-policy distillation, with three technical pieces. Probabilistic Dual-Stream Routing lets the model switch between data sources during training, so the student isn’t trapped tracking one teacher at a time. Asymmetric Orthogonal Prompting isolates concepts inside the prompt space, so 50 effects don’t bleed into each other. A Coarse-to-Fine Distillation Objective smooths the distribution gap that usually opens up between teachers and a single compressed student.
## Why it matters
The paper reports the distilled single LoRA matches or beats independently trained teacher LoRAs on concept fidelity — while killing the load-and-swap overhead. For anyone shipping a creative tool with many user-selectable effects, that’s the difference between a 50-file mess and a single adapter you ship once. As image and video models become product surfaces that need lots of small style controls, packing many adapters into one is the kind of plumbing that decides whether the product feels fast or sluggish.

Leave a comment