arxiv:2503.02039

Dynamic Search for Inference-Time Alignment in Diffusion Models

Published on Mar 3

Authors:

Abstract

Diffusion models have shown promising generative capabilities across diverse domains, yet aligning their outputs with desired reward functions remains a challenge, particularly in cases where reward functions are non-differentiable. Some gradient-free guidance methods have been developed, but they often struggle to achieve optimal inference-time alignment. In this work, we newly frame inference-time alignment in diffusion as a search problem and propose Dynamic Search for Diffusion (DSearch), which subsamples from denoising processes and approximates intermediate node rewards. It also dynamically adjusts beam width and tree expansion to efficiently explore high-reward generations. To refine intermediate decisions, DSearch incorporates adaptive scheduling based on noise levels and a lookahead heuristic function. We validate DSearch across multiple domains, including biological sequence design, molecular optimization, and image generation, demonstrating superior reward optimization compared to existing approaches.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment

No model linking this paper

Cite arxiv.org/abs/2503.02039 in a model README.md to link it from this page.

No dataset linking this paper

Cite arxiv.org/abs/2503.02039 in a dataset README.md to link it from this page.

No Space linking this paper

Cite arxiv.org/abs/2503.02039 in a Space README.md to link it from this page.

No Collection including this paper

Add this paper to a collection to link it from this page.