RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions Paper • 2506.03448 • Published Jun 3 • 4
Steering Rectified Flow Models in the Vector Field for Controlled Image Generation Paper • 2412.00100 • Published Nov 27, 2024 • 16
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Paper • 2411.02545 • Published Nov 4, 2024 • 1
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Paper • 2204.07705 • Published Apr 16, 2022 • 2
$λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space Paper • 2402.05195 • Published Feb 7, 2024 • 19
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations Paper • 2312.04655 • Published Dec 7, 2023 • 21
CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering Paper • 2211.03779 • Published Nov 7, 2022 • 1
ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models Paper • 2306.04695 • Published Jun 7, 2023 • 1
WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models Paper • 2306.04744 • Published Jun 7, 2023 • 1