view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 165
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model Paper • 2305.14014 • Published May 23, 2023
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models Paper • 2305.18010 • Published May 29, 2023
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data Paper • 2504.09895 • Published Apr 14 • 1
Protecting Copyrighted Material with Unique Identifiers in Large Language Model Training Paper • 2403.15740 • Published Mar 23, 2024
CenterCLIP: Token Clustering for Efficient Text-Video Retrieval Paper • 2205.00823 • Published May 2, 2022
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data Paper • 2504.09895 • Published Apr 14 • 1
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published Feb 24 • 79