File size: 1,339 Bytes
5b5e46d
 
9613bfb
 
5b5e46d
 
9613bfb
 
 
 
 
 
 
5b5e46d
 
 
 
 
 
 
 
 
 
 
 
 
a724093
 
 
 
 
 
5b5e46d
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
license: apache-2.0
base_model:
- lmms-lab/LLaVA-Video-7B-Qwen2
---

<a href='https://arxiv.org/abs/2504.12083'><img src='https://img.shields.io/badge/arXiv-paper-red'></a> 
<a href='https://pritamqu.github.io/RRPO/'><img src='https://img.shields.io/badge/project-RRPO-blue'></a> 
<a href='https://huggingface.co/datasets/pritamqu/self-alignment'><img src='https://img.shields.io/badge/huggingface-datasets-green'></a> 
<a href='https://huggingface.co/collections/pritamqu/rrpo-67fbc8c048b298a5fdfb167b'><img src='https://img.shields.io/badge/model-checkpoints-yellow'></a> 
</a><a href='https://github.com/pritamqu/RRPO'><img src='https://img.shields.io/badge/github-repository-purple'></a> 


## Installation

Clone the repository and navigate to the RRPO directory:

```sh
git clone https://github.com/pritamqu/RRPO
cd RRPO

conda create -n llava python=3.10 -y
conda activate llava
pip install -r llavavideo.txt
```

## Download weights

```
git clone [email protected]:pritamqu/LLaVA-Video-7B-Qwen2-RRPO-32f
```

## Inference

```
conda activate llava
WEIGHTS_ROOT="./"

## using merged weights
python inference.py \
    --base_model_name "llavavideo_qwen_7b" \
    --model-path ${WEIGHTS_ROOT}"/LLaVA-Video-7B-Qwen2-RRPO-32f" \
    --video_path "sample_video.mp4" \
    --question "Describe this video." \
    --model_max_length 1024
```