This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
Key metrics and engagement data
Repository has been active for 1 year
⭐25
Want deeper insights? Explore GitObs.com
This repository does not have a README file.