Unify Efficient Fine-Tuning of 100+ LLMs
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V...
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、Min...
Instruction Tuning with GPT-4
Aligning pretrained language models with instruction data generated by t...
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Video-LLaVA: Learning United Visual Representation by Alignment Before P...
We unified the interfaces of instruction-tuning data (e.g., CoT data), m...
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
A one-stop data processing system to make data higher-quality, juicier, ...
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Video Foundation Models & Data for Multimodal Understanding
An Open-sourced Knowledgable Large Language Model Framework.
A collection of open-source dataset to train instruction-following LLMs ...