Creating software for automatic monitoring in online proctoring
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
A curated list of awesome vision and language resources (still under con...
[arXiv 2023] PointLLM: Empowering Large Language Models to Understand Po...
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomp...
Conceptual 12M is a dataset containing (image-URL, caption) pairs collec...
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-H...
HPT - Open Multimodal LLMs from HyperGAI
This repo lists relevant papers summarized in our survey paper: A Syste...
Implementation of 'X-Linear Attention Networks for Image Captioning' [CV...
Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learn...
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for...
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robus...
Official Implementation of "GiT: Towards Generalist Vision Transformer t...
This repository is a curated collection of the most exciting and influen...