RangiLyu committed
Commit 579d11f · 1 Parent(s): d1b2b44

update readme

Files changed (1)
  1. README.md +6 -5
README.md CHANGED
@@ -1,6 +1,6 @@
-# 🤖 Multi-modality GPT
+# 🤖 Multi-modal GPT
 
-Train a multi-modality chatbot with visual and language instructions!
+Train a multi-modal chatbot with visual and language instructions!
 
 Based on the open-source multi-modal model [OpenFlamingo](https://github.com/mlfoundations/open_flamingo), we create various **visual instruction** data with open datasets, including VQA, Image Captioning, Visual Reasoning, Text OCR, and Visual Dialogue. Additionally, we also train the language model component of OpenFlamingo using only **language-only instruction** data.
 
@@ -37,7 +37,7 @@ conda env create -f environment.yml
 
 Download the OpenFlamingo pre-trained model from [openflamingo/OpenFlamingo-9B](https://huggingface.co/openflamingo/OpenFlamingo-9B)
 
-Download our LoRA Weight from [here](TODO)
+Download our LoRA Weight from [here](https://download.openmmlab.com/mmgpt/v0/mmgpt-lora-v0-release.pt)
 
 Then place these models in checkpoints folders like this:
 
@@ -61,7 +61,8 @@ conda env create -f environment.yml
 # Examples
 
 ### Recipe:
-![image4](https://user-images.githubusercontent.com/12907710/234523451-51b35c99-67ce-43d4-a498-f2a71eaf9cb7.png)
+![image4](https://user-images.githubusercontent.com/12907710/234554562-8f3be88f-d563-47ba-97d9-ade8d47c46b0.png)
+
 ### Travel plan:
 ![image3](https://user-images.githubusercontent.com/12907710/234523464-80c4e3f0-f99f-4498-96ef-dc43ef89c64b.png)
 ### Movie:
@@ -135,4 +136,4 @@ torchrun --nproc_per_node=8 mmgpt/train/instruction_finetune.py \
 - [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca)
 - [MiniGPT-4](https://github.com/Vision-CAIR/MiniGPT-4)
 - [LLaVA](https://github.com/haotian-liu/LLaVA/tree/main)
-- [Instruction Tuning with GPT-4](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM)
+- [Instruction Tuning with GPT-4](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM)
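
The updated download step now points at a released LoRA checkpoint instead of a TODO placeholder. Below is a minimal sketch of how those weights could be fetched locally; the `checkpoints/` directory layout is an assumption here, since the diff truncates the folder listing from the README.

```bash
# Sketch only: fetch the weights referenced in the updated README.
# The checkpoints/ layout below is an assumption; consult the full README
# for the exact directory structure it expects.
mkdir -p checkpoints

# LoRA weights (URL taken from this commit)
wget -P checkpoints https://download.openmmlab.com/mmgpt/v0/mmgpt-lora-v0-release.pt

# OpenFlamingo-9B pre-trained model from the Hugging Face Hub (requires git-lfs)
git lfs install
git clone https://huggingface.co/openflamingo/OpenFlamingo-9B checkpoints/OpenFlamingo-9B
```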