GPT4Tools currently supports the following 22 tools:
| No. | Tool Name | Function |
|-----|:-----------------------:|:------------------------------------------------------------------------------:|
| 1 | InstructPix2Pix | Restyle the image to match the text instruction. |
| 2 | Text2Image | Generate an image from an input text. |
| 3 | ImageCaptioning | Describe the input image. |
| 4 | Image2Canny | Detect the edges of the image. |
| 5 | CannyText2Image | Generate a new real image from both the user description and a canny image. |
| 6 | Image2Line | Detect the straight lines in the image. |
| 7 | Image2Hed | Detect the soft HED boundary of the image. |
| 8 | HedText2Image | Generate a new real image from both the user description and a soft HED boundary image. |
| 9 | Image2Scribble | Generate a scribble of the image. |
| 10 | ScribbleText2Image | Generate a new real image from both the user description and a scribble image. |
| 11 | Image2Pose | Detect the human pose of the image. |
| 12 | PoseText2Image | Generate a new real image from both the user description and a human pose image. |
| 13 | SegText2Image | Generate a new real image from both the user description and segmentations. |
| 14 | Image2Depth | Detect the depth of the image. |
| 15 | DepthText2Image | Generate a new real image from both the user description and a depth image. |
| 16 | Image2Normal | Detect the normal map of the image. |
| 17 | NormalText2Image | Generate a new real image from both the user description and a normal map. |
| 18 | VisualQuestionAnswering | Answer a question based on an image. |
| 19 | Segmenting | Segment all parts of the image. |
| 20 | Text2Box | Detect or find out given objects in the picture. |
| 21 | ObjectSegmenting | Segment the specified objects in the picture. |
| 22 | ImageEditing | Remove an object from the photo. |
You can choose which tools are loaded by passing a comma-separated list of ```{tool_name}_{device}``` entries to the ```--load``` argument of ```gpt4tools.py```. For example, to enable ```Text2Box```, ```Segmenting```, and ```ImageCaptioning``` on the first GPU while the language model runs on the CPU:
```
python gpt4tools.py \
--base_model <path_to_vicuna_with_tokenizer> \
--lora_model <path_to_lora_weights> \
--llm_device "cpu" \
--load "Text2Box_cuda:0,Segmenting_cuda:0,ImageCaptioning_cuda:0"
```
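When many tools are spread across several devices, it can help to build or check the ```--load``` value programmatically. The following is a minimal sketch, not code from ```gpt4tools.py```: the helper names ```parse_load_arg``` and ```build_load_arg``` are hypothetical, and it assumes only what the example above shows, namely that entries are separated by commas and that the device follows the last underscore of each entry.

```python
from typing import Dict


def parse_load_arg(load: str) -> Dict[str, str]:
    """Split a --load style string into a {tool_name: device} mapping.

    Assumption: each entry looks like '<tool_name>_<device>' and the device
    (e.g. 'cpu' or 'cuda:0') contains no underscore.
    """
    tools: Dict[str, str] = {}
    for entry in load.split(","):
        entry = entry.strip()
        if not entry:
            continue  # tolerate stray or trailing commas
        name, sep, device = entry.rpartition("_")
        if not sep or not name or not device:
            raise ValueError(f"expected '<tool_name>_<device>', got {entry!r}")
        tools[name] = device
    return tools


def build_load_arg(tools: Dict[str, str]) -> str:
    """Join a {tool_name: device} mapping back into a --load style string."""
    return ",".join(f"{name}_{device}" for name, device in tools.items())


if __name__ == "__main__":
    selection = {
        "Text2Box": "cuda:0",
        "Segmenting": "cuda:0",
        "ImageCaptioning": "cpu",
    }
    print(build_load_arg(selection))
    # Text2Box_cuda:0,Segmenting_cuda:0,ImageCaptioning_cpu
```

The string printed by the sketch can be pasted as the value of ```--load```. Validating entries up front surfaces a malformed entry before any model weights are loaded.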
More tools will be supported in the future!