File size: 3,260 Bytes
4f242c9
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
GPT4Tools can support 22 tools, including:

| No. |        Tools Name       |                                    Function                                    |
|-----|:-----------------------:|:------------------------------------------------------------------------------:|
|  1  |     InstructPix2Pix     | Style the image to be like the text.                                           |
|  2  |        Text2Image       | Generate an image from an input text.                                          |
|  3  |     ImageCaptioning     | Describe the input image.                                                      |
|  4  |       Image2Canny       | Detect the edge of the image                                                   |
|  5  |     CannyText2Image     | Generate a new real image from both the user description and a canny image.    |
|  6  |        Image2Line       | Detect the straight line of the image.                                         |
|  7  |        Image2Hed        | Detect the soft hed boundary of the image.                                     |
|  8  |      HedText2Image      | Generate a new real image from both the user description.                      |
|  9  |      Image2Scribble     | Generate a scribble of the image.                                              |
|  10 |    ScribbleText2Image   | Generate a new real image from both the user description and a scribble image. |
|  11 |        Image2Pose       | Detect the human pose of the image.                                            |
|  12 |      PoseText2Image     | Generate a new real image from both the user description.                      |
|  13 |      SegText2Image      | Generate a new real image from both the user description and segmentations.    |
|  14 |       Image2Depth       | Detect depth of the image.                                                     |
|  15 |     DepthText2Image     | Generate a new real image from both the user description and depth image.      |
|  16 |       Image2Normal      | Detect norm map of the image.                                                  |
|  17 |     NormalText2Image    | Generate a new real image from both the user description and normal map.       |
|  18 | VisualQuestionAnswering | Answer for a question based on an image.                                       |
|  19 |        Segmenting       | Segment all the part of the image.                                             |
|  20 |         Text2Box        | Detect or find out given objects in the picture.                               |
|  21 |     ObjectSegmenting    | Segment the certain objects in the picture.                                    |
|  22 |       ImageEditing      | Remove and object or something from the photo.                                 |

You can customize the used tools by specifying ```{tools_name}_{devices}``` after args ```--load``` of ```gpt4tools.py```. For example, enabling ```Text2Box```, ```Segmenting```, and ```ImageCaptioning```:
```
python gpt4tools.py \
	--base_model <path_to_vicuna_with_tokenizer> \
	--lora_model <path_to_lora_weights> \
	--llm_device "cpu" \
	--load "Text2Box_cuda:0,Segmenting_cuda:0,ImageCaptioning_cuda:0"
```
More tools will be supported in the future!