Spaces:

AnmolManocha
/

GPT4Tools

Configuration error

App Files Files Community

GPT4Tools / docs /tools.md

AnmolManocha

Initial

4f242c9 over 2 years ago

preview code

raw

history blame contribute delete

3.26 kB

	GPT4Tools can support 22 tools, including:

	\| No. \| Tools Name \| Function \|
	\|-----\|:-----------------------:\|:------------------------------------------------------------------------------:\|
	\| 1 \| InstructPix2Pix \| Style the image to be like the text. \|
	\| 2 \| Text2Image \| Generate an image from an input text. \|
	\| 3 \| ImageCaptioning \| Describe the input image. \|
	\| 4 \| Image2Canny \| Detect the edge of the image \|
	\| 5 \| CannyText2Image \| Generate a new real image from both the user description and a canny image. \|
	\| 6 \| Image2Line \| Detect the straight line of the image. \|
	\| 7 \| Image2Hed \| Detect the soft hed boundary of the image. \|
	\| 8 \| HedText2Image \| Generate a new real image from both the user description. \|
	\| 9 \| Image2Scribble \| Generate a scribble of the image. \|
	\| 10 \| ScribbleText2Image \| Generate a new real image from both the user description and a scribble image. \|
	\| 11 \| Image2Pose \| Detect the human pose of the image. \|
	\| 12 \| PoseText2Image \| Generate a new real image from both the user description. \|
	\| 13 \| SegText2Image \| Generate a new real image from both the user description and segmentations. \|
	\| 14 \| Image2Depth \| Detect depth of the image. \|
	\| 15 \| DepthText2Image \| Generate a new real image from both the user description and depth image. \|
	\| 16 \| Image2Normal \| Detect norm map of the image. \|
	\| 17 \| NormalText2Image \| Generate a new real image from both the user description and normal map. \|
	\| 18 \| VisualQuestionAnswering \| Answer for a question based on an image. \|
	\| 19 \| Segmenting \| Segment all the part of the image. \|
	\| 20 \| Text2Box \| Detect or find out given objects in the picture. \|
	\| 21 \| ObjectSegmenting \| Segment the certain objects in the picture. \|
	\| 22 \| ImageEditing \| Remove and object or something from the photo. \|

	You can customize the used tools by specifying ```{tools_name}_{devices}``` after args ```--load``` of ```gpt4tools.py```. For example, enabling ```Text2Box```, ```Segmenting```, and ```ImageCaptioning```:
	```
	python gpt4tools.py \
	--base_model <path_to_vicuna_with_tokenizer> \
	--lora_model <path_to_lora_weights> \
	--llm_device "cpu" \
	--load "Text2Box_cuda:0,Segmenting_cuda:0,ImageCaptioning_cuda:0"
	```
	More tools will be supported in the future!

	GPT4Tools can support 22 tools, including:

	\| No. \| Tools Name \| Function \|
	\|-----\|:-----------------------:\|:------------------------------------------------------------------------------:\|
	\| 1 \| InstructPix2Pix \| Style the image to be like the text. \|
	\| 2 \| Text2Image \| Generate an image from an input text. \|
	\| 3 \| ImageCaptioning \| Describe the input image. \|
	\| 4 \| Image2Canny \| Detect the edge of the image \|
	\| 5 \| CannyText2Image \| Generate a new real image from both the user description and a canny image. \|
	\| 6 \| Image2Line \| Detect the straight line of the image. \|
	\| 7 \| Image2Hed \| Detect the soft hed boundary of the image. \|
	\| 8 \| HedText2Image \| Generate a new real image from both the user description. \|
	\| 9 \| Image2Scribble \| Generate a scribble of the image. \|
	\| 10 \| ScribbleText2Image \| Generate a new real image from both the user description and a scribble image. \|
	\| 11 \| Image2Pose \| Detect the human pose of the image. \|
	\| 12 \| PoseText2Image \| Generate a new real image from both the user description. \|
	\| 13 \| SegText2Image \| Generate a new real image from both the user description and segmentations. \|
	\| 14 \| Image2Depth \| Detect depth of the image. \|
	\| 15 \| DepthText2Image \| Generate a new real image from both the user description and depth image. \|
	\| 16 \| Image2Normal \| Detect norm map of the image. \|
	\| 17 \| NormalText2Image \| Generate a new real image from both the user description and normal map. \|
	\| 18 \| VisualQuestionAnswering \| Answer for a question based on an image. \|
	\| 19 \| Segmenting \| Segment all the part of the image. \|
	\| 20 \| Text2Box \| Detect or find out given objects in the picture. \|
	\| 21 \| ObjectSegmenting \| Segment the certain objects in the picture. \|
	\| 22 \| ImageEditing \| Remove and object or something from the photo. \|

	You can customize the used tools by specifying ```{tools_name}_{devices}``` after args ```--load``` of ```gpt4tools.py```. For example, enabling ```Text2Box```, ```Segmenting```, and ```ImageCaptioning```:
	```
	python gpt4tools.py \
	--base_model <path_to_vicuna_with_tokenizer> \
	--lora_model <path_to_lora_weights> \
	--llm_device "cpu" \
	--load "Text2Box_cuda:0,Segmenting_cuda:0,ImageCaptioning_cuda:0"
	```
	More tools will be supported in the future!