Huanzhi Mao
committed on
Commit
·
2b538b7
1 Parent(s): 383da93
update description
app.py
CHANGED
|
@@ -1029,7 +1029,7 @@ with gr.Blocks() as demo:
|
|
| 1029 |
"**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
|
| 1030 |
)
|
| 1031 |
gr.Markdown(
|
| 1032 |
-
"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation
|
| 1033 |
|
| 1034 |
**FC = native support for function/tool calling.**
|
| 1035 |
|
|
@@ -1046,7 +1046,7 @@ with gr.Blocks() as demo:
|
|
| 1046 |
"**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
|
| 1047 |
)
|
| 1048 |
gr.Markdown(
|
| 1049 |
-
"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation
|
| 1050 |
|
| 1051 |
**FC = native support for function/tool calling.**
|
| 1052 |
|
|
@@ -1064,8 +1064,8 @@ with gr.Blocks() as demo:
|
|
| 1064 |
|
| 1065 |
We provide a short summary here. For more details, please refer to our release [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html):
|
| 1066 |
|
| 1067 |
-
**AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation
|
| 1068 |
-
|
| 1069 |
**Cost** is calculated as an estimate of the cost per 1000 function calls, in USD.
|
| 1070 |
|
| 1071 |
**Latency** is measured in seconds.
|
|
|
|
| 1029 |
"**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
|
| 1030 |
)
|
| 1031 |
gr.Markdown(
|
| 1032 |
+
"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
|
| 1033 |
|
| 1034 |
**FC = native support for function/tool calling.**
|
| 1035 |
|
|
|
|
| 1046 |
"**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
|
| 1047 |
)
|
| 1048 |
gr.Markdown(
|
| 1049 |
+
"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
|
| 1050 |
|
| 1051 |
**FC = native support for function/tool calling.**
|
| 1052 |
|
|
|
|
| 1064 |
|
| 1065 |
We provide a short summary here. For more details, please refer to our release [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html):
|
| 1066 |
|
| 1067 |
+
**AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation by executing all the API calls the LLM generates.
|
| 1068 |
+
|
| 1069 |
**Cost** is calculated as an estimate of the cost per 1000 function calls, in USD.
|
| 1070 |
|
| 1071 |
**Latency** is measured in seconds.
|
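Note on the updated description: the difference between **AST** and **Exec** evaluation can be sketched roughly as below. This is a minimal illustration only, not the leaderboard's actual checker. The helper names (`parse_call`, `ast_check`, `exec_check`) and the toy `add` function are hypothetical, and the sketch assumes the model output is a single Python-style call with keyword arguments.

```python
import ast


def parse_call(source: str):
    """Parse one keyword-only call expression into (function_name, kwargs)."""
    node = ast.parse(source, mode="eval").body
    if not isinstance(node, ast.Call) or not isinstance(node.func, ast.Name):
        raise ValueError("expected a single call to a plain function name")
    if node.args:
        raise ValueError("this sketch only handles keyword arguments")
    # literal_eval accepts AST nodes and only allows literal values.
    kwargs = {kw.arg: ast.literal_eval(kw.value) for kw in node.keywords}
    return node.func.id, kwargs


def ast_check(model_output: str, expected_name: str, expected_kwargs: dict) -> bool:
    """AST-style check: compare the parsed call structure, run nothing."""
    try:
        name, kwargs = parse_call(model_output)
    except (SyntaxError, ValueError):
        return False
    return name == expected_name and kwargs == expected_kwargs


def exec_check(model_output: str, functions: dict, expected_result) -> bool:
    """Exec-style check: actually run the call and compare the result."""
    try:
        name, kwargs = parse_call(model_output)
        return functions[name](**kwargs) == expected_result
    except Exception:
        return False


def add(a, b):
    # Hypothetical stand-in for a real API.
    return a + b


call = "add(a=2, b=3)"
ast_ok = ast_check(call, "add", {"a": 2, "b": 3})
exec_ok = exec_check(call, {"add": add}, 5)
```

In this sketch, a call such as `add(a=1, b=4)` would fail the AST check against the expected arguments but still pass the Exec check, since only the executed result is compared.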