Upload TrailRAG cross-encoder model for hotpotqa
- CECorrelationEvaluator_validation_results.csv +14 -66
- README.md +15 -15
- model.safetensors +1 -1
CECorrelationEvaluator_validation_results.csv
CHANGED
@@ -1,67 +1,15 @@
 epoch,steps,Pearson_Correlation,Spearman_Correlation
-0,
-
-1,
-
-
-
-0,
-
-1,
-
-
-
-
-
-4,50,nan,nan
-4,-1,nan,nan
-5,50,nan,nan
-5,-1,nan,nan
-6,50,nan,nan
-6,-1,nan,nan
-7,50,nan,nan
-7,-1,nan,nan
-0,50,0.8844942424785296,0.8857172465764791
-0,-1,0.8844942424785296,0.8857172465764791
-1,50,0.9336146534653291,0.8904310386001277
-1,-1,0.9336146534653291,0.8904310386001277
-2,50,0.9450708705265047,0.908718841785034
-2,-1,0.9450708705265047,0.908718841785034
-3,50,0.9478919301215523,0.9122450535593611
-3,-1,0.9478919301215523,0.9122450535593611
-4,50,0.9495609740799951,0.9147266921300965
-4,-1,0.9495609740799951,0.9147266921300965
-5,50,0.9509284655955195,0.914543270049704
-5,-1,0.9509284655955195,0.914543270049704
-0,50,0.8769600515545288,0.8871269174764043
-0,-1,0.8769600515545288,0.8871269174764043
-1,50,0.9245579489171054,0.8784258567954037
-1,-1,0.9245579489171054,0.8784258567954037
-2,50,0.9480165887969222,0.9301595056146273
-2,-1,0.9480165887969222,0.9301595056146273
-3,50,0.9509537570683666,0.9320302181621813
-3,-1,0.9509537570683666,0.9320302181621813
-4,50,0.9527921664806371,0.9309925694044452
-4,-1,0.9527921664806371,0.9309925694044452
-5,50,0.9526781526872711,0.9313792659915328
-5,-1,0.9526781526872711,0.9313792659915328
-6,50,0.954482858927449,0.9329198527954008
-6,-1,0.954482858927449,0.9329198527954008
-7,50,0.953667993155727,0.9318349325109867
-7,-1,0.953667993155727,0.9318349325109867
-0,50,0.9009366205573937,0.8841251669454295
-0,-1,0.9009366205573937,0.8841251669454295
-1,50,0.9377902395000197,0.8864781167682224
-1,-1,0.9377902395000197,0.8864781167682224
-2,50,0.9526408925989197,0.9106428337931949
-2,-1,0.9526408925989197,0.9106428337931949
-3,50,0.9601144587325283,0.920010369671858
-3,-1,0.9601144587325283,0.920010369671858
-4,50,0.9629702193217928,0.9225955082725437
-4,-1,0.9629702193217928,0.9225955082725437
-5,50,0.9614396022220375,0.9227764446780587
-5,-1,0.9614396022220375,0.9227764446780587
-6,50,0.9626509279130091,0.9245291205031555
-6,-1,0.9626509279130091,0.9245291205031555
-7,50,0.9617904585351846,0.9228447811745623
-7,-1,0.9617904585351846,0.9228447811745623
+0,-1,0.8888841799688185,0.8860855585221055
+1,-1,0.9433172575408377,0.899095726435546
+2,-1,0.9543818279076624,0.9182857047232711
+3,-1,0.9561063731084964,0.9276779309546022
+4,-1,0.9568838576527665,0.92585888012172
+5,-1,0.958280584064253,0.9264295627359576
+0,-1,0.8674377304858821,0.8760607745831603
+1,-1,0.9274483825359976,0.9092044306717953
+2,-1,0.9414549054805882,0.9326797719917711
+3,-1,0.9434662406344414,0.9349552144265474
+4,-1,0.9423602934547362,0.9294904476564604
+5,-1,0.9457376917330784,0.9352871141032617
+6,-1,0.9482051166603169,0.9371009844294911
+7,-1,0.9478484140539114,0.9367443852419516
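The new CSV restarts its epoch counter partway through (0 to 5, then 0 to 7), which suggests that results from more than one training run were appended to the same file. The following is a minimal Python sketch for separating those runs, assuming a run boundary is any row whose epoch does not increase over the previous row. `split_runs` is an illustrative helper, not something from this repository.

```python
import csv
import io

# New contents of CECorrelationEvaluator_validation_results.csv, copied from the diff.
NEW_CSV = """epoch,steps,Pearson_Correlation,Spearman_Correlation
0,-1,0.8888841799688185,0.8860855585221055
1,-1,0.9433172575408377,0.899095726435546
2,-1,0.9543818279076624,0.9182857047232711
3,-1,0.9561063731084964,0.9276779309546022
4,-1,0.9568838576527665,0.92585888012172
5,-1,0.958280584064253,0.9264295627359576
0,-1,0.8674377304858821,0.8760607745831603
1,-1,0.9274483825359976,0.9092044306717953
2,-1,0.9414549054805882,0.9326797719917711
3,-1,0.9434662406344414,0.9349552144265474
4,-1,0.9423602934547362,0.9294904476564604
5,-1,0.9457376917330784,0.9352871141032617
6,-1,0.9482051166603169,0.9371009844294911
7,-1,0.9478484140539114,0.9367443852419516
"""


def split_runs(text):
    """Group rows into runs (assumption: a run starts when the epoch does not increase)."""
    runs = []
    prev_epoch = None
    for row in csv.DictReader(io.StringIO(text)):
        epoch = int(row["epoch"])
        if prev_epoch is None or epoch <= prev_epoch:
            runs.append([])  # epoch counter reset: start a new run
        runs[-1].append(
            (
                epoch,
                float(row["Pearson_Correlation"]),
                float(row["Spearman_Correlation"]),
            )
        )
        prev_epoch = epoch
    return runs


runs = split_runs(NEW_CSV)
for i, run in enumerate(runs, start=1):
    best = max(run, key=lambda r: r[2])  # row with the highest Spearman value
    print(f"run {i}: {len(run)} epochs, best Spearman {best[2]:.6f} at epoch {best[0]}")
```

Under that assumption the file holds one 6-epoch run followed by one 8-epoch run. Whether the first block belongs to the model uploaded in this commit is not stated in the diff.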
README.md
CHANGED
@@ -21,17 +21,17 @@ model-index:
 type: hotpotqa
 metrics:
 - type: mse
-value: 0.
+value: 0.0557947916534922
 - type: mae
-value: 0.
+value: 0.1418474710541999
 - type: rmse
-value: 0.
+value: 0.2362092116186248
 - type: r2_score
-value: 0.
+value: 0.6484965021143569
 - type: pearson_correlation
-value: 0.
+value: 0.8754595236036868
 - type: spearman_correlation
-value: 0.
+value: 0.8618191776300459
 ---
 
 # TrailRAG Cross-Encoder: HotpotQA Enhanced
@@ -53,20 +53,20 @@ This is a fine-tuned cross-encoder model specifically optimized for **Multi-hop
 
 | Metric | Value | Description |
 |--------|-------|-------------|
-| **MSE** | **0.
-| **MAE** | **0.
-| **RMSE** | **0.
-| **R² Score** | **0.
-| **Pearson Correlation** | **0.
-| **Spearman Correlation** | **0.
+| **MSE** | **0.055795** | Mean Squared Error (lower is better) |
+| **MAE** | **0.141847** | Mean Absolute Error (lower is better) |
+| **RMSE** | **0.236209** | Root Mean Squared Error (lower is better) |
+| **R² Score** | **0.648497** | Coefficient of determination (higher is better) |
+| **Pearson Correlation** | **0.875460** | Linear correlation (higher is better) |
+| **Spearman Correlation** | **0.861819** | Rank correlation (higher is better) |
 
 ### Training Details
 
-- **Training Duration**:
+- **Training Duration**: 28 minutes
 - **Epochs**: 8
 - **Early Stopping**: No
-- **Best Correlation Score**: 0.
-- **Final MSE**: 0.
+- **Best Correlation Score**: 0.936744
+- **Final MSE**: 0.055795
 
 ### Training Configuration
 
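The numbers added to README.md can be cross-checked against each other. The sketch below uses only values copied from this diff and verifies three things: RMSE is the square root of MSE, the table values are the front-matter values rounded to six decimals, and the "Best Correlation Score" coincides with the Spearman value in the last row of the new CSV. That last match is an observation only; the diff does not say how the README value was derived.

```python
import math

# Front-matter metric values added to README.md in this commit.
front_matter = {
    "mse": 0.0557947916534922,
    "mae": 0.1418474710541999,
    "rmse": 0.2362092116186248,
    "r2_score": 0.6484965021143569,
    "pearson_correlation": 0.8754595236036868,
    "spearman_correlation": 0.8618191776300459,
}

# Values shown in the README metrics table (six decimals).
table = {
    "mse": "0.055795",
    "mae": "0.141847",
    "rmse": "0.236209",
    "r2_score": "0.648497",
    "pearson_correlation": "0.875460",
    "spearman_correlation": "0.861819",
}

# RMSE should equal sqrt(MSE) up to floating-point error.
rmse_matches = math.isclose(
    math.sqrt(front_matter["mse"]), front_matter["rmse"], rel_tol=1e-6
)

# Each table entry should be the front-matter value formatted to six decimals.
table_matches = all(f"{front_matter[k]:.6f}" == table[k] for k in table)

# Spearman value from the last row of the new validation CSV (epoch 7).
last_csv_spearman = 0.9367443852419516
best_matches_last_row = f"{last_csv_spearman:.6f}" == "0.936744"

print(rmse_matches, table_matches, best_matches_last_row)
```

The README metrics (Pearson 0.875460) are lower than the validation correlations in the CSV (about 0.948 at the last epoch), so the two sets of numbers were presumably computed on different data; the diff itself does not say which.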
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:b49f199a57085d2a7b518756e550554d7cb8b1275313e52ba6f1abb26a4456d1
 size 90866412