OloriBern commited on
Commit
f20943c
·
verified ·
1 Parent(s): 3bf0b2e

Upload TrailRAG cross-encoder model for hotpotqa

Browse files
CECorrelationEvaluator_validation_results.csv CHANGED
@@ -1,67 +1,15 @@
1
  epoch,steps,Pearson_Correlation,Spearman_Correlation
2
- 0,50,nan,nan
3
- 0,-1,nan,nan
4
- 1,50,nan,nan
5
- 1,-1,nan,nan
6
- 2,50,nan,nan
7
- 2,-1,nan,nan
8
- 0,50,nan,nan
9
- 0,-1,nan,nan
10
- 1,50,nan,nan
11
- 1,-1,nan,nan
12
- 2,50,nan,nan
13
- 2,-1,nan,nan
14
- 3,50,nan,nan
15
- 3,-1,nan,nan
16
- 4,50,nan,nan
17
- 4,-1,nan,nan
18
- 5,50,nan,nan
19
- 5,-1,nan,nan
20
- 6,50,nan,nan
21
- 6,-1,nan,nan
22
- 7,50,nan,nan
23
- 7,-1,nan,nan
24
- 0,50,0.8844942424785296,0.8857172465764791
25
- 0,-1,0.8844942424785296,0.8857172465764791
26
- 1,50,0.9336146534653291,0.8904310386001277
27
- 1,-1,0.9336146534653291,0.8904310386001277
28
- 2,50,0.9450708705265047,0.908718841785034
29
- 2,-1,0.9450708705265047,0.908718841785034
30
- 3,50,0.9478919301215523,0.9122450535593611
31
- 3,-1,0.9478919301215523,0.9122450535593611
32
- 4,50,0.9495609740799951,0.9147266921300965
33
- 4,-1,0.9495609740799951,0.9147266921300965
34
- 5,50,0.9509284655955195,0.914543270049704
35
- 5,-1,0.9509284655955195,0.914543270049704
36
- 0,50,0.8769600515545288,0.8871269174764043
37
- 0,-1,0.8769600515545288,0.8871269174764043
38
- 1,50,0.9245579489171054,0.8784258567954037
39
- 1,-1,0.9245579489171054,0.8784258567954037
40
- 2,50,0.9480165887969222,0.9301595056146273
41
- 2,-1,0.9480165887969222,0.9301595056146273
42
- 3,50,0.9509537570683666,0.9320302181621813
43
- 3,-1,0.9509537570683666,0.9320302181621813
44
- 4,50,0.9527921664806371,0.9309925694044452
45
- 4,-1,0.9527921664806371,0.9309925694044452
46
- 5,50,0.9526781526872711,0.9313792659915328
47
- 5,-1,0.9526781526872711,0.9313792659915328
48
- 6,50,0.954482858927449,0.9329198527954008
49
- 6,-1,0.954482858927449,0.9329198527954008
50
- 7,50,0.953667993155727,0.9318349325109867
51
- 7,-1,0.953667993155727,0.9318349325109867
52
- 0,50,0.9009366205573937,0.8841251669454295
53
- 0,-1,0.9009366205573937,0.8841251669454295
54
- 1,50,0.9377902395000197,0.8864781167682224
55
- 1,-1,0.9377902395000197,0.8864781167682224
56
- 2,50,0.9526408925989197,0.9106428337931949
57
- 2,-1,0.9526408925989197,0.9106428337931949
58
- 3,50,0.9601144587325283,0.920010369671858
59
- 3,-1,0.9601144587325283,0.920010369671858
60
- 4,50,0.9629702193217928,0.9225955082725437
61
- 4,-1,0.9629702193217928,0.9225955082725437
62
- 5,50,0.9614396022220375,0.9227764446780587
63
- 5,-1,0.9614396022220375,0.9227764446780587
64
- 6,50,0.9626509279130091,0.9245291205031555
65
- 6,-1,0.9626509279130091,0.9245291205031555
66
- 7,50,0.9617904585351846,0.9228447811745623
67
- 7,-1,0.9617904585351846,0.9228447811745623
 
1
  epoch,steps,Pearson_Correlation,Spearman_Correlation
2
+ 0,-1,0.8888841799688185,0.8860855585221055
3
+ 1,-1,0.9433172575408377,0.899095726435546
4
+ 2,-1,0.9543818279076624,0.9182857047232711
5
+ 3,-1,0.9561063731084964,0.9276779309546022
6
+ 4,-1,0.9568838576527665,0.92585888012172
7
+ 5,-1,0.958280584064253,0.9264295627359576
8
+ 0,-1,0.8674377304858821,0.8760607745831603
9
+ 1,-1,0.9274483825359976,0.9092044306717953
10
+ 2,-1,0.9414549054805882,0.9326797719917711
11
+ 3,-1,0.9434662406344414,0.9349552144265474
12
+ 4,-1,0.9423602934547362,0.9294904476564604
13
+ 5,-1,0.9457376917330784,0.9352871141032617
14
+ 6,-1,0.9482051166603169,0.9371009844294911
15
+ 7,-1,0.9478484140539114,0.9367443852419516
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
README.md CHANGED
@@ -21,17 +21,17 @@ model-index:
21
  type: hotpotqa
22
  metrics:
23
  - type: mse
24
- value: 0.0502763435546878
25
  - type: mae
26
- value: 0.1253658650726789
27
  - type: rmse
28
- value: 0.224223869279539
29
  - type: r2_score
30
- value: 0.6987198891908811
31
  - type: pearson_correlation
32
- value: 0.8868365534337148
33
  - type: spearman_correlation
34
- value: 0.8719195144396966
35
  ---
36
 
37
  # TrailRAG Cross-Encoder: HotpotQA Enhanced
@@ -53,20 +53,20 @@ This is a fine-tuned cross-encoder model specifically optimized for **Multi-hop
53
 
54
  | Metric | Value | Description |
55
  |--------|-------|-------------|
56
- | **MSE** | **0.050276** | Mean Squared Error (lower is better) |
57
- | **MAE** | **0.125366** | Mean Absolute Error (lower is better) |
58
- | **RMSE** | **0.224224** | Root Mean Squared Error (lower is better) |
59
- | **R² Score** | **0.698720** | Coefficient of determination (higher is better) |
60
- | **Pearson Correlation** | **0.886837** | Linear correlation (higher is better) |
61
- | **Spearman Correlation** | **0.871920** | Rank correlation (higher is better) |
62
 
63
  ### Training Details
64
 
65
- - **Training Duration**: 33 minutes
66
  - **Epochs**: 8
67
  - **Early Stopping**: No
68
- - **Best Correlation Score**: 0.922845
69
- - **Final MSE**: 0.050276
70
 
71
  ### Training Configuration
72
 
 
21
  type: hotpotqa
22
  metrics:
23
  - type: mse
24
+ value: 0.0557947916534922
25
  - type: mae
26
+ value: 0.1418474710541999
27
  - type: rmse
28
+ value: 0.2362092116186248
29
  - type: r2_score
30
+ value: 0.6484965021143569
31
  - type: pearson_correlation
32
+ value: 0.8754595236036868
33
  - type: spearman_correlation
34
+ value: 0.8618191776300459
35
  ---
36
 
37
  # TrailRAG Cross-Encoder: HotpotQA Enhanced
 
53
 
54
  | Metric | Value | Description |
55
  |--------|-------|-------------|
56
+ | **MSE** | **0.055795** | Mean Squared Error (lower is better) |
57
+ | **MAE** | **0.141847** | Mean Absolute Error (lower is better) |
58
+ | **RMSE** | **0.236209** | Root Mean Squared Error (lower is better) |
59
+ | **R² Score** | **0.648497** | Coefficient of determination (higher is better) |
60
+ | **Pearson Correlation** | **0.875460** | Linear correlation (higher is better) |
61
+ | **Spearman Correlation** | **0.861819** | Rank correlation (higher is better) |
62
 
63
  ### Training Details
64
 
65
+ - **Training Duration**: 28 minutes
66
  - **Epochs**: 8
67
  - **Early Stopping**: No
68
+ - **Best Correlation Score**: 0.936744
69
+ - **Final MSE**: 0.055795
70
 
71
  ### Training Configuration
72
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:926392c30561825b4939ed008ad74e2fe368726eee6738fbebbd676e45063094
3
  size 90866412
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b49f199a57085d2a7b518756e550554d7cb8b1275313e52ba6f1abb26a4456d1
3
  size 90866412