redis
/

langcache-embed-v3

@@ -16,50 +16,54 @@ tags:
 - loss:CoSENTLoss
 base_model: Alibaba-NLP/gte-modernbert-base
 widget:
-- source_sentence: In 2015 Adolf Hitler appeared in the kickstarter short movie ``
-    Kung Fury `` as Taccone ( A.K.A .
   sentences:
-  - In 2015 , Adolf Hitler appeared in the Kickstarter - short film `` Kung Fury ``
-    as Taccone ( A.K.A .
-  - In 1795 , the only white residents were Dr. John Laidley and two brothers with
-    the surname Ainslie .
-  - The 125th University Match was played in March 2014 at the Rye Golf Club , Oxford
-    , East Sussex won the game 8.5 - 6.5 .
-- source_sentence: From 1973 to 1974 , Aubrey toured with the Cambridge Theatre Company
-    as Diggory in `` She Stoops to Conquer `` and again as Aguecheek .
   sentences:
-  - Oxide can be reduced to metallic samarium at higher temperatures by heating with
-    a reducing agent such as hydrogen or carbon monoxide .
-  - From 1973 to 1974 Aguecheek toured with the Cambridge Theatre Company as Diggory
-    in `` You Stoops to Conquer `` and again as Aubrey .
-  - The medals were presented by Barry Maister , IOC member , New Zealand and Sarah
-    Webb Gosling , Vice President of World Sailing .
-- source_sentence: There is no official wall on the border , although there are sections
-    of fence near populated areas and continuous border crossings .
   sentences:
-  - The 2014 -- 15 Boston Bruins season was the 91st season for the National Hockey
-    League franchise that was established on November 1 , 1924 .
-  - He was trained by the Inghams and owned by John Hawkes .
-  - There is no continuous wall on the border , although there are fence sections
-    near populated areas and official border crossings .
-- source_sentence: Capital . `` The French established similar hill stations in Indochina
-    , such as Dalat built in 1921 .
   sentences:
-  - Lubuk China is a small town in Alor Gajah District , Melaka , Malaysia . It is
-    situated near the border with Negeri Sembilan .
-  - The French established similar hill stations in Indochina , such as Dalat , built
-    in 1921 .
-  - John Potts ( or Pott ) was a doctor and colonial governor of Virginia in the Jamestown
-    settlement at Virginia Colony in the early 17th century .
-- source_sentence: The band pursued `` signals `` in January 2012 in three weeks ,
-    and drums were recorded in a day and a half .
   sentences:
-  - It was repaired at the beginning of the 20th century and is listed as closed in
-    our records .
-  - The band tracked `` Signals `` in three weeks in January 2012 . Drums were recorded
-    in a day and a half .
-  - Contributors include actor Anton LaVey , Satanist Christopher Lee , serial killer
-    expert Clive Barker , author Karen Greenlee , and necrophile Robert Ressler .
 datasets:
 - redis/langcache-sentencepairs-v1
 pipeline_tag: sentence-similarity
@@ -84,28 +88,28 @@ model-index:
       type: val
     metrics:
     - type: cosine_accuracy
-      value: 0.762879238548483
       name: Cosine Accuracy
     - type: cosine_accuracy_threshold
-      value: 0.8641344308853149
       name: Cosine Accuracy Threshold
     - type: cosine_f1
-      value: 0.6906413705224409
       name: Cosine F1
     - type: cosine_f1_threshold
-      value: 0.826151430606842
       name: Cosine F1 Threshold
     - type: cosine_precision
-      value: 0.6289324394017535
       name: Cosine Precision
     - type: cosine_recall
-      value: 0.7657770800627943
       name: Cosine Recall
     - type: cosine_ap
-      value: 0.7350886848165957
       name: Cosine Ap
     - type: cosine_mcc
-      value: 0.47694835496637344
       name: Cosine Mcc
   - task:
       type: binary-classification
@@ -115,28 +119,28 @@ model-index:
       type: test
     metrics:
     - type: cosine_accuracy
-      value: 0.7035036519888425
       name: Cosine Accuracy
     - type: cosine_accuracy_threshold
-      value: 0.8520702719688416
       name: Cosine Accuracy Threshold
     - type: cosine_f1
-      value: 0.7118695167174169
       name: Cosine F1
     - type: cosine_f1_threshold
-      value: 0.8109757900238037
       name: Cosine F1 Threshold
     - type: cosine_precision
-      value: 0.597953808752026
       name: Cosine Precision
     - type: cosine_recall
-      value: 0.8794040968342645
       name: Cosine Recall
     - type: cosine_ap
-      value: 0.6473233550443912
       name: Cosine Ap
     - type: cosine_mcc
-      value: 0.4409362621742405
       name: Cosine Mcc
 ---
@@ -190,9 +194,9 @@ from sentence_transformers import SentenceTransformer
 model = SentenceTransformer("redis/langcache-embed-v3")
 # Run inference
 sentences = [
-    'The band pursued `` signals `` in January 2012 in three weeks , and drums were recorded in a day and a half .',
-    'The band tracked `` Signals `` in three weeks in January 2012 . Drums were recorded in a day and a half .',
-    'Contributors include actor Anton LaVey , Satanist Christopher Lee , serial killer expert Clive Barker , author Karen Greenlee , and necrophile Robert Ressler .',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -201,9 +205,9 @@ print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
-# tensor([[1.0000, 0.9598, 0.4943],
-#         [0.9598, 0.9998, 0.5096],
-#         [0.4943, 0.5096, 1.0001]])
 ```
 <!--
@@ -241,14 +245,14 @@ You can finetune this model on your own dataset.
 | Metric                    | val        | test       |
 |:--------------------------|:-----------|:-----------|
-| cosine_accuracy           | 0.7629     | 0.7035     |
-| cosine_accuracy_threshold | 0.8641     | 0.8521     |
-| cosine_f1                 | 0.6906     | 0.7119     |
-| cosine_f1_threshold       | 0.8262     | 0.811      |
-| cosine_precision          | 0.6289     | 0.598      |
-| cosine_recall             | 0.7658     | 0.8794     |
-| **cosine_ap**             | **0.7351** | **0.6473** |
-| cosine_mcc                | 0.4769     | 0.4409     |
 <!--
 ## Bias, Risks and Limitations
@@ -269,19 +273,19 @@ You can finetune this model on your own dataset.
 #### LangCache Sentence Pairs (all)
 * Dataset: [LangCache Sentence Pairs (all)](https://huggingface.co/datasets/redis/langcache-sentencepairs-v1)
-* Size: 62,021 training samples
 * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
-  |         | sentence1                                                                         | sentence2                                                                         | label                                           |
-  |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:------------------------------------------------|
-  | type    | string                                                                            | string                                                                            | int                                             |
-  | details | <ul><li>min: 8 tokens</li><li>mean: 27.46 tokens</li><li>max: 53 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 27.36 tokens</li><li>max: 52 tokens</li></ul> | <ul><li>0: ~50.30%</li><li>1: ~49.70%</li></ul> |
 * Samples:
-  | sentence1                                                                                                                                   | sentence2                                                                                                                                     | label          |
-  |:--------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
-  | <code>The newer Punts are still very much in existence today and race in the same fleets as the older boats .</code>                        | <code>The newer punts are still very much in existence today and run in the same fleets as the older boats .</code>                           | <code>1</code> |
-  | <code>Turner Valley , was at the Turner Valley Bar N Ranch Airport , southwest of the Turner Valley Bar N Ranch , Alberta , Canada .</code> | <code>Turner Valley Bar N Ranch Airport , , was located at Turner Valley Bar N Ranch , southwest of Turner Valley , Alberta , Canada .</code> | <code>0</code> |
-  | <code>After losing his second election , he resigned as opposition leader and was replaced by Geoff Pearsall .</code>                       | <code>Max Bingham resigned as opposition leader after losing his second election , and was replaced by Geoff Pearsall .</code>                | <code>1</code> |
 * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
   ```json
   {
@@ -295,19 +299,19 @@ You can finetune this model on your own dataset.
 #### LangCache Sentence Pairs (all)
 * Dataset: [LangCache Sentence Pairs (all)](https://huggingface.co/datasets/redis/langcache-sentencepairs-v1)
-* Size: 62,021 evaluation samples
 * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
-  |         | sentence1                                                                         | sentence2                                                                         | label                                           |
-  |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:------------------------------------------------|
-  | type    | string                                                                            | string                                                                            | int                                             |
-  | details | <ul><li>min: 8 tokens</li><li>mean: 27.46 tokens</li><li>max: 53 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 27.36 tokens</li><li>max: 52 tokens</li></ul> | <ul><li>0: ~50.30%</li><li>1: ~49.70%</li></ul> |
 * Samples:
-  | sentence1                                                                                                                                   | sentence2                                                                                                                                     | label          |
-  |:--------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
-  | <code>The newer Punts are still very much in existence today and race in the same fleets as the older boats .</code>                        | <code>The newer punts are still very much in existence today and run in the same fleets as the older boats .</code>                           | <code>1</code> |
-  | <code>Turner Valley , was at the Turner Valley Bar N Ranch Airport , southwest of the Turner Valley Bar N Ranch , Alberta , Canada .</code> | <code>Turner Valley Bar N Ranch Airport , , was located at Turner Valley Bar N Ranch , southwest of Turner Valley , Alberta , Canada .</code> | <code>0</code> |
-  | <code>After losing his second election , he resigned as opposition leader and was replaced by Geoff Pearsall .</code>                       | <code>Max Bingham resigned as opposition leader after losing his second election , and was replaced by Geoff Pearsall .</code>                | <code>1</code> |
 * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
   ```json
   {
@@ -319,7 +323,7 @@ You can finetune this model on your own dataset.
 ### Training Logs
 | Epoch | Step | val_cosine_ap | test_cosine_ap |
 |:-----:|:----:|:-------------:|:--------------:|
-| -1    | -1   | 0.7351        | 0.6473         |
 ### Framework Versions

 - loss:CoSENTLoss
 base_model: Alibaba-NLP/gte-modernbert-base
 widget:
+- source_sentence: That is evident from their failure , three times in a row , to
+    get a big enough turnout to elect a president .
   sentences:
+  - 'given a text, decide to which of a predefined set of classes it belongs.  examples:
+    language identification, genre classification, sentiment analysis, and spam detection'
+  - Three times in a row , they failed to get a big _ enough turnout to elect a president
+    .
+  - He said the Government still did not know the real reason the original Saudi buyer
+    pulled out on August 21 .
+- source_sentence: these use built-in and learned knowledge to make decisions and
+    accomplish tasks that fulfill the intentions of the user.
   sentences:
+  - It also features a 4.5 in back-lit LCD screen and memory expansion facilities
+    .
+  - '- set of interrelated components - collect, process, store and distribute info.
+    - support decision-making, coordination, and control'
+  - software programs that work without direct human intervention to carry out specific
+    tasks for an individual user, business process, or software application -siri
+    adapts to your preferences over time
+- source_sentence: any location in storage can be accessed at any moment in approximately
+    the same amount of time.
   sentences:
+  - your study can adopt the original model used by the cited theorist but you can
+    modify different variables depending on your study of the whole theory
+  - an access method that can access any storage location directly and in any order;
+    primary storage devices and disk storage devices use random access...
+  - Branson said that his preference would be to operate a fully commercial service
+    on routes to New York , Barbados and Dubai .
+- source_sentence: United issued a statement saying it will " work professionally
+    and cooperatively with all its unions . "
   sentences:
+  - network that acts like the human brain; type of ai
+  - a database system consists of one or more databases and a database management
+    system (dbms).
+  - Senior vice president Sara Fields said the airline " will work professionally
+    and cooperatively with all our unions . "
+- source_sentence: A European Union spokesman said the Commission was consulting EU
+    member states " with a view to taking appropriate action if necessary " on the
+    matter .
   sentences:
+  - Justice Minister Martin Cauchon and Prime Minister Jean Chretien both have said
+    the government will introduce legislation to decriminalize possession of small
+    amounts of pot .
+  - Laos 's second most important export destination - said it was consulting EU member
+    states ' ' with a view to taking appropriate action if necessary ' ' on the matter
+    .
+  - the form data assumes and the possible range of values that the attribute defined
+    as that type of data may express  1. text 2. numerical
 datasets:
 - redis/langcache-sentencepairs-v1
 pipeline_tag: sentence-similarity
       type: val
     metrics:
     - type: cosine_accuracy
+      value: 0.7638310529446758
       name: Cosine Accuracy
     - type: cosine_accuracy_threshold
+      value: 0.8640533685684204
       name: Cosine Accuracy Threshold
     - type: cosine_f1
+      value: 0.6912742186395134
       name: Cosine F1
     - type: cosine_f1_threshold
+      value: 0.825770378112793
       name: Cosine F1 Threshold
     - type: cosine_precision
+      value: 0.6289243437982501
       name: Cosine Precision
     - type: cosine_recall
+      value: 0.7673469387755102
       name: Cosine Recall
     - type: cosine_ap
+      value: 0.7353968345121902
       name: Cosine Ap
     - type: cosine_mcc
+      value: 0.4778469995044085
       name: Cosine Mcc
   - task:
       type: binary-classification
       type: test
     metrics:
     - type: cosine_accuracy
+      value: 0.7037777526966672
       name: Cosine Accuracy
     - type: cosine_accuracy_threshold
+      value: 0.8524033427238464
       name: Cosine Accuracy Threshold
     - type: cosine_f1
+      value: 0.7122170715871171
       name: Cosine F1
     - type: cosine_f1_threshold
+      value: 0.8118724822998047
       name: Cosine F1 Threshold
     - type: cosine_precision
+      value: 0.5989283084033827
       name: Cosine Precision
     - type: cosine_recall
+      value: 0.8783612662942272
       name: Cosine Recall
     - type: cosine_ap
+      value: 0.6476665223951498
       name: Cosine Ap
     - type: cosine_mcc
+      value: 0.44182914870985407
       name: Cosine Mcc
 ---
 model = SentenceTransformer("redis/langcache-embed-v3")
 # Run inference
 sentences = [
+    'A European Union spokesman said the Commission was consulting EU member states " with a view to taking appropriate action if necessary " on the matter .',
+    "Laos 's second most important export destination - said it was consulting EU member states ' ' with a view to taking appropriate action if necessary ' ' on the matter .",
+    'the form data assumes and the possible range of values that the attribute defined as that type of data may express  1. text 2. numerical',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
+# tensor([[1.0078, 0.8789, 0.4961],
+#         [0.8789, 1.0000, 0.4648],
+#         [0.4961, 0.4648, 1.0078]], dtype=torch.bfloat16)
 ```
 <!--
 | Metric                    | val        | test       |
 |:--------------------------|:-----------|:-----------|
+| cosine_accuracy           | 0.7638     | 0.7038     |
+| cosine_accuracy_threshold | 0.8641     | 0.8524     |
+| cosine_f1                 | 0.6913     | 0.7122     |
+| cosine_f1_threshold       | 0.8258     | 0.8119     |
+| cosine_precision          | 0.6289     | 0.5989     |
+| cosine_recall             | 0.7673     | 0.8784     |
+| **cosine_ap**             | **0.7354** | **0.6477** |
+| cosine_mcc                | 0.4778     | 0.4418     |
 <!--
 ## Bias, Risks and Limitations
 #### LangCache Sentence Pairs (all)
 * Dataset: [LangCache Sentence Pairs (all)](https://huggingface.co/datasets/redis/langcache-sentencepairs-v1)
+* Size: 8,405 training samples
 * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
+  |         | sentence1                                                                         | sentence2                                                                        | label                                           |
+  |:--------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:------------------------------------------------|
+  | type    | string                                                                            | string                                                                           | int                                             |
+  | details | <ul><li>min: 6 tokens</li><li>mean: 24.89 tokens</li><li>max: 50 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 24.3 tokens</li><li>max: 43 tokens</li></ul> | <ul><li>0: ~45.80%</li><li>1: ~54.20%</li></ul> |
 * Samples:
+  | sentence1                                                                                                                             | sentence2                                                                                                                                          | label          |
+  |:--------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
+  | <code>He said the foodservice pie business doesn 't fit the company 's long-term growth strategy .</code>                             | <code>" The foodservice pie business does not fit our long-term growth strategy .</code>                                                           | <code>1</code> |
+  | <code>Magnarelli said Racicot hated the Iraqi regime and looked forward to using his long years of training in the war .</code>       | <code>His wife said he was " 100 percent behind George Bush " and looked forward to using his years of training in the war .</code>                | <code>0</code> |
+  | <code>The dollar was at 116.92 yen against the yen , flat on the session , and at 1.2891 against the Swiss franc , also flat .</code> | <code>The dollar was at 116.78 yen JPY = , virtually flat on the session , and at 1.2871 against the Swiss franc CHF = , down 0.1 percent .</code> | <code>0</code> |
 * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
   ```json
   {
 #### LangCache Sentence Pairs (all)
 * Dataset: [LangCache Sentence Pairs (all)](https://huggingface.co/datasets/redis/langcache-sentencepairs-v1)
+* Size: 8,405 evaluation samples
 * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
+  |         | sentence1                                                                         | sentence2                                                                        | label                                           |
+  |:--------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:------------------------------------------------|
+  | type    | string                                                                            | string                                                                           | int                                             |
+  | details | <ul><li>min: 6 tokens</li><li>mean: 24.89 tokens</li><li>max: 50 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 24.3 tokens</li><li>max: 43 tokens</li></ul> | <ul><li>0: ~45.80%</li><li>1: ~54.20%</li></ul> |
 * Samples:
+  | sentence1                                                                                                                             | sentence2                                                                                                                                          | label          |
+  |:--------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
+  | <code>He said the foodservice pie business doesn 't fit the company 's long-term growth strategy .</code>                             | <code>" The foodservice pie business does not fit our long-term growth strategy .</code>                                                           | <code>1</code> |
+  | <code>Magnarelli said Racicot hated the Iraqi regime and looked forward to using his long years of training in the war .</code>       | <code>His wife said he was " 100 percent behind George Bush " and looked forward to using his years of training in the war .</code>                | <code>0</code> |
+  | <code>The dollar was at 116.92 yen against the yen , flat on the session , and at 1.2891 against the Swiss franc , also flat .</code> | <code>The dollar was at 116.78 yen JPY = , virtually flat on the session , and at 1.2871 against the Swiss franc CHF = , down 0.1 percent .</code> | <code>0</code> |
 * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
   ```json
   {
 ### Training Logs
 | Epoch | Step | val_cosine_ap | test_cosine_ap |
 |:-----:|:----:|:-------------:|:--------------:|
+| -1    | -1   | 0.7354        | 0.6477         |
 ### Framework Versions

config.json CHANGED Viewed

@@ -12,7 +12,7 @@
   "cls_token_id": 50281,
   "decoder_bias": true,
   "deterministic_flash_attn": false,
-  "dtype": "float32",
   "embedding_dropout": 0.0,
   "eos_token_id": 50282,
   "global_attn_every_n_layers": 3,

   "cls_token_id": 50281,
   "decoder_bias": true,
   "deterministic_flash_attn": false,
+  "dtype": "bfloat16",
   "embedding_dropout": 0.0,
   "eos_token_id": 50282,
   "global_attn_every_n_layers": 3,

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:08f1cbe54e64c8e83baea353ce1d434c5bea3aaac96bed50b43e7d3fd8053485
-size 596070136

 version https://git-lfs.github.com/spec/v1
+oid sha256:95d02211c4cca89113f9f3e93ed91f5176bf50170faa2cb835f7bfea15bb9dd2
+size 298041696