Update README.md
        README.md
    CHANGED
    
    | @@ -39,7 +39,28 @@ datasets: | |
| 39 |  | 
| 40 | 
             
            This model is a SentenceTransformer fine-tuned from [`Shuu12121/CodeModernBERT-Owl🦉`](https://huggingface.co/Shuu12121/CodeModernBERT-Owl) on the [BigCloneBench](https://huggingface.co/datasets/google/code_x_glue_cc_clone_detection_big_clone_bench) dataset for **code clone detection**. It maps code snippets into a 768-dimensional dense vector space for semantic similarity tasks.
         | 
| 41 |  | 
| 42 | 
            -
             | 
| 43 |  | 
| 44 | 
             
            ## 📌 Model Overview
         | 
| 45 |  | 
|  | |
| 41 |  | 
| 42 | 
            +
             | 
| 43 | 
            +
             | 
| 44 | 
            +
            ## 🎯 Distinctive Performance and Stability
         | 
| 45 | 
            +
             | 
| 46 | 
            +
            This model achieves **high accuracy (about 0.99) and F1 scores (about 0.95 to 0.96)** in code clone detection across the thresholds evaluated below.  
         | 
| 47 | 
            +
            One particularly noteworthy characteristic is that **changing the similarity threshold has minimal impact on classification performance**.  
         | 
| 48 | 
            +
            This indicates that the model has learned to **clearly separate clones from non-clones**, resulting in a **stable and reliable similarity score distribution**.
         | 
| 49 | 
            +
             | 
| 50 | 
            +
            | Threshold         | Accuracy          | F1 Score           |
         | 
| 51 | 
            +
            |-------------------|-------------------|--------------------|
         | 
| 52 | 
            +
            | 0.50              | 0.9900            | 0.9633             |
         | 
| 53 | 
            +
            | 0.85              | 0.9903            | 0.9641             |
         | 
| 54 | 
            +
            | 0.90              | 0.9902            | 0.9637             |
         | 
| 55 | 
            +
            | 0.95              | 0.9887            | 0.9579             |
         | 
| 56 | 
            +
            | 0.98              | 0.9879            | 0.9540             |
         | 
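A table like the one above can be produced by sweeping a decision threshold over cosine similarity scores and recomputing accuracy and F1 at each step. The following is a minimal sketch, not the evaluation script used for this model: the function name `sweep_thresholds`, the `score >= threshold` decision rule, and the toy scores and labels are all assumptions made for illustration.

```python
def sweep_thresholds(scores, labels, thresholds):
    """Return {threshold: (accuracy, f1)}, predicting "clone" when score >= threshold.

    `scores` are cosine similarities for code pairs and `labels` are 1 (clone)
    or 0 (non-clone). The >= rule is an assumption; the source does not state
    whether the comparison is strict.
    """
    if not labels or len(scores) != len(labels):
        raise ValueError("scores and labels must be non-empty and the same length")

    results = {}
    for t in thresholds:
        tp = fp = fn = tn = 0
        for score, label in zip(scores, labels):
            predicted_clone = score >= t
            if predicted_clone and label:
                tp += 1
            elif predicted_clone and not label:
                fp += 1
            elif not predicted_clone and label:
                fn += 1
            else:
                tn += 1
        accuracy = (tp + tn) / len(labels)
        denominator = 2 * tp + fp + fn
        f1 = 2 * tp / denominator if denominator else 0.0
        results[t] = (accuracy, f1)
    return results


# Hypothetical, well-separated scores (not taken from BigCloneBench).
scores = [0.99, 0.97, 0.96, 0.93, 0.40, 0.20, 0.10, 0.05]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

results = sweep_thresholds(scores, labels, [0.5, 0.85, 0.90, 0.95, 0.98])
for threshold, (accuracy, f1) in results.items():
    print(f"{threshold:.2f}  accuracy={accuracy:.4f}  f1={f1:.4f}")
```

When clone and non-clone scores are well separated, as in this toy data, the metrics only start to move once the threshold climbs into the range where true clones actually score.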
| 57 | 
            +
             | 
| 58 | 
            +
            - **High Stability**: Between thresholds of 0.85 and 0.98, accuracy varies by only 0.0024 (0.9879 to 0.9903) and F1 by about 0.01 (0.9540 to 0.9641).  
         | 
| 59 | 
            +
              _(This suggests that code pairs considered clones generally score between 0.9 and 1.0 in cosine similarity.)_
         | 
| 60 | 
            +
             | 
| 61 | 
            +
            - **Reliable in Real-World Applications**: Even if the similarity threshold is slightly adjusted for different tasks or environments, the model maintains consistent performance without significant degradation.
         | 
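In practice, the threshold is applied to the cosine similarity of two embeddings. The sketch below shows that decision step in plain Python, assuming the embeddings have already been computed. The helper names `cosine_similarity` and `is_clone`, the default threshold of 0.9, and the short toy vectors (standing in for the model's 768-dimensional outputs) are illustrative assumptions, not part of the model's API.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_clone(embedding_a, embedding_b, threshold=0.9):
    """Classify a pair as a clone when its similarity reaches `threshold`.

    The default of 0.9 is an assumed value chosen from the range where the
    reported metrics are flat; tune it for your own data.
    """
    return cosine_similarity(embedding_a, embedding_b) >= threshold


# Toy 3-dimensional vectors standing in for real 768-dimensional embeddings.
same_direction = is_clone([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
orthogonal = is_clone([1.0, 0.0, 0.0], [0.0, 1.0, 0.0])
print(same_direction, orthogonal)
```

Because the reported metrics change little between 0.85 and 0.98, a moderate change to `threshold` in a helper like this should shift few decisions, provided your data resembles the evaluation set.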
| 62 | 
            +
             | 
| 63 | 
            +
             | 
| 64 |  | 
| 65 | 
             
            ## 📌 Model Overview
         | 
| 66 |  | 
