updated model summary
README.md
CHANGED
@@ -26,13 +26,13 @@ widget:
 
 ## Model Summary
 
-The
+The Narrow Transformer (NT) model NT-Java-1.1B is an open-source, specialized code model built on StarCoderBase and designed for Java code completion. It is a decoder-only transformer with Multi-Query Attention and learned absolute positional embeddings, fine-tuned on the Java subset of the training data (starcoderdata), about 22B tokens, with a context length of 8192 tokens.
 
-- **Repository:**
+- **Repository:** [bigcode/Megatron-LM](https://github.com/bigcode-project/Megatron-LM)
 - **Project Website:**
 - **Paper:**
 - **Point of Contact:**
-- **Languages:**
+- **Languages:** Java
 
 ## Use
 
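The summary gives a context length of 8192 tokens, so a long Java file has to be trimmed before it is sent for completion. The sketch below is one possible way to do that, not the authors' method: it keeps the most recent tokens so that prompt plus completion fit the window. The helper names `fit_prompt` and `complete_java` are hypothetical, and the model id `infosys/NT-Java-1.1B` is an assumption that does not appear in this diff.

```python
from typing import List

CONTEXT_TOKENS = 8192  # context length stated in the model summary


def fit_prompt(token_ids: List[int], max_new_tokens: int,
               context_tokens: int = CONTEXT_TOKENS) -> List[int]:
    """Keep the most recent tokens so prompt + completion fit the window."""
    if not 0 < max_new_tokens < context_tokens:
        raise ValueError("max_new_tokens must be in 1..context_tokens - 1")
    budget = context_tokens - max_new_tokens
    # Left-to-right completion depends most on the text nearest the cursor,
    # so the oldest tokens are the ones dropped.
    return token_ids[-budget:]


def complete_java(prompt: str, max_new_tokens: int = 64,
                  model_id: str = "infosys/NT-Java-1.1B") -> str:
    """Sketch of a completion call; model_id is an assumed Hub id."""
    # Heavy imports stay local so fit_prompt can be used without them.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    ids = fit_prompt(tokenizer(prompt)["input_ids"], max_new_tokens)
    output = model.generate(torch.tensor([ids]), max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Calling `complete_java("public static int add(int a, int b) {")` would download the model weights on first use. `fit_prompt` itself has no dependencies and can be checked in isolation.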