rajabmondal committed on
Commit
e7ecaf3
·
verified ·
1 Parent(s): 9057e2d

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -124,9 +124,9 @@ outputs = model.generate(inputs)
  print(tokenizer.decode(outputs[0]))
  ```
 
- ### Attribution & Other Requirements
+ # Attribution & Other Requirements
 
- The pretraining dataset of the model was filtered for permissive licenses only. Nevertheless, the model can generate source code verbatim from the dataset. The code's license might require attribution and/or other specific requirements that must be respected. We provide a [search index](https://huggingface.co/spaces/bigcode/starcoder-search) that let's you search through the pretraining data to identify where generated code came from and apply the proper attribution to your code.
+ The pretraining dataset for the model was curated to include only data with permissive licenses. Despite this, the model is capable of generating source code verbatim from the dataset. The licenses of such code may necessitate attribution and adherence to other specific conditions. To facilitate compliance, we provide a search index that enables users to trace the origins of generated code within the pretraining data, allowing for proper attribution and adherence to licensing requirements.
 
  # Limitations