Spaces:

double-ai
/

FormulaOne-Leaderboard

Running on CPU Upgrade

App Files Files Community

Alvinn-aai commited on Jul 30

Commit

9eba8d6

1 Parent(s): daa3ab0

submission extraction

Browse files

Files changed (1) hide show

src/about.py +34 -21

src/about.py CHANGED Viewed

@@ -45,35 +45,48 @@ To reproduce our results, here is the commands you can run:
 """
 EVALUATION_QUEUE_TEXT = """
-## Some good practices before submitting a model
-### 1) Make sure you can load your model and tokenizer using AutoClasses:
-```python
-from transformers import AutoConfig, AutoModel, AutoTokenizer
-config = AutoConfig.from_pretrained("your model name", revision=revision)
-model = AutoModel.from_pretrained("your model name", revision=revision)
-tokenizer = AutoTokenizer.from_pretrained("your model name", revision=revision)
 ```
-If this step fails, follow the error messages to debug your model before submitting it. It's likely your model has been improperly uploaded.
-Note: make sure your model is public!
-Note: if your model needs `use_remote_code=True`, we do not support this option yet but we are working on adding it, stay posted!
-### 2) Convert your model weights to [safetensors](https://huggingface.co/docs/safetensors/index)
-It's a new format for storing weights which is safer and faster to load and use. It will also allow us to add the number of parameters of your model to the `Extended Viewer`!
-### 3) Make sure your model has an open license!
-This is a leaderboard for Open LLMs, and we'd love for as many people as possible to know they can use your model 🤗
-### 4) Fill up your model card
-When we add extra information about models to the leaderboard, it will be automatically taken from the model card
-## In case of model failure
-If your model is displayed in the `FAILED` category, its execution stopped.
-Make sure you have followed the above steps first.
-If everything is done, check you can launch the EleutherAIHarness on your model locally, using the above command without modifications (you can add `--limit` to limit the number of examples per task).
 """
 CITATION_BUTTON_LABEL = """📚 How to cite FormulaOne"""
 CITATION_BUTTON_TEXT = r"""
 @misc{beniamini2025formulaonemeasuringdepthalgorithmic,

 """
 EVALUATION_QUEUE_TEXT = """
+## 🧪 Submitting to the FormulaOne Leaderboard
+This leaderboard evaluates systems on the FormulaOne core dataset. Submissions consist of a .jsonl file with solution code for each problem.
+### 📁 1. Format Your Submission File
+Your submission must be a .jsonl file with one entry per problem:
+```json
+{"problem_id": 1, "solution": "<your Python code here>"}
+{"problem_id": 2, "solution": "<your Python code here>"}
+...
 ```
+- problem_id: Must match the official list of FormulaOne core problems.
+- solution: A Python code implementing the required callback functions.
+📄 Full list of problem_ids:
+View the [FormulaOne core dataset](https://github.com/double-ai/formulaone-dataset-release/dataset/formulaone) for the complete list of problem IDs.
+⚠️ Validation Rules:
+Submissions must:
+	•	Contain exactly two columns: ["problem_id", "solution"]
+	•	Include all required problems (no missing/unknown IDs)
+	•	Provide solutions as Python strings
+	•	Avoid duplicates
+### 📤 2. Submit via the Web UI
+	1.	Upload your .jsonl file.
+	2.	Fill in:
+	•	System Name
+	•	Organization
+	•	System Type
+	3.	Click Submit.
+### ⏱️ After Submission
+Submissions are validated and evaluated within ~24 hours. Results will appear on the leaderboard once processed.
 """
 CITATION_BUTTON_LABEL = """📚 How to cite FormulaOne"""
 CITATION_BUTTON_TEXT = r"""
 @misc{beniamini2025formulaonemeasuringdepthalgorithmic,