tensorplex-labs committed (verified) · Commit 5973efd · 1 Parent(s): 746bbf4

Update README.md

Files changed (1): README.md (+16 −10)

README.md CHANGED
@@ -5,15 +5,23 @@ datasets:
  - tensorplex-labs/Dojo-HumanFeedback-DPO
  language:
  - en
+ license: cc-by-4.0
  ---

- # Model Card for Model ID
+ # INTERFACE-CODER-7B: First-of-its-kind Interface Generation Model Trained on High-Quality Synthetic Data Curated Using Distributed Human Feedback

- <!-- Provide a quick summary of what the model is/does. -->
+ We are thrilled to release INTERFACE-CODER-7B, a first-of-its-kind Large Language Model (LLM) specialized in generating complex, interactive, and visually appealing frontend interfaces.

- ## Model Details
+ INTERFACE-CODER-7B is trained on high-quality synthetic data generated by state-of-the-art AI models. Data quality is further guaranteed using code verifiers, LLM-as-judge filtering, and distributed human feedback.
+
+ Leveraging Dojo's distributed human feedback infrastructure, we curated two datasets:
+
+ - Dojo-SFT: a comprehensive dataset for supervised fine-tuning (SFT), filtered using LLM-as-judge.
+ - Dojo-DPO: a preference dataset for Direct Preference Optimization (DPO), curated using human feedback scores to align the model's output with human aesthetic and functional preferences.
+
+ Our development process followed a two-stage post-training methodology. We began with the powerful **Qwen2.5-Coder-7B-Instruct** as our base model. This foundation was first elevated through a supervised fine-tuning phase on Dojo-SFT, then a direct preference optimization stage on Dojo-DPO, producing the final, highly specialized INTERFACE-CODER-7B.
+
+ INTERFACE-CODER-7B generates functional and visually appealing frontend interfaces, far exceeding the interface generation capabilities of its base model. Beyond its primary use case, the model demonstrates remarkable generalization on benchmarks beyond its specialty, such as MMLU, GSM8K, and HumanEval.

  ### Model Description
@@ -21,13 +29,11 @@ language:



- - **Developed by:** [More Information Needed]
- - **Funded by [optional]:** [More Information Needed]
- - **Shared by [optional]:** [More Information Needed]
- - **Model type:** [More Information Needed]
- - **Language(s) (NLP):** [More Information Needed]
- - **License:** [More Information Needed]
- - **Finetuned from model [optional]:** [More Information Needed]
+ - **Developed by:** Shi Jie Yu, Tensorplex Labs
+ - **Model type:** LoRA DPO
+ - **Language(s) (NLP):** English
+ - **License:** Creative Commons Attribution 4.0
+ - **Finetuned from model:** [tensorplex-labs/INTERFACE-CODER-7B-SFT](https://huggingface.co/tensorplex-labs/INTERFACE-CODER-7B-SFT)

  ### Model Sources [optional]
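The DPO stage described in the card optimizes a Bradley–Terry preference objective between chosen and rejected responses. A minimal sketch of the per-pair loss in pure Python — illustrative log-probability values only, not the card's actual training code, and the reference policy is assumed (per the card's "finetuned from" field) to be the frozen SFT checkpoint:

```python
import math

def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Direct Preference Optimization loss for one preference pair.

    Each argument is the summed token log-probability of the chosen or
    rejected response under the trained policy or the frozen reference.
    """
    # Implicit reward margin: how much more the policy prefers the chosen
    # response over the rejected one, relative to the reference model.
    logits = (policy_chosen_logp - ref_chosen_logp) - \
             (policy_rejected_logp - ref_rejected_logp)
    # -log sigmoid(beta * logits): shrinks as the policy widens the margin.
    return -math.log(1.0 / (1.0 + math.exp(-beta * logits)))

# A policy that has learned to prefer the chosen response incurs lower loss
# than one still identical to the reference.
improved = dpo_loss(-10.0, -30.0, -20.0, -25.0)   # margin widened
unchanged = dpo_loss(-20.0, -25.0, -20.0, -25.0)  # equals the reference
```

At initialization, where the policy equals the reference, the margin is zero and the loss sits at ln 2 ≈ 0.693; training drives it lower by widening the margin on human-preferred outputs.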