HemanM commited on
Commit
b7fe99b
Β·
verified Β·
1 Parent(s): a040341

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -9
README.md CHANGED
@@ -10,18 +10,36 @@ pinned: false
10
  license: apache-2.0
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
14
 
15
- # 🧬 EvoTransformer v2.1 – Evolving Transformer Architectures
16
 
17
- Built by **Dr. Heman Mohabeer**, EvoTransformer is a lightweight evolution-based transformer that adapts its architecture to perform reasoning tasks β€” especially PIQA β€” and is benchmarked live against GPT-3.5.
 
18
 
19
- πŸ”Ή Built from scratch
20
- πŸ”Ή ~13M Parameters
21
- πŸ”Ή Runs in under 10 seconds
22
- πŸ”Ή Trains on-the-fly with few-shot examples
 
23
 
24
- Try the interactive benchmark demo and see how it compares with GPT-3.5!
 
 
 
 
 
25
 
26
- πŸ“˜ Featured in the book: [I Was Never Just Code](https://www.amazon.com/dp/B0FHG443LL)
 
 
27
 
 
 
 
 
 
 
 
 
 
10
  license: apache-2.0
11
  ---
12
 
13
+ '''
14
+ # EvoTransformer v2.1 🧠✨
15
 
16
+ EvoTransformer is an evolving neural architecture built from scratch to tackle reasoning tasks with minimal compute.
17
 
18
+ ## πŸ“Œ What It Does
19
+ This model answers PIQA-style commonsense reasoning questions. Given a goal and two solution choices, EvoTransformer chooses the more logical one.
20
 
21
+ ## πŸ”§ Architecture
22
+ - Built with 4 Transformer encoder layers
23
+ - ~13 million parameters
24
+ - Custom embedding, pooling, and classifier layers
25
+ - Fully open and adaptable for NAS or self-evolving tasks
26
 
27
+ ## πŸ‹οΈβ€β™‚οΈ Training Details
28
+ - Dataset: PIQA (1000 training, 500 validation examples)
29
+ - Optimizer: Adam
30
+ - Loss: CrossEntropy
31
+ - Epochs: 5
32
+ - Hardware: Colab GPU
33
 
34
+ ## πŸš€ Live Demo
35
+ Try it on Hugging Face Spaces:
36
+ πŸ‘‰ [Demo Link](https://huggingface.co/spaces/YOUR_USERNAME/evo-transformer-demo)
37
 
38
+ ## πŸ’‘ Why EvoTransformer?
39
+ - Lean, fast, and efficient
40
+ - Custom-built from scratch (no pretraining dependencies)
41
+ - Can evolve structurally in future versions
42
+
43
+ ## πŸ“œ License
44
+ MIT
45
+ '''