Update README.md

README.md
@@ -31,16 +31,46 @@ This model is designed for scalable training, long-context understanding, and ef
## 📁 Project Structure

```bash
MEM_TRANSFORMER/
├── configs/
│   └── config.json            # Model + training hyperparameters
│
├── data/
│   ├── edu_fineweb/           # Token-sharded training data
│   │   ├── train_000001.npy
│   │   ├── train_000002.npy
│   │   └── test_000001.npy
│   ├── hellaswag/
│   │   └── hellaswag_val.jsonl
│   └── fineweb.py             # Sharding logic with memory-aligned sequence control
│
├── model_core/
│   ├── __init__.py
│   ├── attention.py           # Grouped Query Attention, kNN & XL attention logic; Rotary Positional Encoding implementation
│   ├── model.py               # Transformer model with memory and RoPE support
│   ├── dataloader.py          # Memory-aware DataLoader
│   └── training.py            # train_memgpt function
│
├── scripts/
│   ├── train.py               # Training script (DDP-compatible)
│   ├── evaluate.py            # Evaluation on benchmarks
│   └── generate.py            # Text generation from a trained model
│
├── evaluation/
│   ├── __init__.py
│   ├── hellaswag.py           # HellaSwag data loader
│   └── val_hellaswag.py       # Evaluation logic with loss-based scoring
│
├── logs/
│   ├── log.txt                # Training logs
│   └── model_*.pt             # Checkpoints
│
├── .gitignore
├── README.md
└── requirements.txt
```

---
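The `data/edu_fineweb/` shards above are `.npy` files of token IDs. A minimal sketch of how such a shard could be consumed, assuming each shard is a flat 1-D array saved with `np.save` (the helper names `load_shard` and `iter_sequences` are illustrative, not the repo's actual `dataloader.py` API):

```python
import numpy as np

def load_shard(path):
    """Load a token shard and widen the dtype for embedding lookup."""
    return np.load(path).astype(np.int64)

def iter_sequences(tokens, seq_len):
    """Yield (input, target) pairs, with targets shifted by one token."""
    n_full = (len(tokens) - 1) // seq_len  # last token only serves as a target
    for i in range(n_full):
        chunk = tokens[i * seq_len : (i + 1) * seq_len + 1]
        yield chunk[:-1], chunk[1:]

if __name__ == "__main__":
    # Synthetic stand-in for a file like data/edu_fineweb/train_000001.npy
    demo = np.arange(1025, dtype=np.uint16).astype(np.int64)
    pairs = list(iter_sequences(demo, seq_len=256))
    print(len(pairs))  # 4 full (input, target) pairs
```

A real memory-aware loader would additionally keep sequences contiguous across batches so recurrent memory segments line up, which is what "memory-aligned sequence control" in `fineweb.py` suggests.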
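`val_hellaswag.py` is described as using loss-based scoring: for each of the four candidate endings, compute the average token loss under the model and predict the ending with the lowest loss. A toy sketch of that selection step, with stand-in log-probabilities instead of real model output (`ending_loss` and `pick_ending` are hypothetical names, not functions from this repo):

```python
import numpy as np

def ending_loss(token_logprobs):
    """Average negative log-probability over an ending's tokens."""
    return -float(np.mean(token_logprobs))

def pick_ending(per_ending_logprobs):
    """Loss-based scoring: choose the ending whose tokens the model
    finds most likely, i.e. the one with the lowest average loss."""
    losses = [ending_loss(lp) for lp in per_ending_logprobs]
    return int(np.argmin(losses))

# Toy per-token log-probs for 4 candidate endings
cands = [
    np.log([0.10, 0.20]),
    np.log([0.50, 0.60]),  # highest-probability tokens -> lowest loss
    np.log([0.05, 0.10]),
    np.log([0.30, 0.20]),
]
print(pick_ending(cands))  # 1
```

Averaging over the ending's tokens (rather than summing) keeps the score comparable across endings of different lengths.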