PrakhAI committed on
Commit 0e0cdf8 · 1 Parent(s): 3720930

Create README.md

Files changed (1)
  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
+ ---
+ license: cc-by-sa-3.0
+ datasets:
+ - mnist
+ ---
+
+ [WGAN-GP](https://arxiv.org/abs/1704.00028) model trained on the [MNIST dataset](https://www.tensorflow.org/datasets/catalog/mnist) using [JAX in Colab](https://colab.research.google.com/drive/1RzQfrc4Xf_pvGJD2PaNJyaURLh0nO4Fp?usp=sharing).
+
+ | Real Images | Generated Images |
+ | ------- | -------- |
+ | ![image.png](https://cdn-uploads.huggingface.co/production/uploads/649f9483d76ca0fe679011c2/YlmgxAdyvJl-oy4Ae_fGB.png) | ![image.png](https://cdn-uploads.huggingface.co/production/uploads/649f9483d76ca0fe679011c2/sNDUja9lFPKiH8UDUqBvl.png) |
+
+ # Training Progression
+ <video width="50%" controls>
+ <source src="https://cdn-uploads.huggingface.co/production/uploads/649f9483d76ca0fe679011c2/EB8tZYvT3t7GXuO7XgbYq.mp4" type="video/mp4">
+ </video>
+
+ # Details
+ This model is based on [WGAN-GP](https://arxiv.org/abs/1704.00028).
+
+ The model was trained for approximately 9 hours 40 minutes on a GCE VM instance (n1-standard-4, 1 x NVIDIA T4).
+
+ The Critic consists of 4 convolutional layers, which use strides for downsampling, with Leaky ReLU activation. The Critic does not use Batch Normalization or Dropout.
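The critic description above can be sketched in plain JAX as follows. This is a minimal illustration, not the notebook's code: the channel widths, the 5x5 kernels, the stride of 2 in every layer, the Leaky ReLU slope of 0.2, the linear final layer, and the spatial mean that reduces the output to one score per image are all assumptions. Only the layer count, the strided downsampling, the Leaky ReLU activation, and the absence of Batch Normalization and Dropout come from the README.

```python
import jax
import jax.numpy as jnp
from jax import lax

# Assumption: channel widths per layer (input has 1 channel for MNIST).
CHANNELS = (1, 32, 64, 128, 1)


def init_critic(key, channels=CHANNELS, kernel=5):
    """Create (weight, bias) pairs for 4 strided convolutions (HWIO layout)."""
    params = []
    for c_in, c_out in zip(channels[:-1], channels[1:]):
        key, sub = jax.random.split(key)
        w = 0.02 * jax.random.normal(sub, (kernel, kernel, c_in, c_out))
        params.append((w, jnp.zeros((c_out,))))
    return params


def critic(params, images):
    """Map a batch of NHWC images to one unbounded score per image."""
    x = images
    for i, (w, b) in enumerate(params):
        # Stride 2 halves the spatial size: 28 -> 14 -> 7 -> 4 -> 2.
        x = lax.conv_general_dilated(
            x, w, window_strides=(2, 2), padding="SAME",
            dimension_numbers=("NHWC", "HWIO", "NHWC")) + b
        # Assumption: the last layer is left linear, as is usual for a critic.
        if i < len(params) - 1:
            x = jax.nn.leaky_relu(x, negative_slope=0.2)
    # No Batch Normalization or Dropout anywhere, as stated in the README.
    return jnp.mean(x, axis=(1, 2, 3))
```

Calling `critic(init_critic(jax.random.PRNGKey(0)), images)` on a batch of shape `(N, 28, 28, 1)` returns an array of shape `(N,)`.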
+
+ The Generator consists of 4 transposed convolutional layers with ReLU activation and Batch Normalization.
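A generator of this shape might look like the sketch below. The latent size, kernel sizes, strides, channel widths, the order of Batch Normalization and ReLU, and the `tanh` output are assumptions chosen so that the output is 28x28x1. The batch normalization here is a simplified training-mode version with no learned scale, shift, or running statistics. The notebook remains the reference for the real architecture.

```python
import jax
import jax.numpy as jnp
from jax import lax

LATENT_DIM = 64  # assumption: size of the noise vector
# Assumption: (kernel, stride, padding, out_channels) for each of the 4 layers.
LAYERS = ((7, 1, "VALID", 128), (5, 2, "SAME", 64),
          (5, 2, "SAME", 32), (5, 1, "SAME", 1))


def init_generator(key, latent_dim=LATENT_DIM, layers=LAYERS):
    """Create one HWIO kernel per transposed convolution."""
    params, c_in = [], latent_dim
    for kernel, _, _, c_out in layers:
        key, sub = jax.random.split(key)
        params.append(0.02 * jax.random.normal(sub, (kernel, kernel, c_in, c_out)))
        c_in = c_out
    return params


def batch_norm(x, eps=1e-5):
    """Normalize each channel with the statistics of the current batch."""
    mean = jnp.mean(x, axis=(0, 1, 2), keepdims=True)
    var = jnp.var(x, axis=(0, 1, 2), keepdims=True)
    return (x - mean) / jnp.sqrt(var + eps)


def generator(params, z, layers=LAYERS):
    """Map noise of shape (N, latent) to images of shape (N, 28, 28, 1)."""
    x = z[:, None, None, :]  # treat the noise as a 1x1 feature map
    for i, (w, (_, stride, padding, _)) in enumerate(zip(params, layers)):
        # Spatial size: 1 -> 7 -> 14 -> 28 -> 28.
        x = lax.conv_transpose(
            x, w, strides=(stride, stride), padding=padding,
            dimension_numbers=("NHWC", "HWIO", "NHWC"))
        if i < len(params) - 1:
            x = jax.nn.relu(batch_norm(x))
    # Assumption: tanh output, i.e. images scaled to [-1, 1].
    return jnp.tanh(x)
```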
+
+ The learning rate was held constant at 1e-4 for the first 50,000 steps, followed by cosine annealing cycles with a peak learning rate of 1e-3.
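One way to express this schedule as a function of the step count is sketched below. The constant phase (1e-4 for 50,000 steps) and the peak of 1e-3 come from the README. The cycle length, the minimum learning rate, and the choice to start each cycle at the peak are hypothetical values picked for illustration.

```python
import jax.numpy as jnp

CONSTANT_STEPS = 50_000  # from the README
BASE_LR = 1e-4           # from the README
PEAK_LR = 1e-3           # from the README
CYCLE_STEPS = 10_000     # assumption: length of one cosine cycle
MIN_LR = 1e-5            # assumption: value at the end of a cycle


def learning_rate(step):
    """Constant learning rate first, then repeating cosine decay cycles."""
    step = jnp.asarray(step, dtype=jnp.float32)
    # Position inside the current cycle, in [0, 1).
    phase = jnp.mod(step - CONSTANT_STEPS, CYCLE_STEPS) / CYCLE_STEPS
    # Cosine decay from PEAK_LR (phase 0) towards MIN_LR (phase 1).
    cosine = MIN_LR + 0.5 * (PEAK_LR - MIN_LR) * (1.0 + jnp.cos(jnp.pi * phase))
    return jnp.where(step < CONSTANT_STEPS, BASE_LR, cosine)
```

Under these assumptions the rate jumps to the peak at step 50,000 and restarts at the peak every 10,000 steps after that. A function with this signature can be passed to most optimizers that accept a schedule callable.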
+
+ The gradient penalty coefficient (lambda) was 10, the same as in the original paper.
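For reference, the WGAN-GP critic loss with this coefficient can be sketched in JAX as below. It follows the formulation in the WGAN-GP paper rather than the notebook's exact code. `critic_fn` is a hypothetical function that maps a single image of shape `(H, W, C)` to a scalar score, and the small constant inside the square root is an added assumption for numerical stability.

```python
import jax
import jax.numpy as jnp

LAMBDA = 10.0  # gradient penalty coefficient, from the README


def gradient_penalty(critic_fn, real, fake, key):
    """Mean of (||grad critic(x_hat)|| - 1)^2 over random interpolates x_hat."""
    # One interpolation coefficient per sample, broadcast over H, W, C.
    eps = jax.random.uniform(key, (real.shape[0], 1, 1, 1))
    mixed = eps * real + (1.0 - eps) * fake
    # Gradient of the scalar score with respect to each interpolated image.
    grads = jax.vmap(jax.grad(critic_fn))(mixed)
    norms = jnp.sqrt(jnp.sum(grads ** 2, axis=(1, 2, 3)) + 1e-12)
    return jnp.mean((norms - 1.0) ** 2)


def critic_loss(critic_fn, real, fake, key):
    """Wasserstein critic loss plus the weighted gradient penalty."""
    score_fake = jnp.mean(jax.vmap(critic_fn)(fake))
    score_real = jnp.mean(jax.vmap(critic_fn)(real))
    return score_fake - score_real + LAMBDA * gradient_penalty(
        critic_fn, real, fake, key)
```

The penalty is zero when the critic's gradient has unit norm at every interpolated point, which is how the loss encourages the 1-Lipschitz constraint.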
+
+ For more details, please refer to the [Colab Notebook](https://colab.research.google.com/drive/1RzQfrc4Xf_pvGJD2PaNJyaURLh0nO4Fp?usp=sharing).