Commit e0d82af

Merge remote-tracking branch 'github/master'
2 parents 9b0a0b9 + 535659a commit e0d82af

2 files changed, +7 -6 lines changed

README.md

+4-3
@@ -13,7 +13,8 @@
 [arXiv](https://arxiv.org/abs/2012.09841) | [BibTeX](#bibtex) | [Project Page](https://compvis.github.io/taming-transformers/)
 
 ### News
-
+- We added a [colab notebook](https://colab.research.google.com/github/CompVis/taming-transformers/blob/master/scripts/reconstruction_usage.ipynb) which compares two VQGANs and OpenAI's [DALL-E](). See also [this section](#more-resources).
+- We now include an overview of pretrained models in [Tab.1](#overview-of-pretrained-models)
 - The streamlit demo now supports image completions.
 - We now include a couple of examples from the D-RIN dataset so you can run the
 [D-RIN demo](#d-rin) without preparing the dataset first.
@@ -39,8 +40,8 @@ for a comparison and discussion of reconstruction capabilities.
 | ------------- | ------------- |------------- | ------------- |------------- |
 | FFHQ (f=16) | 11.4 | coming soon... |
 | CelebA-HQ (f=16) | 10.7 | coming soon... |
-| ADE20K (f=16) | 35.5 | [ade20k_transformer](https://k00.fr/ot46cksa) | [ade20k_samples.zip](https://heibox.uni-heidelberg.de/f/70bb78cbaf844501b8fb/) [2k]
-| COCO-Stuff (f=16) | 20.4 | [coco_transformer](https://k00.fr/2zz6i2ce) | [coco_samples.zip](https://heibox.uni-heidelberg.de/f/a395a9be612f4a7a8054/) [5k]
+| ADE20K (f=16) | 35.5 | [ade20k_transformer](https://k00.fr/ot46cksa) | [ade20k_samples.zip](https://heibox.uni-heidelberg.de/f/70bb78cbaf844501b8fb/) [2k] | evaluated on val split (2k images)
+| COCO-Stuff (f=16) | 20.4 | [coco_transformer](https://k00.fr/2zz6i2ce) | [coco_samples.zip](https://heibox.uni-heidelberg.de/f/a395a9be612f4a7a8054/) [5k] | evaluated on val split (5k images)
 | ImageNet (cIN) (f=16) | | coming soon...
 | | | | || |
 | FacesHQ (f=16) | -- | [faceshq_transformer](https://k00.fr/qqfl2do8)

scripts/reconstruction_usage.ipynb

+3-3
@@ -273,7 +273,7 @@
 "# Taming Transformers\n",
 "## Reconstruction Capabilities of VQGAN (Colab Notebook)\n",
 "\n",
-"This notebook provides code to (visually) analyze the first stage models used to generate images as in [Taming Transformers for High-Resolution Image Synthesis](https://github.com/CompVis/taming-transformers) \n",
+"This notebook provides code to (visually) analyze the first stage models used to generate images as in [Taming Transformers for High-Resolution Image Synthesis](https://github.com/CompVis/taming-transformers)\n",
 "and a comparison to the first stage model used in [DALL-E](https://openai.com/blog/dall-e/)."
 ]
 },
@@ -284,8 +284,8 @@
 },
 "source": [
 "### Setup\n",
-"Clone the repository and download pretrained VQGANs: a small one with a codebook dimensionality $\\dim \\mathcal{Z} = 1024$ \n",
-"and a larger one with $ \\dim \\mathcal{Z} = 16384$. Both perform *four* downsampling steps, e.g. an input image of \n",
+"Clone the repository and download pretrained VQGANs: a small one with a codebook dimensionality $\\dim \\mathcal{Z} = 1024$\n",
+"and a larger one with $ \\dim \\mathcal{Z} = 16384$. Both perform *four* downsampling steps, e.g. an input image of\n",
 "size $256 \\times 256$ will be mapped to a latent code of size $16 \\times 16$."
 ]
 },
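
Note on the Setup cell touched above: its text says four downsampling steps map a $256 \times 256$ image to a $16 \times 16$ latent code. The sketch below only checks that arithmetic. It assumes each downsampling step halves the spatial resolution (which matches the f=16 factor listed in the README table); `latent_size` is a hypothetical helper written for this note, not a function from the repository.

```python
def latent_size(image_size: int, num_downsampling_steps: int = 4) -> int:
    """Side length of the latent grid.

    Assumption (not taken from the repository code): every downsampling
    step halves the spatial resolution, so the total factor is 2**steps.
    """
    factor = 2 ** num_downsampling_steps
    if image_size % factor != 0:
        raise ValueError(
            f"image_size={image_size} is not divisible by the factor {factor}"
        )
    return image_size // factor


if __name__ == "__main__":
    side = latent_size(256)  # four halvings: 256 -> 128 -> 64 -> 32 -> 16
    print(f"latent grid: {side} x {side} ({side * side} codes per image)")
```

Under that assumption, a 256-pixel side yields a 16-entry side, i.e. 256 codebook indices per image for either codebook size mentioned in the cell.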
