You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- We added a [colab notebook](https://colab.research.google.com/github/CompVis/taming-transformers/blob/master/scripts/reconstruction_usage.ipynb) which compares two VQGANs and OpenAI's [DALL-E](). See also [this section](#more-resources).
17
+
- We now include an overview of pretrained models in [Tab.1](#overview-of-pretrained-models)
17
18
- The streamlit demo now supports image completions.
18
19
- We now include a couple of examples from the D-RIN dataset so you can run the
19
20
[D-RIN demo](#d-rin) without preparing the dataset first.
@@ -39,8 +40,8 @@ for a comparison and discussion of reconstruction capabilities.
Copy file name to clipboardExpand all lines: scripts/reconstruction_usage.ipynb
+3-3
Original file line number
Diff line number
Diff line change
@@ -273,7 +273,7 @@
273
273
"# Taming Transformers\n",
274
274
"## Reconstruction Capabilities of VQGAN (Colab Notebook)\n",
275
275
"\n",
276
-
"This notebook provides code to (visually) analyze the first stage models used to generate images as in [Taming Transformers for High-Resolution Image Synthesis](https://github.com/CompVis/taming-transformers)\n",
276
+
"This notebook provides code to (visually) analyze the first stage models used to generate images as in [Taming Transformers for High-Resolution Image Synthesis](https://github.com/CompVis/taming-transformers)\n",
277
277
"and a comparison to the first stage model used in [DALL-E](https://openai.com/blog/dall-e/)."
278
278
]
279
279
},
@@ -284,8 +284,8 @@
284
284
},
285
285
"source": [
286
286
"### Setup\n",
287
-
"Clone the repository and download pretrained VQGANs: a small one with a codebook dimensionality $\\dim \\mathcal{Z} = 1024$\n",
288
-
"and a larger one with $ \\dim \\mathcal{Z} = 16384$. Both perform *four* downsampling steps, e.g. an input image of\n",
287
+
"Clone the repository and download pretrained VQGANs: a small one with a codebook dimensionality $\\dim \\mathcal{Z} = 1024$\n",
288
+
"and a larger one with $ \\dim \\mathcal{Z} = 16384$. Both perform *four* downsampling steps, e.g. an input image of\n",
289
289
"size $256 \\times 256$ will be mapped to a latent code of size $16 \\times 16$."
0 commit comments