Merge pull request #4 from pedropark99/truncate-cells

Add new scripts for truncating chunk results
pedropark99 · Apr 4, 2023 · ef4f17c
2 parents fdc42af + fb05eb0
commit ef4f17c
Showing 15 changed files with 8,853 additions and 349 deletions.
diff --git a/.gitignore b/.gitignore
@@ -15,4 +15,6 @@ metastore_db
 Chapters/metastore_db
 
 Chapters/*.html
-Chapters/*/*
+Chapters/*/*
+
+Scripts/__pycache__/
diff --git a/Chapters/04-dataframes.qmd b/Chapters/04-dataframes.qmd
@@ -63,29 +63,14 @@ Like any python class, the `DataFrame` class comes with multiple methods that ar
 
 As an example, in the code below I expose all the available methods from this `DataFrame` class. First, I create a Spark DataFrame with `spark.range(5)`, and, store it in the object `df5`. After that, I use the `dir()` function to show all the methods that I can use through this `df5` object:
 
-```{python}
-#| include: false
-import sys 
-import os
-sys.path.append(os.path.abspath("./../Scripts/"))
-from print_big_list import print_big_list
-```
-
 
 ```{python}
-#| eval: false
+#| eval: true
 df5 = spark.range(5)
 available_methods = dir(df5)
 print(available_methods)
 ```
 
-```{python}
-#| echo: false
-df5 = spark.range(5)
-available_methods = dir(df5)
-print_big_list(available_methods)
-```
-
 
 All the methods present in this `DataFrame` class, are commonly referred as the *DataFrame API of Spark*. Remember, this is the most important API of Spark. Because much of your Spark applications will heavily use this API to compose your data transformations and data flows [@chambers2018].