tweak notebook text

iamdonovan · iamdonovan · commit bc102adbe656 · 2025-01-03T15:17:08.000Z
diff --git a/04.pandas/pandas.ipynb b/04.pandas/pandas.ipynb
@@ -127,7 +127,7 @@
     "\n",
     "Here, we use an **f-string** to combine the `station` variable (which takes on a value from the `new_stations` **list** on each pass through the loop) with `'data.csv'`, so that the resulting file names will be `'oxforddata.csv'`, `'southamptondata.csv'`, and `'stornowaydata.csv'`. We then use `Path()` to combine this with the `'data'` directory name, so that the value of `fn_data` is the complete relative path to each file.\n",
     "\n",
-    "Next, we use `pd.read_csv()` to read in the file, and add a `station` variable to the table, just like we did with the Armagh data. \n",
+    "Next, we again use `pd.read_csv()` to read in the file, and add a `station` variable to the table, just like we did with the Armagh data. \n",
     "\n",
     "Finally, we use `pd.concat()` ([documentation](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.concat.html)) to combine the existing table, `station_data`, with the newly loaded table (`data`), and overwrite the value of `station_data` with this combined table:\n",
     "\n",
@@ -136,7 +136,7 @@
     "\n",
     "```\n",
     "\n",
-    "Each time through the **for** loop, the value of `station` is updated:"
+    "Remember that each time through the **for** loop, the value of `station` is updated - so on the first pass, the value of `station` will be `'oxford'`, on the second pass it will be `'southampton'`, and on the final pass it will be `'stornoway'`:"
    ]
   },
   {
@@ -161,7 +161,7 @@
    "id": "325becd3-fb17-48c4-9314-59f225aa8239",
    "metadata": {},
    "source": [
-    "Note that this is one advantage of using clear, consistent naming and formatting for data files - we can easily write a loop to load multiple files, instead of having to write individual paths.\n",
+    "Note that this is one advantage of using **clear, consistent naming and formatting for data files** - it means that we can easily write a loop to load multiple files, instead of having to write individual paths or treat each file differently!\n",
     "\n",
     "## selecting rows using expressions\n",
     "\n",
@@ -185,7 +185,7 @@
    "id": "ff430bcf-9e94-4fb6-9931-6b1fde1ef64e",
    "metadata": {},
    "source": [
-    "If we want to use multiple conditions - for example, all observations where the monthly maximum temperature is greater than 20°C, and the monthly rainfall is grater than 100 mm, we can't simply use the `&` operator with the two statements:"
+    "Remember that if we want to use multiple conditions - for example, all observations where the monthly maximum temperature is greater than 20°C and the monthly rainfall is greater than 100 mm - we can't simply use the `&` operator with the two statements:"
    ]
   },
   {
@@ -955,7 +955,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "sample = station_data \\\n",
+    "station_data \\\n",
     "    .groupby('station') \\\n",
     "    .sample(5)"
    ]
@@ -1004,7 +1004,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.11.6"
+   "version": "3.10.15"
   }
  },
  "nbformat": 4,