DeepRank · gcroci2 · Jul 5, 2024 · Jul 1, 2024 · Jul 1, 2024 · Jul 5, 2024
diff --git a/docs/getstarted.md b/docs/getstarted.md
@@ -391,6 +391,8 @@ output_test = pd.read_hdf(os.path.join("<output_folder_path>", "output_exporter.
 
 The dataframes contain `phase`, `epoch`, `entry`, `output`, `target`, and `loss` columns, and can be easily used to visualize the results.
 
+For classification tasks, the `output` column contains the results from a [softmax function](https://pytorch.org/docs/stable/generated/torch.nn.functional.softmax.html). This means each entry in the `output` column is a list with one element for each class. Each element represents the probability of that class occurring.
+
 Example for plotting training loss curves using [Plotly Express](https://plotly.com/python/plotly-express/):
 
 ```python

diff --git a/tutorials/training.ipynb b/tutorials/training.ipynb
@@ -420,12 +420,8 @@
       "metadata": {},
       "outputs": [],
       "source": [
-        "output_train = pd.read_hdf(\n",
-        "    os.path.join(output_path, f\"gnn_{task}\", \"output_exporter.hdf5\"), key=\"training\"\n",
-        ")\n",
-        "output_test = pd.read_hdf(\n",
-        "    os.path.join(output_path, f\"gnn_{task}\", \"output_exporter.hdf5\"), key=\"testing\"\n",
-        ")\n",
+        "output_train = pd.read_hdf(os.path.join(output_path, f\"gnn_{task}\", \"output_exporter.hdf5\"), key=\"training\")\n",
+        "output_test = pd.read_hdf(os.path.join(output_path, f\"gnn_{task}\", \"output_exporter.hdf5\"), key=\"testing\")\n",
         "output_train.head()"
       ]
     },
@@ -436,7 +432,11 @@
       "source": [
         "The dataframes contain `phase`, `epoch`, `entry`, `output`, `target`, and `loss` columns, and can be easily used to visualize the results.\n",
         "\n",
-        "For example, the loss across the epochs can be plotted for the training and the validation sets:\n"
+        "For classification tasks, as in the current tutorial, the `output` column contains the results from a [softmax function](https://pytorch.org/docs/stable/generated/torch.nn.functional.softmax.html). This means each entry in the `output` column is a list with one element for each class. Each element represents the probability of that class occurring.\n",
+        "\n",
+        "Here specifically, the `output` column contains a list with two elements, respectively representing the predicted probabilities that the data point is 0 (first element of the list, representing non-binder class) and 1 (second element of the list, representing binder class).\n",
+        "\n",
+        "The loss across the epochs can be plotted for the training and the validation sets:\n"
       ]
     },
     {
@@ -671,12 +671,8 @@
       "metadata": {},
       "outputs": [],
       "source": [
-        "output_train = pd.read_hdf(\n",
-        "    os.path.join(output_path, f\"cnn_{task}\", \"output_exporter.hdf5\"), key=\"training\"\n",
-        ")\n",
-        "output_test = pd.read_hdf(\n",
-        "    os.path.join(output_path, f\"cnn_{task}\", \"output_exporter.hdf5\"), key=\"testing\"\n",
-        ")\n",
+        "output_train = pd.read_hdf(os.path.join(output_path, f\"cnn_{task}\", \"output_exporter.hdf5\"), key=\"training\")\n",
+        "output_test = pd.read_hdf(os.path.join(output_path, f\"cnn_{task}\", \"output_exporter.hdf5\"), key=\"testing\")\n",
         "output_train.head()"
       ]
     },
@@ -767,7 +763,7 @@
       "name": "python",
       "nbconvert_exporter": "python",
       "pygments_lexer": "ipython3",
-      "version": "3.10.13"
+      "version": "3.10.12"
     },
     "orig_nbformat": 4
   },