DOC-753 | Graph ML UI #709

bluepal-thirumala-thotapalli · 2025-06-10T07:11:58Z

Description

Upstream PRs

3.10:
3.11:
3.12:
3.13:

arangodb-docs-automation · 2025-06-10T07:12:01Z

Deploy Preview Available Via
https://deploy-preview-709--docs-hugo.netlify.app

Simran-B · 2025-06-11T08:02:30Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+title: ArangoGraphML Web Interface
+menuTitle: ArangoGraphML Web Interface


Title to be discussed (we might rename it to just GraphML)

Simran-B · 2025-06-11T08:04:15Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

This shouldn't be the same name twice, but I'm not settled on a particular name. Maybe just ui.md?

Simran-B · 2025-06-11T08:07:12Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+aliases:
+  - getting-started-with-arangographml
+---
+Solve high-computational graph problems with Graph Machine Learning. Apply ML on a selected graph to predict connections, get better product recommendations, classify nodes, and perform node embeddings. Configure and run the whole machine learning flow entirely in the web interface.


We only have node classification and embeddings available as immediate options. If we mention something like link predictions, we should at least outline how to achieve that.

It would also be good to have a more technical explanation here of how GraphML works (GraphSAGE, using a depth-2 neighborhood, as mentioned in the team's Slack channel).

Please also add an overview of the process instead of starting immediately with project creation etc. Users should first get an understanding of the hierarchy and the steps involved.
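
To illustrate what "depth 2 neighborhood" means for such an explanation, here is a minimal sketch of GraphSAGE-style mean aggregation. It is not the GraphML implementation: there are no learned weights or non-linearities, and the names `sage_layer` and `embed` as well as the toy graph are hypothetical.

```python
from typing import Dict, List

Graph = Dict[str, List[str]]        # node -> list of neighbor nodes
Features = Dict[str, List[float]]   # node -> feature vector


def mean(vectors: List[List[float]]) -> List[float]:
    """Element-wise mean of equally sized vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def sage_layer(graph: Graph, feats: Features) -> Features:
    """One simplified layer: concatenate a node's own vector with the
    mean of its neighbors' vectors (no weights, no activation)."""
    out: Features = {}
    for node, own in feats.items():
        neighbors = [feats[n] for n in graph.get(node, [])]
        aggregated = mean(neighbors) if neighbors else [0.0] * len(own)
        out[node] = own + aggregated
    return out


def embed(graph: Graph, feats: Features, depth: int = 2) -> Features:
    """Stack `depth` layers, so each node sees its `depth`-hop neighborhood."""
    for _ in range(depth):
        feats = sage_layer(graph, feats)
    return feats


# Toy path graph a - b - c (assumed data, for illustration only).
graph = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
feats = {"a": [1.0], "b": [2.0], "c": [4.0]}
embeddings = embed(graph, feats, depth=2)
```

With depth 1, the vector for `a` only depends on `a` and `b`. With depth 2, it also depends on `c`, which is two hops away.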

Simran-B · 2025-06-11T08:11:30Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+
+To create a new GraphML project using the ArangoDB Web Interface, follow these steps:
+
+- **Select the Target Database** – From the **Database** dropdown in the left-hand sidebar, select the database where the project should reside.


These are steps that should be followed in order, so use an ordered list here.
dropdown -> dropdown menu (or simply just write to select the database without mentioning the specific widget type)

Simran-B · 2025-06-11T08:12:35Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+To create a new GraphML project using the ArangoDB Web Interface, follow these steps:
+
+- **Select the Target Database** – From the **Database** dropdown in the left-hand sidebar, select the database where the project should reside.
+- **Navigate to the Data Science Section** – In the left-hand navigation menu, click on Data Science to open the GraphML project management interface, then click on RunGraphML.


Should we call it the Data Science Suite perhaps?
click on Data Science -> click **Data Science**
RunGraphML -> **Run GraphML**

Simran-B · 2025-06-11T08:19:50Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+{{< info >}}
+The following attributes cannot be used: imdb_feat_description, imdb_feat_genre, imdb_feat_homepage, imdb_feat_id, imdb_feat_imageUrl, imdb_feat_imdb_x_hash, imdb_feat_imdbId, imdb_feat_label, imdb_feat_language, imdb_feat_lastModified, imdb_feat_released, imdb_feat_releaseDate, imdb_feat_runtime, imdb_feat_studio, imdb_feat_tagline, imdb_feat_title, imdb_feat_trailer, imdb_feat_type, imdb_feat_version, imdb_x, imdb_y, prediction_model_output. As some of their values are lists or arrays.
+{{< /info >}}


It's fine to mention that certain attributes are not eligible for GraphML, but there shouldn't be a list of attributes here that is specific to the dataset, graph, and GraphML project. Users will not have these attributes on the first run, and the attributes will differ depending on the dataset, graph, and project.
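
A generic way to convey the rule without a dataset-specific list could be a small check like the one below. This is only a sketch under the assumption, taken from the draft text, that attributes are ineligible when some of their values are lists or arrays. The function name and sample documents are hypothetical.

```python
from typing import Any, Dict, Iterable, List


def list_valued_attributes(docs: Iterable[Dict[str, Any]]) -> List[str]:
    """Return the sorted names of attributes that hold a list or tuple
    in at least one of the given documents."""
    found = set()
    for doc in docs:
        for key, value in doc.items():
            if isinstance(value, (list, tuple)):
                found.add(key)
    return sorted(found)


# Made-up sample documents, not taken from any real dataset.
docs = [
    {"title": "A", "genres": ["drama"]},
    {"title": "B", "runtime": 90, "coords": [1, 2]},
]
ineligible = list_valued_attributes(docs)
```

Here, `genres` and `coords` would be flagged, while `title` and `runtime` remain usable.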

Simran-B · 2025-06-11T08:21:21Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+- **Batch size** – The number of documents to process in a single batch.
+- **Run analysis checks** – Whether to run analysis checks to perform a high-level analysis of the data quality before proceeding. Default is `true`.
+- **Skip labels** – Skip the featurization process for attributes marked as labels. Default is `false`.
+- **Overwrite FS graph** – Whether to overwrite the Feature Store graph if features were previously generated. Default is `false`, so features are written to an existing graph.
+- **Write to source graph** – Whether to store the generated features in the source graph. Default is `true`.
+- **Use feature store** – Enable the use of the Feature Store database, which stores features separately from the source graph. Default is `false`, so features are written to the source graph.


There should be a reasonable amount of additional explanation beyond the available labels and tooltips in the UI to add value.
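
As a starting point for such an explanation, the defaults stated in the draft can be summarized as in the sketch below. The field names are hypothetical (they are not the actual UI or API keys), and `feature_targets` reflects my reading of how **Write to source graph** and **Use feature store** interact, which needs to be verified.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FeaturizationOptions:
    """Defaults as stated in the draft; names are illustrative only."""
    batch_size: Optional[int] = None    # the draft states no default
    run_analysis_checks: bool = True
    skip_labels: bool = False
    overwrite_fs_graph: bool = False
    write_to_source_graph: bool = True
    use_feature_store: bool = False


def feature_targets(opts: FeaturizationOptions) -> List[str]:
    """Where generated features end up (assumed semantics, unverified)."""
    targets = []
    if opts.write_to_source_graph:
        targets.append("source graph")
    if opts.use_feature_store:
        targets.append("feature store")
    return targets


defaults = FeaturizationOptions()
```

With the defaults, features would only be written to the source graph, which matches the draft's description of **Use feature store**.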

Simran-B · 2025-06-11T08:22:21Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+
+This is the second step in the ML workflow after featurization. In the training phase, you configure and launch a machine learning training job on your graph data.
+
+#### Select Type of Training Job


This shouldn't be a headline, especially not at the same level as the GraphML tasks.

Simran-B · 2025-06-11T08:24:42Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+
+## Prediction Phase
+
+Once the best-performing model has been selected, the final step of the GraphML pipeline is to generate predictions for new or unlabeled data


As I explained, we don't have the capability to process only new/unlabeled data.

Simran-B · 2025-06-11T08:25:38Z

site/content/3.13/data-science/arangographml/arangograph-ml.md

+
+### Overview
+
+The Prediction interface allows inference to be run using the selected model. It enables configuration of how predictions are executed, which collections are involved, and whether new or outdated documents should be automatically featurized before prediction.


We should add a statement about how featurizing new or outdated documents affects quality.

Thirumala added 3 commits June 9, 2025 20:48

create md file for graph ml ui

21ec540

Reduce the size of images

8c818b5

Changes in documentation

aeb5c96

bluepal-thirumala-thotapalli requested a review from Simran-B June 10, 2025 07:11

bluepal-thirumala-thotapalli self-assigned this Jun 10, 2025

Simran-B changed the title ~~Doc 753~~ DOC-753 | Graph ML UI Jun 10, 2025

Simran-B requested changes Jun 11, 2025
