Commit

add automatic github cloning
anuprulez committed May 24, 2024
1 parent d74f699 commit e3b5daf
Showing 1 changed file with 3 additions and 15 deletions.
18 changes: 3 additions & 15 deletions topics/statistics/tutorials/fine_tuning_protTrans/tutorial.md
@@ -60,30 +60,18 @@ Dephosphorylation is a biochemical process (post-translational modification) inv
> <hands-on-title>GPU-enabled Interactive Jupyter Notebook for Machine Learning</hands-on-title>
>
> - {% tool [GPU-enabled Interactive Jupyter Notebook for Machine Learning](interactive_tool_ml_jupyter_notebook) %}
> - *"Do you already have a notebook?"*: `Start with default notebooks`
> - *"Do you already have a notebook?"*: `Start with a code repository`
> - *"Online code repository (Git-based) URL"*: `https://github.com/anuprulez/fine-tune-protTrans-repository`
> - Click *"Run Tool"*
>
> > <comment-title></comment-title>
> > The above step automatically fetches the notebook and datasets from the provided GitHub URL and starts a JupyterLab session.
> > If you do not have access to this resource in Galaxy Europe, please apply for it at: [Access GPU-JupyterLab](http://usegalaxy.eu/gpu-request). It may take a day or two to receive access.
> >
> {: .comment}
{: .hands_on}


### Fetch notebook and protein sequences

> <hands-on-title>Fetch data from Zenodo</hands-on-title>
>
> 1. Create a new folder named `fine-tuning` alongside the existing folders such as "data", "outputs" and "elyra". Any other folder name works as well.
> 2. Inside the new folder, clone the code repository by clicking on the "Git" icon.
> 3. In the popup, enter the repository URL shown below and then click on "clone":
> ```
> https://github.com/anuprulez/fine-tune-protTrans-repository
> ```
>
{: .hands_on}
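
The manual clone steps above can also be scripted from a notebook cell or a terminal. The following is a minimal sketch, not part of the tutorial itself: it assumes `git` is installed and the machine has network access, and the helper names `target_path`, `build_clone_command` and `clone_repository` are hypothetical. Only the repository URL and the `fine-tuning` folder name come from the steps above.

```python
import subprocess
from pathlib import Path

# Repository URL taken from the tutorial; all helper names below are illustrative.
REPO_URL = "https://github.com/anuprulez/fine-tune-protTrans-repository"


def target_path(repo_url: str, parent: str) -> Path:
    """Return the folder the repository is cloned into: <parent>/<repository name>."""
    repo_name = repo_url.rstrip("/").split("/")[-1]
    return Path(parent) / repo_name


def build_clone_command(repo_url: str, target: Path) -> list[str]:
    """Return the argument list for `git clone <repo_url> <target>`."""
    return ["git", "clone", repo_url, str(target)]


def clone_repository(repo_url: str = REPO_URL, parent: str = "fine-tuning") -> Path:
    """Clone repo_url under parent, skipping the clone if the folder already exists."""
    target = target_path(repo_url, parent)
    if not target.exists():
        # Create the parent folder (step 1 of the manual instructions).
        target.parent.mkdir(parents=True, exist_ok=True)
        # Run the clone (steps 2 and 3); check=True raises if git fails.
        subprocess.run(build_clone_command(repo_url, target), check=True)
    return target


# Uncomment to run the clone (needs git and network access):
# clone_repository()
```

Calling `clone_repository()` with its defaults would place the repository under `fine-tuning/fine-tune-protTrans-repository`. Checking for the folder first makes the cell safe to re-run.
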
## Fine-tuning notebook
From the cloned repository, open the `fine-tune-protTrans-dephophorylation.ipynb` notebook. The notebook contains all the scripts needed to process protein sequences, create and configure a protein large language model, train it on the protein sequences, evaluate it on the test protein sequences and visualise the results. Let's look at these key steps of fine-tuning.
