Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

News item ChatGPT #2750

Closed
wants to merge 1 commit into from
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
56 changes: 56 additions & 0 deletions content/news/2024-09-02-ChatGPT/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,56 @@
---
title: 'Using Large Language Models in complex workflows'
date: '2024-09-02'
tease: "Use ChatGPT in your analysis on the European Galaxy Server to leverage the Large Language Model in your automated workflows"
hide_tease: false
authors: Arash Kadkhodaei, Daniela Schneider
subsites: [global, eu, us, freiburg]
bgruening marked this conversation as resolved.
tags: [tool]
---

## Using Large Language Models in complex workflows

The amount of data researchers face is growing daily. Finding ways of tackling the information overflow to get and analyse relevant data effectively is more critical than ever. Large Language Models (LLMs) like GPT can help in this endeavour and (with good prompting) analyse text data flexibly and effectively.
To help do so, GPT is now available on [Galaxy](https://usegalaxy.eu/), an open-source platform for FAIR data analysis. This enables users to incorporate the LLM into more complex, automatable workflows and analyses. Galaxy Europe contains over 3,500 tools that users can combine in workflows to cover digital analysis without programming skills or expensive hardware. Users can share all steps and results of the analysis according to the FAIR principles, facilitating high reproducibility, transparency and efficient research data management.


## Setting up GPT

To access GPT in Galaxy, create an OpenAI account [here](https://platform.openai.com/signup). If needed, [add API credits](https://platform.openai.com/settings/organization/billing) according to your usage. Go to https://platform.openai.com/api-keys to “+Create new secret key”, name it “Galaxy”, and click the green button “create secret key”. Copy this key to add the API into Galaxy.

![OpenAI_API_key-ezgif com-video-to-gif-converter](https://github.com/user-attachments/assets/9424a44d-11e3-4594-bca9-d759c67956bd)


Back in Galaxy, click “User” and then click on “Manage Information”. In the opened window, scroll down to “Your OpenAI API key”. Paste the API key you created on [OpenAI](https://platform.openai.com/api-keys) here and click “save”.


## Using GPT in Galaxy

Now you are set to use ChatGPT in Galaxy and can choose from different models:

| Model | Description |
| ------------- | ------------- |
| GPT-4o | For complex, multi-step tasks, with vision capabilities |
| GPT-4o mini | Affordable and intelligent small model for fast, lightweight tasks, with vision capabilities |
| GPT-4 Turbo and GPT-4 | The previous set of high-intelligence models, Turbo with vision capabilities |
| GPT-3.5 Turbo | A fast, inexpensive model for simple tasks |

You can also combine the ChatGPT tool in Galaxy with the other tools within a more complex workflow:

You could, for example, upload audio and video files from various sources to be automatically converted into text in Galaxy using [Whisper](https://usegalaxy.eu/?tool_id=whisper). After a short cleaning, ChatGPT can summarise the text, as visualised below.

![screenshot 171](https://github.com/user-attachments/assets/bd1a82d5-be79-464f-8f57-9fe5487e2abf)

The screenshot shows the integration of ChatGPT as a tool in Galaxy. The penal on the right side, the ‘History’, shows an example workflow. Here, a mp3-recording from Martin Luther King’s ‘I have a dream’ was converted into text, cleaned, and translated using ChatGPT. The prompt was adapted from [Prompting ChatGPT for Translation](https://doi.org/10.48550/arXiv.2403.00127).

Alternatively, the transcribed audio file could undergo Named Entity Recognition (NER), and the results could be visualised in the next step by other tools, such as [QGIS](https://usegalaxy.eu/root?tool_id=interactive_tool_qgis) directly in Galaxy.

Another application example is the direct retrieval of research data from Zenodo in Galaxy. For example, published interviews can be imported directly and prepared for ChatGPT. Using sentiment analysis, the LLM analyses the interviewees' attitudes, groups them and extracts suitable passages from the material. The output can then be cleaned up with Galaxy’s other text processing tools or in the interactive environment of [Jupyter Notebooks](https://usegalaxy.eu/?tool_id=interactive_tool_jupyter_notebook), which is another tool in Galaxy.


## Sharing your analysis and data

The compiled workflow, the individual analysis steps and the data of each sub-step can be shared and repeated at any time for good scientific practice, simplifying research data management and reinforcing FAIR analysis. With the help of the [Galaxy Training Network](https://training.galaxyproject.org/), users can create an open-source tutorial generated out of their workflow to explain and share their analysis with other users.

Curious? What is your use case with GPT in Galaxy going to be?
Loading