Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Newsitem GPT #2751

Merged
merged 7 commits into from
Sep 9, 2024
Merged
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 6 additions & 4 deletions content/news/2024-09-02-ChatGPT/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@ tags: [tool]
## Using Large Language Models in complex workflows

The amount of data researchers face is growing daily. Finding ways of tackling the information overflow to get and analyse relevant data effectively is more critical than ever. Large Language Models (LLMs) like GPT can help in this endeavour and (with good prompting) analyse text data flexibly and effectively.
To help do so, GPT is now available on [Galaxy](https://usegalaxy.eu/), an open-source platform for FAIR data analysis. This enables users to incorporate the LLM into more complex, automatable workflows and analyses. Galaxy Europe contains over 3,500 tools that users can combine in workflows to cover digital analysis without programming skills or expensive hardware. Users can share all steps and results of the analysis according to the FAIR principles, facilitating high reproducibility, transparency and efficient research data management.
To help do so, [GPT](https://usegalaxy.eu/root?tool_id=toolshed.g2.bx.psu.edu/repos/bgruening/chatgpt_openai_api/chatgpt_openai_api/2024+galaxy1) is now available on [Galaxy](https://usegalaxy.eu/), an open-source platform for FAIR data analysis. This enables users to incorporate the LLM into more complex, automatable workflows and analyses. Galaxy Europe contains over 3,500 tools that users can combine in workflows to cover digital analysis without programming skills or expensive hardware. Users can share all steps and results of the analysis according to the FAIR principles, facilitating high reproducibility, transparency and efficient research data management.
Sch-Da marked this conversation as resolved.
Show resolved Hide resolved


## Setting up GPT
Expand All @@ -30,7 +30,7 @@ Now you are set to use ChatGPT in Galaxy and can choose from different models:

| Model | Description |
| ------------- | ------------- |
| GPT-4o | For complex, multi-step tasks, with vision capabilities |
| GPT-4o | For complex, multi-step tasks, model with vision capabilities |
| GPT-4o mini | Affordable and intelligent small model for fast, lightweight tasks, with vision capabilities |
| GPT-4 Turbo and GPT-4 | The previous set of high-intelligence models, Turbo with vision capabilities |
| GPT-3.5 Turbo | A fast, inexpensive model for simple tasks |
Expand All @@ -41,7 +41,7 @@ You could, for example, upload audio and video files from various sources to be

![ChatGPT tool](https://github.com/user-attachments/assets/bd1a82d5-be79-464f-8f57-9fe5487e2abf)

The screenshot shows the integration of ChatGPT as a tool in Galaxy. The penal on the right side, the ‘History’, shows an example workflow. Here, a mp3-recording from Martin Luther King’s ‘I have a dream’ was converted into text, cleaned, and translated using ChatGPT. The prompt was adapted from [Prompting ChatGPT for Translation](https://doi.org/10.48550/arXiv.2403.00127).
The screenshot shows the integration of ChatGPT as a tool in Galaxy. The panel on the right side, the ‘History’, shows an example workflow. Here, a mp3-recording from Martin Luther King’s ‘I have a dream’ was converted into text, cleaned, and translated using ChatGPT. The prompt was adapted from [Prompting ChatGPT for Translation](https://doi.org/10.48550/arXiv.2403.00127).

Alternatively, the transcribed audio file could undergo Named Entity Recognition (NER), and the results could be visualised in the next step by other tools, such as [QGIS](https://usegalaxy.eu/root?tool_id=interactive_tool_qgis) directly in Galaxy.

Expand All @@ -50,6 +50,8 @@ Another application example is the direct retrieval of research data from Zenodo

## Sharing your analysis and data

The compiled workflow, the individual analysis steps and the data of each sub-step can be shared and repeated at any time for good scientific practice, simplifying research data management and reinforcing FAIR analysis. With the help of the [Galaxy Training Network](https://training.galaxyproject.org/), users can create an open-source tutorial generated out of their workflow to explain and share their analysis with other users.
The compiled workflow, the individual analysis steps and the data of each sub-step can be shared and repeated at any time for good scientific practice, simplifying research data management and reinforcing FAIR analysis. With the help of the [Galaxy Training Network](https://training.galaxyproject.org/), users can create an open-source tutorial generated out of their workflow to explain and share their analysis with other users. For an example, see here:



Curious? What is your use case with GPT in Galaxy going to be?