
Galaxy Europe for Earth System subdomain

Ifremer

As part of FAIR-EASE, the Galaxy Europe Earth System subdomain ([earth-system.usegalaxy.eu]) was created to serve the needs of environmental scientists working with marine, atmospheric, land, and biodiversity data.

Figure 1: Earth System subdomain welcome page

It supports a wide range of scientific and operational needs, and strengthens links with European data infrastructures such as Copernicus, CMEMS and OBIS.

These tools help make environmental data more accessible and usable by:

Tools integrated

These tools show how FAIR-EASE helps bridge the gap between data providers and users, making complex data more usable for science, policy, and operational services.

Figure 2: ODV interactive tool in Galaxy

Tools for Oceanographic Data

| Tool | Description | Links |
| --- | --- | --- |
| Argo_getdata | Allows retrieval of Argo glider data (physical and biogeochemical). | GitHub content, GitHub Galaxy tool, Galaxy PR |
| DIVA_full_analysis | Implemented as both batch and interactive tools, this module enables advanced spatial interpolation of marine data. | GitHub content, GitHub Galaxy tool, Batch PR, GxIT PR |
| Copernicus Marine Data Store (copernicusmarine) | Batch tool to query and download datasets from CMEMS. | GitHub content, GitHub Galaxy tool, Galaxy PR |
| Ocean Data View | Used to plot geo-referenced ocean data from NetCDF and other formats. | GitHub Galaxy tool, GxIT PR |
| ODV collection manager (tool_odv) | Merges various datasets that share a common vocabulary and automatically creates a single generic ODV spreadsheet. | GitHub content, GitHub Galaxy tool |
| ODV history manager (tool_odv_history) | Reports the ODV history, including changes of QC flags, in the input file. | GitHub content, GitHub Galaxy tool |
| Canyon B (bgc_canyonb) | Robust estimation of open ocean CO2 variables and nutrient concentrations from T, S, and O2 data using a Bayesian neural network. | GitHub content, GitHub Galaxy tool, Conda recipe, Galaxy PR |
| SanntiS (sanntis_marine) | Identifies biosynthetic gene clusters (BGCs) in genomic and metagenomic data. | GitHub content, GitHub Galaxy tool, Galaxy PR, Galaxy PR 2 |
| QCV Harmonizer (harmonize_insitu_to_netcdf) | Harmonizes oceanographic biogeochemical data. | GitHub content, GitHub Galaxy tool, Galaxy PR |

Tools for Interactive Visualisation and Data Handling

| Tool | Description | Links |
| --- | --- | --- |
| QGIS | Full integration of QGIS as an interactive tool in Galaxy. | GitHub Galaxy tool, GxIT PR |
| HoloViz Ecosystem | A set of interactive notebooks for data visualisation using Python libraries such as Panel, Bokeh, and Datashader. | GitHub Galaxy tool, GxIT PR |
| STAC Browser | Access and navigation interface for STAC (SpatioTemporal Asset Catalogs). | GitHub content, GxIT PR |
| TerriaMap | Geospatial visualisation. | GitHub content, GxIT |

Access to Global and Biodiversity Data

| Tool | Description | Links |
| --- | --- | --- |
| OBIS occurrences (obis_data) | A tool to search and retrieve species occurrences from the OBIS database. | GitHub content, GitHub Galaxy tool, Galaxy PR |
| Copernicus Data Space Ecosystem | Jupyter notebooks to explore Copernicus data using SentinelHub and OpenEO. | GitHub content, GxIT PR |
| Trends.Earth | Batch tool for computing land cover and degradation indicators, supporting monitoring of SDG 15.3.1. | GitHub content, GitHub Galaxy tool, Galaxy PR |

Workflows developed and shared

Several Galaxy workflows have been developed and shared as part of FAIR-EASE to support Earth system science. These include workflows to process and analyse Argo float data, extract biogeochemical variables such as phosphate from large NetCDF datasets, and combine oceanographic data with tools such as ODV and the Pangeo ecosystem. All workflows are openly available on WorkflowHub ([FAIR-EASE Galaxy Project on WorkflowHub]) and can be imported into Galaxy by clicking the “Run on Galaxy” button on the WorkflowHub pages. They are designed for environmental scientists working with oceanographic and Earth system data, and are compatible with the [Galaxy Europe for Earth System instance].

In addition, these workflows were demonstrated during major scientific events such as EGU 2024, and training materials have been created to help new users. Tutorials are available through the Galaxy Training Network, showing how to run the workflows, understand the data, and use FAIR practices like RO-Crate to describe and share results.

Figure 3: Water Coastal Dynamics workflow

| Workflow | Description | Access |
| --- | --- | --- |
| Marine Omics: Biosynthetic Gene Clusters | Detects biosynthetic gene clusters in marine omics data using tools like Prodigal and SanntiS. | WorkflowHub |
| Marine Omics Visualisation (OBIS Indicators) | Converts OBIS biodiversity records into indicators such as Shannon and Simpson indices. | WorkflowHub |
| Process Argo Data with Pangeo & ODV | Processes Argo float data and visualizes oceanographic variables using Pangeo and Ocean Data View (ODV). | WorkflowHub |
| Subset Mediterranean Sea & Extract Phosphate | Subsets NetCDF data for the Mediterranean Sea and extracts phosphate levels for analysis. | WorkflowHub |
| Full Analysis of Argo Data | End-to-end workflow to analyse and visualise Argo profile datasets. | WorkflowHub |
| Argo-Glider Nitrate QCV | Qualification, calibration, and validation of ocean biogeochemical data from Argo floats and gliders using Galaxy. | Galaxy Earth System |
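
To give a sense of what the Shannon and Simpson indicators mentioned above measure, here is a minimal sketch in plain Python. It is not the code used by the OBIS indicators workflow. The sample records are hypothetical, and the workflow may report a different variant (for example 1 − D rather than D for Simpson).

```python
import math
from collections import Counter


def shannon_index(counts):
    """Shannon index H' = -sum(p_i * ln(p_i)) over species with non-zero counts."""
    total = sum(counts)
    return -sum((n / total) * math.log(n / total) for n in counts if n > 0)


def simpson_index(counts):
    """Simpson index D = sum(p_i ** 2); 1 - D is the Gini-Simpson diversity."""
    total = sum(counts)
    return sum((n / total) ** 2 for n in counts)


# Hypothetical occurrence records: one species name per record.
records = ["Abra alba", "Abra alba", "Mytilus edulis", "Mytilus edulis"]
counts = list(Counter(records).values())

print("Shannon H':", shannon_index(counts))
print("Simpson D:", simpson_index(counts))
```

Both functions take per-species record counts, so any table of occurrences grouped by species name can be fed in the same way.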

Galaxy Training Network (GTN)

Several high-quality, FAIR-aligned tutorials and learning pathways have been developed and published on the Galaxy Training Network (GTN). These resources are tagged with the “earth-system” label and cover a wide range of topics including marine biodiversity, oceanographic data processing, land monitoring, and FAIR metadata practices. Designed for interdisciplinary Earth and environmental scientists, they support hands-on learning with real datasets and workflows, and encourage reuse, openness, and reproducibility. All materials are licensed under CC-BY 4.0.

Figure 4: GTN example

| Title | Type | Description | Access |
| --- | --- | --- | --- |
| Getting your hands-on earth data | Learning Pathway | Introduction to accessing and analyzing ocean, land, atmosphere, and biodiversity data in Galaxy. | Run Tutorial |
| OBIS Marine Indicators | Tutorial | Calculate biodiversity indices (Shannon, Simpson, ES50) from OBIS. | Run Tutorial |
| From NDVI with OpenEO to time series with Holoviews | Tutorial | Process NDVI satellite data for land degradation analysis and time-series visualization. | Run Tutorial |
| Marine Omics: Identifying BGCs | Tutorial | Detect biosynthetic gene clusters using Prodigal, InterProScan, and SanntiS. | Run Tutorial |
| Ocean’s Variables Study | Tutorial | Subset Mediterranean ocean data and explore variables (e.g., phosphate). | Run Tutorial |
| Ocean Data View (ODV) | Tutorial | Visualize NetCDF-based oceanographic variables using ODV. | Run Tutorial |
| Sentinel 5P Data Visualisation | Tutorial | Explore and analyze Sentinel-5P atmosphere data interactively. | Run Tutorial |
| Analyse Argo Data | Tutorial | Process Argo float datasets with Pangeo tools and visualize them using ODV. | Run Tutorial |
| Make your tools available on your subdomain | Tutorial | Guide to managing tools in a Galaxy subdomain. | Run Tutorial |
| Create a subdomain for your community | Tutorial | Steps to create and administer a Galaxy subdomain for your community. | Run Tutorial |
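
The ES50 index listed for the OBIS Marine Indicators tutorial is the expected number of species in a random subsample of 50 records (Hurlbert's index). The sketch below illustrates the formula only and is not the tutorial's implementation. The function name and the choice to raise an error when fewer than `n` records are available are assumptions made for this example.

```python
from math import comb


def expected_species(counts, n=50):
    """Hurlbert's ES(n): expected number of species in a random subsample of n records.

    ES(n) = sum_i [1 - C(N - N_i, n) / C(N, n)], where N_i is the number of
    records for species i and N is the total number of records.
    """
    total = sum(counts)
    if total < n:
        # Assumption for this sketch: refuse to compute ES(n) on too few records.
        raise ValueError(f"need at least {n} records, got {total}")
    denom = comb(total, n)
    # comb(k, n) is 0 when k < n, so a species too abundant to be missed
    # in a subsample of size n contributes exactly 1.
    return sum(1 - comb(total - c, n) / denom for c in counts if c > 0)


# Hypothetical per-species record counts for one grid cell.
cell_counts = [120, 40, 25, 10, 3, 1, 1]
print("ES50:", expected_species(cell_counts, n=50))
```

Because ES50 is computed on a fixed subsample size, it allows comparison between areas with very different sampling effort, which raw species counts do not.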

Connecting IT resources

As part of FAIR-EASE, several actions were undertaken to integrate Pulsar as a distributed execution backend for Galaxy workflows. Pulsar endpoints were deployed on multiple infrastructures, including the University of Clermont Auvergne (UCA) in France and the Hellenic Centre for Marine Research (HCMR) in Greece, enabling Galaxy to run jobs on remote resources. A proof-of-concept with the EGI Federated Cloud was also carried out, demonstrating that Pulsar nodes can be deployed dynamically using the EGI Infrastructure Manager. These deployments were used to run real-world workflows, validating the portability, scalability, and interoperability of distributed processing in an Earth Analytical Lab (EAL) platform.

Figure 5: French Galaxy Pulsar endpoint at the UCA