Skip to content

Instantly share code, notes, and snippets.

@mvdbeek
mvdbeek / bwa_mem2_schema.json
Created March 9, 2026 10:47
BWA-MEM2 Galaxy tool parameter request schema
{
"$defs": {
"BatchDataInstance": {
"additionalProperties": false,
"properties": {
"src": {
"enum": [
"hda",
"ldda",
"hdca"
@mvdbeek
mvdbeek / plan.md
Created March 3, 2026 17:45
Plan: Fix Interrupted Celery set_meta Causes Stuck Non-Terminal Jobs (#20186)

Plan: Fix Interrupted Celery set_meta Causes Stuck Non-Terminal Jobs (#20186)

Problem Analysis

When metadata_strategy: directory_celery (or celery_extended) is configured, if the Celery process is interrupted (OOM killed, process restart, etc.) while executing a set_job_metadata task, jobs become permanently stuck in a non-terminal state (running) with no recovery mechanism.

Root Cause

The handler blocks forever on .get() when a worker dies.

@mvdbeek
mvdbeek / ISSUE_21642.md
Created February 26, 2026 12:58
Triage artifacts for Galaxy issue #21642 - Remote data fetch not respecting quota

Issue #21642: Fetching data from repositories does not seem to respect storage quota

State: OPEN Author: martenson Labels: area/backend, area/jobs, kind/bug Assignees: mvdbeek Comments: 1

Description

@mvdbeek
mvdbeek / migration.diff
Created February 24, 2026 10:23
26.0 db migratons
diff --git a/lib/galaxy/model/migrations/alembic/env.py b/lib/galaxy/model/migrations/alembic/env.py
index 98091912b80..c9db41cc84d 100644
--- a/lib/galaxy/model/migrations/alembic/env.py
+++ b/lib/galaxy/model/migrations/alembic/env.py
@@ -1,7 +1,7 @@
import logging
import re
+from collections.abc import Callable
from typing import (
- Callable,
@mvdbeek
mvdbeek / merged_test_output.md
Created February 24, 2026 10:15
IWC Workflow Test Results - Run 22339836776

Test Results (powered by Planemo)

Test Summary

@mvdbeek
mvdbeek / plan.md
Created February 23, 2026 15:42
Plan: Migrate `data_input` Forms to Client-Side

Plan: Migrate data_input Forms to Client-Side

Context

Galaxy's workflow engine currently defines input forms server-side for data_input steps, requiring server round-trips and complex translation code (step_state_to_tool_state, _parse_state_into_dict). PR #19313 successfully migrated data_collection_input to client-side rendering. This plan extends that pattern to data_input, which is simpler since it manages only three parameters: optional, format, and tag.

Implementation

1. Server: Empty get_inputs() for InputDataModule

@mvdbeek
mvdbeek / fetch_small_accessions.py
Created February 13, 2026 19:40
ENA/Galaxy stress test: fetch random small accessions, upload via ascp, run cat1
#!/usr/bin/env python3
"""
Fetch 100 random small (<500MB) SRR/ERR accessions from ENA and write their FTP links to a file.
"""
import csv
import io
import random
import sys
import urllib.request
@mvdbeek
mvdbeek / rnaseq_de.yml
Created December 22, 2025 16:32
rnaseq_de_sample
rnaseq_de:
workflow_id: 8cd7bdc8d3cf35cd
workflow_target_type: stored_workflow
request_state:
Sample sheet of sequencing reads:
class: Collection
collection_type: sample_sheet:paired
name: RNA-Seq DE sample sheet
elements:
- class: Collection

Aspera ascp File Source Implementation Plan

Overview

This document outlines the implementation plan for creating a Galaxy file source plugin that uses Aspera ascp for high-speed file downloads. The implementation will be a configured plugin (not stock) with download-only functionality, using a custom fsspec filesystem.

Requirements Summary

  • Plugin Type: Configured plugin only (requires explicit configuration)
  • Features: Download-only (no upload or browsing)
@mvdbeek
mvdbeek / ld-markup.json
Created November 5, 2025 09:50
json-ld
{
"@context": "http://schema.org",
"@type": "LearningResource",
"http://purl.org/dc/terms/conformsTo": {
"@id": "https://bioschemas.org/profiles/TrainingMaterial/1.0-RELEASE",
"@type": "CreativeWork"
},
"audience": {
"@type": "EducationalAudience",
"educationalRole": "Students"