data-inspect-availability¶
data-inspect-availability is the inspection step that checks which Cell Painting data sources are available before any download begins.
It tells you which dataset the project is currently pointed at and which discovery routes are ready to use.
Purpose¶
Use this skill when you want:
see which dataset and source the current run is using
check whether downloadable data can be found before starting a download
Main Outcome¶
This skill does not download any data. It tells you what this run is currently pointed at and whether the configured discovery routes are working.
In the demo setup, it confirms:
the default dataset
the default source
whether Gallery discovery works
whether Quilt discovery works
whether CPGData inventory discovery works
Inputs¶
This skill reads:
a project config such as
configs/project_config.demo.jsonthe data-access settings inside that config
an optional output directory
In the demo config, those settings already define the default dataset, default source, and the configured discovery routes.
Outputs¶
This skill writes:
data_access_summary.jsonThe main inspection result. It records the configured dataset, source, and discovery results.pipeline_skill_manifest.jsonThe run record for this skill invocation.
Agent Demo¶
This page is based on a real local OpenClaw turn against the repository demo config:
recorded on
2026-05-06 13:31 UTCmodel:
vibe/gpt-5-mini
Request¶
Before I download anything, I want to confirm which Cell Painting dataset this demo is pointing to and whether the configured source is actually reachable.
Structured Trace¶
user_input:
Before I download anything, I want to confirm which Cell Painting dataset this demo is pointing to and whether the configured source is actually reachable.
recorded_turn:
- the saved OpenClaw turn answered this request directly
- the reply confirmed the configured dataset and source
- the reply reported that discovery and planning routes were reachable in the current environment
final_answer_snapshot:
The demo points to cpg0016-jump / source_4, and the configured source is reachable for discovery and download planning.
Observed Result¶
The recorded turn confirmed that the demo config is currently pointed at:
dataset
cpg0016-jumpsource
source_4
The recorded reply also confirmed that the current environment could reach:
Gallery / S3 listing
CPGData inventory discovery
the configured Quilt adapter path
download planning for the configured dataset/source pair
The reply explicitly mentioned these installed access tools:
boto3botocorecpgdataquilt3awsCLI
Recorded Reply¶
I confirmed the demo is configured to point to:
- Dataset: `cpg0016-jump`
- Source: `source_4`
So the configured default target is:
- `cpg0016-jump/source_4/`
Is the configured source reachable?
Yes — the configured source is reachable through the data-access layer I checked.
What I verified
- the Cell Painting Gallery route is working
- dataset `cpg0016-jump` is present
- source `source_4` is present under that dataset
- a download plan can be created successfully for that dataset/source pair
Access routes
- Gallery / S3 listing: working
- cpgdata inventory route: working
- Quilt adapter: available and did not error in the summary check
Environment readiness
- `boto3`
- `botocore`
- `cpgdata`
- `quilt3`
- `aws` CLI
Bottom line
- the demo points to `cpg0016-jump / source_4`
- the configured source is reachable
- the environment can discover and plan downloads from it successfully