mzpeak.org examples Study Design Embedding

🧬 Study Design Embedding7 datasets · 152.1 GB

Proteomics & metabolomics studies shipping an SDRF / ISA-Tab sample annotation, kept alongside the original vendor RAW and the mzML → mzPeak conversions.

Provenance. HUPO-PSI / bigbio community-curated SDRF annotations over PRIDE (PXD009465 · PXD009909 · PXD011799 · PXD014145 · PXD020187) and MetaboLights (MTBLS1129 SDRF · MTBLS5358 native ISA-Tab) studies — label-free plus TMT 6/10/11-plex isobaric designs. The full chain — SDRF/ISA metadata + vendor RAW + mzML + mzPeak — is stored here; each dataset's urls.txt records the original source.

Raw / mzML / mzPeak sizes for Study Design Embedding
Size vs vendor RAW (=100%) across 7 dataset(s) > 50 MB · bars = mean, dots = runs.
MTBLS1129MetaboLights MTBLS1129 · label-free metabolomics (Waters Xevo G2-XS; Cai et al. 2020) — clean SDRF↔mzML pair.8 files
Raw n.a., mzML 244.5 MB, mzPeak 134.2 MB (n.a./55%)
MTBLS5358MetaboLights MTBLS5358 · label-free GC-MS oral-cancer metabolomics — native ISA-Tab (i_/s_/a_; Wang et al. 2024).62 files
raw 621.3 MB, mzML 1.2 GB, mzPeak 477.1 MB (77%/40%)
PXD009465PRIDE PXD009465 · TMT 6-plex Plasmodium falciparum PfPK7 phosphoproteome (LTQ Orbitrap Velos; Pease et al. 2018).33 files
raw 1.4 GB, mzML 1.8 GB, mzPeak 560.5 MB (38%/31%)
PXD009909PRIDE PXD009909 · label-free mouse retina proteome (Orbitrap Fusion; Harman et al. 2018).40 files
raw 9.6 GB, mzML 26.5 GB, mzPeak 4.9 GB (51%/19%)
PXD011799PRIDE PXD011799 · TMT 10-plex melanoma B cells (Orbitrap Fusion Lumos; Griss et al. 2019) — the TMT channel-model fixture.77 files
raw 15.0 GB, mzML 35.0 GB, mzPeak 8.3 GB (55%/24%)
PXD014145PRIDE PXD014145 · TMT 11-plex KMT9 lung cancer (Q Exactive; Baumert et al. 2020).39 files
raw 6.4 GB, mzML 14.9 GB, mzPeak 3.7 GB (58%/25%)
PXD020187PRIDE PXD020187 · label-free decellularized umbilical artery (LTQ Orbitrap Elite; Mallis et al. 2020).33 files
raw 5.8 GB, mzML 12.7 GB, mzPeak 2.9 GB (51%/23%)