Schneider et al. (2025): A Scalable Web-Based Platform for Proteomics Data Processing, Result Storage, and Analysis
ABSTRACT: The exponential increase in proteomics data presents critical challenges for
conventional processing workflows. These pipelines often consist of fragmented software
packages, glued together using complex in-house scripts or error-prone manual workflows
running on local hardware, which are costly to maintain and scale. The MSAID Platform
offers a fully automated, managed proteomics data pipeline, consolidating formerly
disjointed functions into unified, API-driven services that cover the entire process from raw
data to biological insights. Backed by the cloud-native search algorithm CHIMERYS, as well
as scalable cloud compute instances and data lakes, the platform facilitates efficient
processing of large data sets, automation of processing via the command line, systematic result storage, analysis, and visualization.
The data lake supports elastically growing storage and unified query capabilities, facilitating large-scale analyses and efficient reuse of
previously processed data, such as aggregating longitudinally acquired studies. Users interact with the platform via a web interface,
CLI client, or API, providing flexible, automated access. Readily available tools for accessing result data include browser-based
interrogation and one-click visualizations for statistical analysis. The platform streamlines research processes, making advanced and
automated proteomic workflows accessible to a broader range of scientists. The MSAID Platform is globally available via https://platform.msaid.io.
KEYWORDS: proteomics, platform, pipeline, CHIMERYS, compute infrastructure, data processing, cloud, AWS, scalable, SaaS
■ INTRODUCTION
Proteomics is an indispensable technology for the comprehensive identification and quantification of proteins, which are pivotal for understanding cellular functions and disease mechanisms. Over the past decade, there have been substantial advancements in the key components of proteomic workflows, including sample preparation techniques, liquid chromatography (LC), and mass spectrometry (MS) instrumentation.1,2 These improvements, particularly the advent of fast-scanning mass spectrometers, have significantly enhanced the sensitivity, comprehensiveness, and throughput of proteomic analyses.3 Consequently, researchers can now conduct large-scale proteomic studies that generate an unprecedented volume of raw data. However, this surge in data production presents substantial challenges for data processing pipelines generating protein identifications and associated quantitative information. The growing size of raw and result data and sheer number of mass spectrometry measurements that can be performed in a short period of time regularly exceed the capabilities of conventional on-premises compute infrastructure, particularly with respect to the demanded processing power and storage space.4,5

In parallel, recent years have seen a fast-paced development and improvement of software for proteomics data processing, fueled by the introduction of deep learning-based prediction of peptide properties.6−9 Academic software, such as MSFragger10 and rescoring concepts like Prosit,6 MSBooster,11 MS2Rescore,12 EncyclopeDIA,13 DeepDIA,14 AlphaDIA,15 DIA-NN,16 and commercial products like INFERYS,17 Spectronaut18 and CHIMERYS,19 have pushed the boundaries of data extraction, enabling deeper insights into complex proteomics data sets by leveraging fragment ion intensities. Together, the combination of instrumentation and more sensitive algorithms allows researchers to generate protein profiles to an unprecedented depth and throughput.
Figure 1. The MSAID Platform for proteomics comprises a cloud-native, microservices-based architecture, orchestrated by Kubernetes. It is hosted
on Amazon Web Services (AWS), utilizing Elastic Kubernetes Service (EKS). The platform supports multiple interfaces, including a web interface,
a command-line interface (CLI), and application programming interface (API) access for seamless user interaction. Uploaded raw and fasta files
are stored on AWS Simple Storage Service (S3). A relational database (RDS) manages the data lake and meta-attributes for files and processing
jobs. Scalable CHIMERYS workflows for data-dependent acquisition (DDA), data-independent acquisition (DIA) or parallel reaction monitoring
(PRM) data processing can be executed on AWS Elastic Compute Cloud (EC2) instances. The platform offers the option to continuously acquire
and search raw files, while raw-file-overarching postprocessing such as protein grouping is performed later without re-searching the data. Result data are
systematically stored as parquet files and can be interactively explored and visualized in the browser or downloaded via browser, CLI, or API for
further exploration.
However, in contrast to today's streamlined sample workflows in the wet lab leading up to the mass spectrometer, the data flow from the acquired raw data to the extraction of biological insights remains fragmented and often inefficient. Laboratories are confronted with a range of computational challenges as they attempt to process and analyze their data: frequently encountered manual workflows are not only time-consuming but also prone to errors and inconsistencies, particularly when repetitive tasks are involved. A data pipeline might involve the manual transfer of raw files from the acquisition computer to a storage medium, which may be a local personal computer, a laptop, or, in some cases, a network-attached storage (NAS) system, rarely a cloud-based service. Subsequent processing of the data with a proteomic search engine involves the use of local consumer hardware such as personal computers or, in some cases, high-performance servers. This reliance on local hardware inherently limits the scalability of proteomic studies, as the computational demands of large-scale analyses often exceed the capabilities of on-premises infrastructure or the scalability of the software package itself. Once processed, the results are usually manually moved to user- or project-specific directories and shuffled between different storages to avoid disk exhaustion, causing confusion and parallel systems of data organization, limiting accessibility to other researchers or data mining. The level of subsequent data interrogation and interpretation varies drastically depending on the researcher's skill set, with approaches ranging from basic spreadsheet analyses to more sophisticated bioinformatics tools (e.g., Perseus20) or scripting languages. Dedicated statistics and visualization suites like MSstats21 and Mass Dynamics22 can aid non-bioinformaticians in drilling down on their biological question, but also present standalone solutions, adding to a fragmented tooling landscape. The described patchwork of disconnected local infrastructures and applications creates highly redundant work streams, renders it difficult to automate processes, risks loss of data integrity, and hinders the generation of reproducible results. While custom scripts and pipelines might attenuate the manual labor in the process, these are often hastily developed, lack robustness, and are costly to maintain long-term. Bespoke in-house solutions frequently do not scale well with the size of projects, growing infrastructure, or team size, due to the effort of coordinating limited resources in an exponentially growing data environment.

To streamline the proteomic data workflow, we introduce the MSAID Platform, a comprehensive, managed, and cloud-based one-stop shop for proteomics. It facilitates data handling, storage, and analysis, allowing researchers to focus on scientific questions of interest. By leveraging the scalability and flexibility of cloud computing, this platform eliminates the limitations of local hardware, enabling researchers to run experiments at any time, without having to worry about resource limitations, and to process vast data sets automatically, efficiently, and reproducibly. Through its application programming interface (API)-based design, advanced users retain the ability to tailor workflows to their needs, facilitating seamless integration with existing or new tools and providing a "best of both worlds" approach if desired.

■ MATERIALS AND METHODS

The MSAID Platform is designed as a cloud-native solution, employing a microservices architecture orchestrated by Kubernetes to ensure both scalability and flexibility across various computational tasks (Figure 1). In its current inception, it is hosted on Amazon Web Services (AWS) but is compartmentalized for future deployment into other cloud service providers or a local server solution. Platform services and compute resources are deployed using an AWS Elastic Kubernetes Service (EKS) cluster, with automated infrastructure management facilitated by Terraform and Helm. User management, including authentication and authorization, is handled through AWS Cognito, incorporating multifactor authentication to ensure secure access and compliance with data protection protocols. Centralized control of the platform's operations is achieved through an API server, which governs all aspects of data handling, processing, and user interactions.
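As an illustration of the authentication layer described above, the snippet below obtains tokens from an AWS Cognito user pool with boto3. It is a minimal sketch under assumed placeholder values: the region, app client ID, and credentials are not the platform's actual configuration, and the MSAID web interface and CLI perform this flow (including the multifactor challenge) on the user's behalf.

```python
import boto3

# Placeholders for illustration only; the real user pool, app client ID, and region
# used by the MSAID Platform are managed by its own CLI and web login.
REGION = "eu-central-1"
APP_CLIENT_ID = "EXAMPLE_APP_CLIENT_ID"

cognito = boto3.client("cognito-idp", region_name=REGION)

# Username/password flow against a Cognito user pool (USER_PASSWORD_AUTH must be
# enabled on the app client for this flow to work).
response = cognito.initiate_auth(
    ClientId=APP_CLIENT_ID,
    AuthFlow="USER_PASSWORD_AUTH",
    AuthParameters={"USERNAME": "user@example.org", "PASSWORD": "********"},
)

# With multifactor authentication enforced, Cognito responds with a challenge
# (e.g., SOFTWARE_TOKEN_MFA) that must be answered via respond_to_auth_challenge().
if "ChallengeName" in response:
    raise SystemExit(f"MFA challenge required: {response['ChallengeName']}")

tokens = response["AuthenticationResult"]
print("Received ID and access tokens; expires in", tokens["ExpiresIn"], "s")
```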
Figure 2. (A) The welcome screen of the platform presents key statistics of the user account, including the number of running searches, quick links
to the latest triggered experiments and the available processing quota. (B) Speed comparison of the browser-based (Firefox v130.0) and CLI-based
10 GB raw data upload into the AWS S3 data lake using a Windows Server 2022 server connected with a 1 Gbit/s uplink. Theoretical limit is
determined as the maximum achievable throughput of a 1 Gbit/s uplink. (C) Data management and organization are facilitated by adding free text
tags. Both tags and auto-generated metadata can be used in no-code queries for data retrieval. Images reproduced with permission from MSAID.
Figure 3. (A) The experimental design acts as an additional layer of metadata annotation. Raw files can be annotated as samples for later
visualization. In the case of tandem mass tag (TMT)-labeled samples, the individual channels can be annotated. (B) Runtime comparison of a 1 h Q
Exactive HF-X HeLa file run individually and as copies of the same file run in parallel. Identification, error control, and quantification were performed
across all files. (C) Identification numbers for an offline-fractionated DIA data set acquired with an Orbitrap Astral. Peptide-spectrum matches
(PSMs) are represented at 1% file-local false discovery rate (FDR), all other levels at 1% data set-global FDR. Raw data reprocessed from Serrano et
al.24 (PRIDE data set identifier PXD049028). (D) Comparison of 2 HeLa samples searched together or combined later in postprocessing
demonstrates the result identity of longitudinally processed and later aggregated data. Displayed are PSMs at 1% PSM FDR (Venn diagram),
unique precursors at 1% precursor FDR (bar chart), a scatterplot of mokapot SVM score of all precursors irrespective of FDR with a Pearson
correlation of 1.00 and the delta in precursor quantitation at 1% precursor FDR, all indicating result identity.
folder using freely configurable regular expressions to include and exclude expressions, such as "HeLa" or "QC". Upon completion of raw data acquisition, files matching these expressions are automatically uploaded. This feature is particularly advantageous for longitudinal studies or quality control applications, where data files are generated repeatedly or over a longer period. The CLI client has been optimized to achieve upload speeds of >100 MB/s on a 1 Gbit/s uplink, rendering the upload of even large studies feasible in just a few hours (Figure 2B).

Data security and compliance are central to the platform's design; all hosting is performed on AWS, a provider certified under ISO norms (International Organization for Standardization) and the CSA STAR (Security, Trust, Assurance, and Risk) program of the CSA Group. All data are encrypted in transit and at rest and securely stored on S3, benefiting from its inherent redundancy and recovery features. Stringent access control via Access Control Lists (ACLs) ensures that each user's data is isolated from others, aligning with state-of-the-art security practices and compliance requirements, including the EU General Data Protection Regulation (GDPR).

During and after upload, the platform provides data tagging and metadata management capabilities. Users can tag data with free-text labels, facilitating fully customizable organization and retrieval. This tagging system integrates seamlessly with the platform's table-based data management feature, allowing users to organize their raw or fasta files and construct powerful no-code filtering queries based on various data attributes within the browser (Figure 2C). Uploaded fasta protein databases can be associated with parse rules to ensure proper extraction of protein names, gene names, and organisms to cater for the various sources of fasta files.

Processing of proteomic data is facilitated through an intuitive, multistep wizard that guides users in setting up experiments. This wizard assists browsing, filtering, and selecting input files, making it straightforward for users to initiate their analyses. It also allows recording the experimental design of a study for record keeping and to facilitate later statistical testing and visualization (Figure 3A).

The platform's design is search engine-agnostic, enabling integration with any search engine that can operate within a Docker container. Currently, the platform is powered by CHIMERYS 4,19 with plans to incorporate additional search engines in the future. CHIMERYS is capable of handling data-dependent acquisition (DDA), data-independent acquisition (DIA), and parallel reaction monitoring (PRM) experiments. It operates in a fully spectrum-centric manner, features the deconvolution of chimeric spectra, and incorporates the INFERYS 4 deep-learning model, which provides retention time and fragment ion intensity predictions for the most common post-translational modifications (PTMs) like phosphorylation, acetylation, ubiquitination, cysteine modifications, oxidation, tandem mass tags (TMT), and isotopically labeled amino acids. An in-depth characterization of the CHIMERYS algorithm is available in a separate manuscript.19 The processing pipeline also includes comprehensive postprocessing features, such as MS1- and MS2-based quantification via deconvolution and TMT reporter ion-based quantification. During postprocessing, Mokapot26 and Picked Protein Group FDR25 are employed for rigorous error control.
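To make the automated watch-folder upload described earlier in this section concrete, the sketch below shows the kind of logic such a client could implement: regular-expression include/exclude filters, a simple check that acquisition has finished, and parallel multipart uploads to S3. It is illustrative only; the bucket, prefix, and patterns are placeholders, and the actual MSAID CLI uploads into the platform's managed data lake with its own credentials and transfer tuning.

```python
import re
import time
from pathlib import Path

import boto3
from boto3.s3.transfer import TransferConfig

WATCH_DIR = Path("D:/acquisition")                # folder written by the instrument PC
INCLUDE = re.compile(r"HeLa|QC", re.IGNORECASE)   # free-text include expressions
EXCLUDE = re.compile(r"blank", re.IGNORECASE)     # free-text exclude expressions
BUCKET, PREFIX = "example-raw-data-bucket", "uploads/"  # placeholder destination

# Parallel multipart uploads are what make >100 MB/s on a 1 Gbit/s uplink feasible.
transfer = TransferConfig(multipart_chunksize=64 * 1024 * 1024, max_concurrency=8)
s3 = boto3.client("s3")
uploaded = set()


def acquisition_finished(path: Path, wait_s: float = 5.0) -> bool:
    """Treat a raw file as complete once its size stops changing."""
    size = path.stat().st_size
    time.sleep(wait_s)
    return path.stat().st_size == size


while True:
    for raw in sorted(WATCH_DIR.glob("*.raw")):
        name = raw.name
        if raw in uploaded or not INCLUDE.search(name) or EXCLUDE.search(name):
            continue
        if not acquisition_finished(raw):
            continue
        s3.upload_file(str(raw), BUCKET, PREFIX + name, Config=transfer)
        uploaded.add(raw)
    time.sleep(30)  # poll the acquisition folder twice per minute
```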
Figure 4. (A) Exploration of results directly in the browser, including nested associations of all contributing data levels (PSMs, precursors, modified
peptides, protein groups). (B) Volcano plot on protein group level created within the platform contrasting a CRISPR-Cas9 mitochondrial MGME1
gene knockout (KO) in human HAP1 cells with wildtype (WT) HAP1 cells. Raw data reprocessed from Serrano et al.24 (PRIDE data set identifier
PXD049028). A two-sided t test was performed for all proteins with complete observation on n = 3 replicate single shots for WT and KO.
Benjamini−Hochberg was used to calculate false discovery rate (FDR). CHIMERYS processing yields 639 significantly (q-value ≤ 0.05) regulated
proteins with an absolute fold-change of ≥2.
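The statistical analysis summarized in the Figure 4B caption (a two-sided t test on proteins with complete observations, Benjamini-Hochberg correction, and q-value and fold-change thresholds) can also be reproduced offline on an exported protein-group table. The sketch below assumes a hypothetical TSV layout with three wild-type and three knockout intensity columns; it illustrates the procedure rather than the platform's internal implementation.

```python
import numpy as np
import pandas as pd
from scipy.stats import ttest_ind
from statsmodels.stats.multitest import multipletests

# Hypothetical export: one row per protein group, intensity columns wt_1..wt_3, ko_1..ko_3.
df = pd.read_csv("protein_groups.tsv", sep="\t")
wt = np.log2(df[["wt_1", "wt_2", "wt_3"]].to_numpy(dtype=float))
ko = np.log2(df[["ko_1", "ko_2", "ko_3"]].to_numpy(dtype=float))

# Keep protein groups with complete observations in both conditions, as in Figure 4B.
complete = np.isfinite(wt).all(axis=1) & np.isfinite(ko).all(axis=1)
wt, ko, df = wt[complete], ko[complete], df.loc[complete].copy()

# Two-sided t test per protein group, followed by Benjamini-Hochberg correction.
_, pvals = ttest_ind(ko, wt, axis=1)
_, qvals, _, _ = multipletests(pvals, method="fdr_bh")

df["log2_fc"] = ko.mean(axis=1) - wt.mean(axis=1)
df["q_value"] = qvals

# q <= 0.05 and absolute fold change >= 2 (i.e., |log2 fc| >= 1), as in the caption.
regulated = df[(df["q_value"] <= 0.05) & (df["log2_fc"].abs() >= 1)]
print(f"{len(regulated)} significantly regulated protein groups")
```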
Processing templates for experiments can be saved by the user to facilitate setting up standardized experiments. The settings can also be exported to directly submit jobs using the CLI client instead of interacting with the graphical user interface (GUI). At the time of writing, CHIMERYS, and hence the platform, is compatible with all Thermo Scientific mass spectrometers. Compatibility with other vendors and open formats like mzML is expected within the year 2025.

The cloud-native setup allows for the deployment of several hundred compute pods, backed by hundreds of central processing unit (CPU) cores and graphics processing unit (GPU) instances, ensuring that processing remains efficient and fast regardless of the data volume or parallel usage of the platform. Raw files are processed in parallel, with subsequent combination of the results of all searches during postprocessing to optimize the overall analysis runtime. Elastic scaling of the platform is achieved using an autoscaler, which analyzes submitted workloads and dynamically acquires or releases computation resources. This strategy keeps the cluster size appropriate to the scheduled compute tasks and ensures efficient processing from low-activity times to load spikes. Currently, the cluster can simultaneously spawn >1,000 compute instances, and we are working to expand this capacity by an additional order of magnitude and to expand to more than a single datacenter/availability zone to service users across the globe. Performance benchmarks demonstrate the scalability of the platform. Processing a single 1 GB HeLa file including MS1 quantification takes 36 min, while processing 100 files concurrently extends the total time to 108 min (Figure 3B), resulting in 56x faster processing than acquisition time. A published, fractionated Orbitrap Astral DIA data set24 comprising 103 GB in size was processed 2.7x faster than acquisition time (198 min processing, 552 min acquisition time), further highlighting the platform's capability to handle large data sets, even if they are not automatically uploaded and streamed (data not shown). The analysis resulted in 5,859,533 peptide-spectrum matches (PSMs) at 1% file-local FDR, 342,146 precursors, 236,882 peptides and 11,048 protein groups (at 1% data set-global FDR), underlining the exceptional depth of proteomic profiling that can be achieved nowadays from a single biological sample (Figure 3C).

The platform also supports the efficient reuse of existing data via combination of previously generated experiments, benefiting quality control (QC) applications and longitudinal data collection and analysis. Users can process each raw file as it becomes available, also through a fully automated workflow via the CLI client. Once a study is completed, experiments processed with compatible settings can be easily combined through a simple wizard in the browser or the CLI. This combination triggers a rerun of the computationally inexpensive postprocessing steps only, including quantification, FDR roll-up, and picked-protein grouping, allowing users to benefit from the thorough analysis of individual files while also obtaining comprehensive results from the entire study without the need for a full search engine run. Due to the deterministic and reproducible results of the processing step, no difference in results (Pearson correlation of R = 1.00) is observed whether data is processed together or combined later (Figure 3D).

During the execution of experiments, users can monitor their progress in real time via the browser. Once an experiment concludes, the platform provides an overview of identified PSMs, peptides, and protein groups, offering immediate insight into the results.

To allow users to engage with their results, the platform provides a range of interactive tools. The processed data is systematically stored in a data lake, enabling complex queries across potentially thousands of files. A Trino/DuckDB data lake query layer allows users to retrieve or analyze data directly in the browser (Figure 4A). Tab-separated values (TSV) files can be exported and downloaded, providing users complete control over their results for offline storage and processing if desired. File downloads can be fine-tuned with options to apply FDR filtering, formatting, and level selection (PSMs, precursors, modified peptides, peptides, and protein groups). The platform output is evolving to conform to existing standards (e.g., SDRF23) and will soon offer integration with frequently used tools such as Skyline. Additionally, the CLI allows users to download the results of submitted jobs directly, enabling them to upload, process, and download results within their pipelines without requiring any interaction with the GUI.

As an alternative to downloading data, users can explore their results online through a data browser that provides an intuitive tabular overview of the full result set, including advanced filtering and search functionalities backed by Trino's distributed query engine. This data browser allows users to gain valuable insights into their data before committing to a large download, for example, to quickly determine if proteins of interest have been detected. Nested tables link all evidence levels, facilitating detailed examination of the data, such as the quality of all detected PSMs associated with a specific protein.
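Downloaded result files can also be interrogated locally with the same engine that backs the in-browser queries. The sketch below uses DuckDB to aggregate precursor evidence from a set of exported parquet files; the file layout and column names (q_value, protein_group, intensity) are assumptions for illustration and may differ from the platform's actual export schema.

```python
import duckdb

con = duckdb.connect()  # an in-memory database is sufficient for ad hoc queries

# Aggregate FDR-filtered precursor evidence per protein group across many result files.
top_groups = con.execute(
    """
    SELECT protein_group,
           count(*)       AS n_precursors,
           sum(intensity) AS summed_intensity
    FROM read_parquet('results/*.parquet')
    WHERE q_value <= 0.01
    GROUP BY protein_group
    ORDER BY summed_intensity DESC
    LIMIT 20
    """
).fetchdf()

print(top_groups)
```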
In addition to allowing access to fully searchable online results, the data lake structure enables scientists to perform statistical testing and visualization directly in the browser. Each experiment includes an interactive, modifiable, and restorable visualization dashboard (Figure 4B). This dashboard offers simple, one-click creation of a variety of customizable plots, such as bar plots for identification numbers and scatter plots visualizing the correlation between files, as well as common tools like UpSet plots, principal component analysis (PCA), and differential expression analysis with volcano plot visualizations. Both the data points underlying the plot (TSV files) and the plots themselves (vector graphics or Portable Network Graphics [PNGs]) can be downloaded. The plotting capabilities of the platform will expand continuously, aiming to eliminate the need for external analysis tools like R or Python for straightforward data exploration by integrating more functionalities over time.

Overall, we have introduced the first publicly accessible all-in-one Software as a Service (SaaS) platform for proteomics. Our goal was to create an easy-to-use solution for managing and processing proteomic data that can handle swiftly growing volumes of data, while relieving users from the need to buy and manage large compute and storage systems to keep up with the speed of data acquisition. The cloud-native design ensures scalable data upload, management, processing, and result deposition, with features for systematic result exploration and advanced online data interaction directly in the browser. We believe this platform provides a strong foundation, marking the beginning of moving proteomic data processing to the cloud. It empowers researchers by decoupling scientific tasks from the underlying compute, allowing them to focus on solving problems instead of spending time managing infrastructure.

■ DISCUSSION

The MSAID Platform represents a pioneering effort in the field of proteomics, offering a managed proteomic pipeline and storage solution with an intuitive browser-based interface that eliminates the need for individual laboratories to manage their own infrastructure. This approach significantly lowers entry barriers, particularly for smaller laboratories that may lack the resources to establish and maintain complex data processing pipelines. This contrasts our solution with pipelines like quantms,4 which require a self-managed compute environment and only provide a command line interface. By automating the data pipelines from raw data to conclusions, the platform streamlines research processes, making advanced proteomic workflows accessible to a broader range of scientists.

Implementing a cloud-native proteomic workflow addresses a critical need for scalable analyses that keep pace with the rapid growth of data volume fueled by recent developments of faster and more sensitive instruments. A single mass spectrometer running at full efficiency can generate more than a terabyte of raw data within a week, presenting substantial storage and resource challenges that quickly exceed the capacity of local compute clusters, which are not easily scalable. Even batch-processing cloud models struggle with scalability, prolonged transfer times, and a lack of integrated storage.

In contrast, the platform offers a private proteomic data lake, enabling users to store and analyze large data sets without hardware constraints. Integrated online workflows eliminate the need for repeated data uploads and downloads, allowing efficient data reuse. To further streamline the workflow, the platform includes tools for automated raw data uploads and result downloads, simplifying the analysis process for researchers. Future developments will leverage the data lake to provide advanced features, such as generating insights from previous experiments, creating downstream analyses, and producing aggregated data views and additional visualizations. Programming libraries for R and Python will offer direct interaction with the results, enabling custom analysis. Additionally, the API will facilitate programmatic access to both experiment-specific and cross-experimental data, ensuring flexibility and integration into diverse research workflows.

The cloud-based nature of the platform may raise concerns regarding security and associated costs. To address these concerns, the platform follows state-of-the-art data handling, including encryption and strict ACLs. Further reinforcing the commitment to security, we are pursuing ISO 27001 certification, which will make it easier for companies and researchers operating in regulated environments to adopt the platform. To provide scientists with the opportunity of exploring the platform, a generous free processing package is available.

Currently, the SaaS solution is fully managed by us, but we are aware of the demand for additional compliance and access management through alternative deployment options. In response, we plan to offer Virtual Private Cloud (VPC) deployments into user-owned cloud accounts, in turn providing enhanced compliance, access control, and data sovereignty. Initially, this will be available for AWS, with future expansion to other cloud providers. While the platform currently relies on AWS services, the core components of the platform are cloud-native technologies not specific to AWS, enabling adaptation to other Kubernetes environments in the future. For example, the S3 data storage can be replaced with any S3-compatible object storage solution like Google Cloud Storage, Azure Blob Storage, or MinIO with reasonable effort. For organizations with existing high-performance computing (HPC) infrastructure or those preferring on-premises solutions, we are also developing a local server deployment option. This approach offers key advantages, including complete data control, offline access, and tailored cost management, and it provides a highly viable solution for laboratories operating in sensitive environments. In addition, public funding opportunities often favor one-time hardware and software purchases and have yet to fully adapt to supporting recurring compute costs, even though modern software packages, including essential tools like office suites, are transitioning to SaaS.

SaaS allows for continuous feature delivery and improvement and makes it possible to quickly patch critical software exploits. To ensure reproducibility, a two-tiered deprecation strategy is followed: updates to CHIMERYS that change results are released as new minor versions (e.g., 4.1 → 4.2) and remain available for at least one year, whereas critical security patches may replace earlier versions within the same minor release without affecting results (e.g., 4.1.0 → 4.1.1). This approach balances software security with reproducibility of prior data. While the platform is not intended as a permanent data storage solution at this stage, we plan to introduce data archiving options at a fraction of the cost of S3 storage, reducing the need for local backups. We will also focus on simplifying data integration, including importing data from public proteomic repositories to facilitate a neglected data workflow in the proteomic community: the reuse and reanalysis of the wealth of publicly available and previously analyzed data.
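The two-tiered deprecation policy above has a simple operational consequence for pipelines that combine longitudinally processed experiments: results are expected to be identical whenever two CHIMERYS versions share the same major.minor release line. The helper below is an illustrative encoding of that rule, not part of the platform.

```python
def minor_line(version: str) -> tuple:
    """Return the (major, minor) release line of a version string such as '4.1.2'."""
    major, minor, *_ = (int(part) for part in version.split("."))
    return major, minor


def results_reproducible(v1: str, v2: str) -> bool:
    """True if two CHIMERYS versions are expected to produce identical results."""
    return minor_line(v1) == minor_line(v2)


assert results_reproducible("4.1.0", "4.1.1")      # patch release: results unchanged
assert not results_reproducible("4.1.3", "4.2.0")  # minor release: results may change
```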
In addition, we plan to streamline publishing results obtained on the platform, by, e.g., directly uploading data, metadata, and results to repositories like PRIDE or allowing a public "view-only" option for obtained results.

Looking ahead, we aim to enhance the platform's visualization capabilities, driven by user feedback. This includes developing intuitive QC reports and plots and offering diverse views into the underlying MS data (e.g., visualization of quantification traces or a potential Skyline integration), emphasizing our viewpoint that visual inspection of raw data remains crucial and should be taught. While the platform does not yet fully replace offline data analysis, ongoing development aims to close this gap. Our vision is for biologists to focus on interpreting results, such as understanding the volcano plot on a molecular level, rather than worrying about how to juggle terabytes of experiment data in the process of generating it.

Looking further ahead, we consider the platform the foundation for large-scale proteomic result generation and exploitation. After addressing the data processing challenges, our focus will shift to advanced data interrogation. This will involve linking with external resources (like PhosphoSitePlus,27 UniProt28,29 and ProteomicsDB30,31), better utilizing insights already available from other tools, and culminating in integrating large language models (LLMs) for conversational data analysis.32 Ultimately, we aim to help researchers generate hypotheses to follow up on the ever-growing volume of data, made possible by an integrated workflow from raw files to results and the systematic storage provided by the platform.

In summary, we are advancing into the cloud age of proteomics and project that the MSAID Platform will become an essential tool with a low entry barrier for researchers. Our long-term vision is making proteomic research more accessible and efficient for the expert and non-expert proteomic community.

■ ASSOCIATED CONTENT

Data Availability Statement
The platform can be tested free of charge after registering at https://platform.msaid.io.

Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jproteome.4c00871. Structure of the MSAID Platform API (Figure S1) (PDF).

■ AUTHOR INFORMATION

Corresponding Author

Authors
Agnes Guevende − MSAID GmbH, Garching b. München 85748, Germany
Alexander Hogrebe − MSAID GmbH, Berlin 13347, Germany; orcid.org/0000-0002-0203-6803
Michelle T. Berger − MSAID GmbH, Garching b. München 85748, Germany
Michael Graber − MSAID GmbH, Garching b. München 85748, Germany
Vishal Sukumar − MSAID GmbH, Garching b. München 85748, Germany
Lizi Mamisashvili − MSAID GmbH, Garching b. München 85748, Germany
Igor Bronsthein − MSAID GmbH, Berlin 13347, Germany
Layla Eljagh − MSAID GmbH, Garching b. München 85748, Germany
Siegfried Gessulat − MSAID GmbH, Berlin 13347, Germany
Florian Seefried − MSAID GmbH, Garching b. München 85748, Germany
Tobias Schmidt − MSAID GmbH, Garching b. München 85748, Germany

Complete contact information is available at: https://pubs.acs.org/10.1021/acs.jproteome.4c00871

Author Contributions
M.S. and D.P.Z. contributed equally. M.S., D.P.Z., and M.F. conceived the study. M.S. conceptualized the cloud-backend infrastructure. D.P.Z., M.F., M.S., and P.S. conceptualized the browser front end. M.S., P.S., S.B.F., D.B., A.H., and T.S. implemented the backend and the CLI, and mediated the front-end interaction. P.S., M.S., M.G., I.B., L.M., V.S., and F.S. integrated the CHIMERYS software. F.S. and S.G. integrated post-processing routines. A.G., A.H., M.T.B., and L.E. evaluated the platform. D.P.Z., M.S., and M.F. wrote the manuscript.

Funding
Work presented in this manuscript was in part funded by the German Federal Ministry of Education and Research (BMBF) with grant no. 13GW0603B.

Notes
The authors declare the following competing financial interest(s): All authors are employees of MSAID GmbH, a commercial entity which develops the software described in the study. M.F., D.P.Z., S.G., and T.S. are co-founders and shareholders of MSAID.
■ ABBREVIATIONS

ISO: International Organization for Standardization
LC: liquid chromatography
LLM: large language model
MS: mass spectrometry
NAS: network-attached storage
PCA: principal component analysis
PNG: portable network graphic, file format
PSM: peptide-spectrum match
QC: quality control
R: R statistical computing language
RAW: raw data, file format
RDS: AWS relational database service
S3: AWS simple storage service
SaaS: software as a service
SDK: software development kit
TMT: tandem mass tag
TSV: tab-separated values, file format
VPC: virtual private cloud
■ REFERENCES

(1) Heil, L. R.; Damoc, E.; Arrey, T. N.; Pashkova, A.; Denisov, E.; Petzoldt, J.; Peterson, A. C.; Hsu, C.; Searle, B. C.; Shulman, N.; Riffle, M.; Connolly, B.; MacLean, B. X.; Remes, P. M.; Senko, M. W.; Stewart, H. I.; Hock, C.; Makarov, A. A.; Hermanson, D.; Zabrouskov, V.; Wu, C. C.; MacCoss, M. J. Evaluating the Performance of the Astral Mass Analyzer for Quantitative Proteomics Using Data-Independent Acquisition. J. Proteome Res. 2023, 22 (10), 3290−3300.
(2) Peters-Clarke, T. M.; Coon, J. J.; Riley, N. M. Instrumentation at the Leading Edge of Proteomics. Anal. Chem. 2024, 96 (20), 7976−8010.
(3) Guzman, U. H.; Martinez-Val, A.; Ye, Z.; Damoc, E.; Arrey, T. N.; Pashkova, A.; Renuse, S.; Denisov, E.; Petzoldt, J.; Peterson, A. C.; Harking, F.; Østergaard, O.; Rydbirk, R.; Aznar, S.; Stewart, H.; Xuan, Y.; Hermanson, D.; Horning, S.; Hock, C.; Makarov, A.; Zabrouskov, V.; Olsen, J. V. Ultra-Fast Label-Free Quantification and Comprehensive Proteome Coverage with Narrow-Window Data-Independent Acquisition. Nat. Biotechnol. 2024, 42, 1855−1866.
(4) Dai, C.; Pfeuffer, J.; Wang, H.; Zheng, P.; Käll, L.; Sachsenberg, T.; Demichev, V.; Bai, M.; Kohlbacher, O.; Perez-Riverol, Y. Quantms: A Cloud-Based Pipeline for Quantitative Proteomics Enables the Reanalysis of Public Proteomics Data. Nat. Methods 2024, 21 (9), 1603−1607.
(5) Perez-Riverol, Y.; Bai, J.; Bandla, C.; García-Seisdedos, D.; Hewapathirana, S.; Kamatchinathan, S.; Kundu, D. J.; Prakash, A.; Frericks-Zipper, A.; Eisenacher, M.; Walzer, M.; Wang, S.; Brazma, A.; Vizcaíno, J. A. The PRIDE Database Resources in 2022: A Hub for Mass Spectrometry-Based Proteomics Evidences. Nucleic Acids Res. 2022, 50 (D1), D543−D552.
(6) Gessulat, S.; Schmidt, T.; Zolg, D. P.; Samaras, P.; Schnatbaum, K.; Zerweck, J.; Knaute, T.; Rechenberger, J.; Delanghe, B.; Huhmer, A.; Reimer, U.; Ehrlich, H.-C.; Aiche, S.; Kuster, B.; Wilhelm, M. Prosit: Proteome-Wide Prediction of Peptide Tandem Mass Spectra by Deep Learning. Nat. Methods 2019, 16 (6), 509−518.
(7) Zhou, X.-X.; Zeng, W.-F.; Chi, H.; Luo, C.; Liu, C.; Zhan, J.; He, S.-M.; Zhang, Z. PDeep: Predicting MS/MS Spectra of Peptides with Deep Learning. Anal. Chem. 2017, 89 (23), 12690−12697.
(8) Zeng, W.-F.; Zhou, X.-X.; Willems, S.; Ammar, C.; Wahle, M.; Bludau, I.; Voytik, E.; Strauss, M. T.; Mann, M. AlphaPeptDeep: A Modular Deep Learning Framework to Predict Peptide Properties for Proteomics. Nat. Commun. 2022, 13 (1), 7238.
(9) Meyer, J. G. Deep Learning Neural Network Tools for Proteomics. Cell Rep. Methods 2021, 1 (2), No. 100003.
(10) Yu, F.; Teo, G. C.; Kong, A. T.; Fröhlich, K.; Li, G. X.; Demichev, V.; Nesvizhskii, A. I. Analysis of DIA Proteomics Data Using MSFragger-DIA and FragPipe Computational Platform. Nat. Commun. 2023, 14 (1), 4154.
(11) Yang, K. L.; Yu, F.; Teo, G. C.; Li, K.; Demichev, V.; Ralser, M.; Nesvizhskii, A. I. MSBooster: Improving Peptide Identification Rates Using Deep Learning-Based Features. Nat. Commun. 2023, 14 (1), 4539.
(12) Declercq, A.; Bouwmeester, R.; Hirschler, A.; Carapito, C.; Degroeve, S.; Martens, L.; Gabriels, R. MS2Rescore: Data-Driven Rescoring Dramatically Boosts Immunopeptide Identification Rates. Mol. Cell. Proteom.: MCP 2022, 21 (8), No. 100266.
(13) Searle, B. C.; Pino, L. K.; Egertson, J. D.; Ting, Y. S.; Lawrence, R. T.; MacLean, B. X.; Villén, J.; MacCoss, M. J. Chromatogram Libraries Improve Peptide Detection and Quantification by Data Independent Acquisition Mass Spectrometry. Nat. Commun. 2018, 9 (1), 5128.
(14) Yang, Y.; Liu, X.; Shen, C.; Lin, Y.; Yang, P.; Qiao, L. In Silico Spectral Libraries by Deep Learning Facilitate Data-Independent Acquisition Proteomics. Nat. Commun. 2020, 11 (1), 146.
(15) Wallmann, G.; Skowronek, P.; Brennsteiner, V.; Lebedev, M.; Thielert, M.; Steigerwald, S.; Kotb, M.; Heymann, T.; Zhou, X.-X.; Schwörer, M.; Strauss, M. T.; Ammar, C.; Willems, S.; Zeng, W.-F.; Mann, M. AlphaDIA Enables End-to-End Transfer Learning for Feature-Free Proteomics. bioRxiv 2024, 2024.05.28.596182.
(16) Demichev, V.; Messner, C. B.; Vernardis, S. I.; Lilley, K. S.; Ralser, M. DIA-NN: Neural Networks and Interference Correction Enable Deep Proteome Coverage in High Throughput. Nat. Methods 2020, 17 (1), 41−44.
(17) Zolg, D. P.; Gessulat, S.; Paschke, C.; Graber, M.; Rathke-Kuhnert, M.; Seefried, F.; Fitzemeier, K.; Berg, F.; Lopez-Ferrer, D.; Horn, D.; Henrich, C.; Huhmer, A.; Delanghe, B.; Frejno, M. INFERYS Rescoring: Boosting Peptide Identifications and Scoring Confidence of Database Search Results. Rapid Commun. Mass Spectrom. 2021, No. e9128.
(18) Bruderer, R.; Bernhardt, O. M.; Gandhi, T.; Xuan, Y.; Sondermann, J.; Schmidt, M.; Gomez-Varela, D.; Reiter, L. Optimization of Experimental Parameters in Data-Independent Mass Spectrometry Significantly Increases Depth and Reproducibility of Results. Mol. Cell. Proteom.: MCP 2017, 16 (12), 2296−2309.
(19) Frejno, M.; Berger, M. T.; Tüshaus, J.; Hogrebe, A.; Seefried, F.; Graber, M.; Samaras, P.; Fredj, S. B.; Sukumar, V.; Eljagh, L.; Brohnshtein, I.; Mamisashvili, L.; Schneider, M.; Gessulat, S.; Schmidt, T.; Kuster, B.; Zolg, D. P.; Wilhelm, M. Unifying the Analysis of Bottom-up Proteomics Data with CHIMERYS. bioRxiv 2024, 2024.05.27.596040.
(20) Tyanova, S.; Temu, T.; Sinitcyn, P.; Carlson, A.; Hein, M. Y.; Geiger, T.; Mann, M.; Cox, J. The Perseus Computational Platform for Comprehensive Analysis of (Prote)Omics Data. Nat. Methods 2016, 13 (9), 731−740.
(21) Kohler, D.; Staniak, M.; Tsai, T.-H.; Huang, T.; Shulman, N.; Bernhardt, O. M.; MacLean, B. X.; Nesvizhskii, A. I.; Reiter, L.; Sabido, E.; Choi, M.; Vitek, O. MSstats Version 4.0: Statistical Analyses of Quantitative Mass Spectrometry-Based Proteomic Experiments with Chromatography-Based Quantification at Scale. J. Proteome Res. 2023, 22 (5), 1466−1482.
(22) Bloom, J.; Triantafyllidis, A.; Quaglieri, A.; Ngov, P. B.; Infusini, G.; Webb, A. Mass Dynamics 1.0: A Streamlined, Web-Based Environment for Analyzing, Sharing, and Integrating Label-Free Data. J. Proteome Res. 2021, 20 (11), 5180−5188.
(23) Deutsch, E. W.; Bandeira, N.; Perez-Riverol, Y.; Sharma, V.; Carver, J. J.; Mendoza, L.; Kundu, D. J.; Wang, S.; Bandla, C.; Kamatchinathan, S.; Hewapathirana, S.; Pullman, B. S.; Wertz, J.; Sun, Z.; Kawano, S.; Okuda, S.; Watanabe, Y.; MacLean, B.; MacCoss, M. J.; Zhu, Y.; Ishihama, Y.; Vizcaíno, J. A. The ProteomeXchange Consortium at 10 Years: 2023 Update. Nucleic Acids Res. 2023, 51 (D1), D1539−D1548.
(24) Serrano, L. R.; Peters-Clarke, T. M.; Arrey, T. N.; Damoc, E.; Robinson, M. L.; Lancaster, N. M.; Shishkova, E.; Moss, C.; Pashkova, A.; Sinitcyn, P.; Brademan, D. R.; Quarmby, S. T.; Peterson, A. C.; Zeller, M.; Hermanson, D.; Stewart, H.; Hock, C.; Makarov, A.; Zabrouskov, V.; Coon, J. J. The One Hour Human Proteome. Mol. Cell. Proteom.: MCP 2024, 23 (5), No. 100760.