Architectural Design Decisions for Machine Learning Deployment: Dataset and Code

doi:10.5281/zenodo.5823459

Published January 12, 2022 | Version v1

Dataset Open

Architectural Design Decisions for Machine Learning Deployment: Dataset and Code

1. University of Vienna

Title: Architectural Design Decisions for Machine Learning Deployment: Dataset and Code

Authors: Stephen John Warnett; Uwe Zdun

About: This is the dataset and code artefact for the paper entitled "Architectural Design Decisions for Machine Learning Deployment".

Contents: The "_generated" directory contains the generated results, including latex files with tables for use in publications and the Architectural Design Decision model in textual and graphical form. "Generators" contains Python applications that can be run to generate the above. "Metamodels" contains a Python file with type definitions. "Sources_coding" contains our source codings and audit trail. "Add_models" contains the Python implementation of our model and source codings. Finally, "appendix" contains a detailed description of our research method.

Paper Abstract: Deploying machine learning models to production is challenging, partially due to the misalignment between software engineering and machine learning disciplines but also due to potential practitioner knowledge gaps. To reduce this gap and guide decision-making, we conducted a qualitative investigation into the technical challenges faced by practitioners based on studying the grey literature and applying the Straussian Grounded Theory research method. We modelled current practices in machine learning, resulting in a UML-based architectural design decision model based on current practitioner understanding of the domain and a subset of the decision space and identified seven architectural design decisions, various relations between them, twenty-six decision options and forty-four decision drivers in thirty-five sources. Our results intend to help bridge the gap between science and practice, increase understanding of how practitioners approach the deployment of their solutions, and support practitioners in their decision-making.

Objective: This paper aims to study current practitioner understanding of architectural concepts associated with machine learning deployment.

Method: Applying Straussian Grounded Theory to gray literature sources containing practitioner views on machine learning practices, we studied methods and techniques currently applied by practitioners in the context of machine learning solution development and gained valuable insights into the software engineering and architectural state of the art as applied to ML.

Results: Our study resulted in a model of Architectural Design Decisions, practitioner practices, and decision drivers in the field of software engineering and software architecture for machine learning.

Conclusions: The resulting Architectural Design Decisions model can help researchers better understand practitioners' needs and the challenges they face, and guide their decisions based on existing practices. The study also opens new avenues for further research in the field, and the design guidance provided by our model can also help reduce design effort and risk. In future work, we plan on using our findings to provide automated design advice to machine learning engineers.

Notes

This work was supported by: FFG (Austrian Research Promotion Agency) project AMMONIS, no. 879705.

Files

ml_deployment_adds_v1.zip

Files (5.6 MB)

Name	Size	Download all
ml_deployment_adds_v1.zip md5:71e9a0464f8111e6e44973c99ae9dfa4	5.6 MB	Preview Download

	All versions	This version
Views	488	483
Downloads	47	46
Data volume	270.9 MB	265.3 MB

Architectural Design Decisions for Machine Learning Deployment: Dataset and Code

Creators

Description

Notes

Files

ml_deployment_adds_v1.zip

Files (5.6 MB)