0% found this document useful (0 votes)

89 views7 pages

12 Factor Apps and Mlops Maturity

The document outlines 12 principles for building modern data applications called the Twelve-Factor App methodology. It discusses each principle in 1-2 paragraphs, explaining the importance of things like storing code in version control, explicitly declaring dependencies, storing configs in environments, treating backing services as attached resources, separating build and run stages, executing apps as stateless processes, and others. The principles are intended to help apps be more scalable, fault-tolerant, and easily deployable.

Uploaded by

anish@sparta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

89 views7 pages

12 Factor Apps and Mlops Maturity

Uploaded by

anish@sparta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

12 FACTOR APPS

Popular platform-as-a-service provider Heroku (now a subsidiary

of Salesforce) maintains a manifesto of sorts called The Twelve-Factor App. It
outlines a methodology for building modern data applications. Despite being
partly self-serving (apps built like this will translate more naturally to running
on Heroku), there’s a lot of meaty opinioned material worth examining. These
principles should be familiar to users of Django and Kedro frameworks over in
the Python community.

Strive for These Best Practices

These conceptual ideals are important, even if you aren’t the developer! For
those who desire to know why this stuff is important, or who want to have an
intelligent conversation with their team about these issues, this article is a
short summary.

I. Codebase — A single codebase tracked in revision control,

corresponding one to one with the app, with many deploys

Put all code in the source control system, in one repository, all the time. A
codebase is forked, branched, modified, and run by developers on their own
dev VMs. Changes are committed to the branch, and when ready, a pull
request is made to merge the branch into the main, with review, at a new
version level. Over time, the code base is deployed to any number of other
environments, including many sets of testing machines and ultimately the live
production servers.

Importance: Non-negotiable Everyone does this, even in machine learning,

and developers will laugh at you if you aren’t.

II. Dependencies — Explicitly declare and isolate dependencies

All the environments that the code runs in need to have dependencies, like a
database, or an image processing library, or a command-line tool. Never let
an application assume those things will be in place on any given machine.
Ensure it by baking those dependencies into the package description.
Most languages and frameworks provide a natural way to do this. List all the
versions of all the libraries expected to be in place, and when the code is
deployed, a command is run to download all the right versions and put them in
place. In R, use renv or build packages to be curated via a package manager.
This philosophy extends to the team managing entire machine configurations
using management tools like Docker.

Importance: High Without this, the team will have a constant slow time-suck
of confusion and frustration, multiplied by their size and number of
applications. Spare yourselves.

III. Config — Store config in the environment

Configuration is anything that may vary between different compute

environments. Code is all the stuff that doesn’t.

The code that talks to your database will always be the same. But the location
of that database (which machine it’s running on) will be different for a local
developer machine than it will be for production servers. Likewise, in the
testing environment, the team will want to log debugging information about
each web request, but in production that would be overkill. The same principle
applies to blob storage locations and taking advantage of parallel compute
cores.

Usernames and passwords for various servers and services also count as
configuration, and should never be stored in the code. This is especially true
because code is in source control (see I. above) which means that anyone
with access to the source will know all your service passwords, which is a bad
security hole as your team grows.

All configuration data must be stored in a separate place from the code,
strictly separated, and read in by the code at a deployment for runtime.

Importance: High Security review mandates that the config environment

elements be separable at every stage.

IV. Backing Services — Treat backing services as attached resources

Code at run time will talk to many services, like a database, a cache, an email
service, a queueing system, etc. These should all be referenced by a simple
endpoint (URL). They might be running on a local file system, or they might be
on a different host, in a different datacenter, or managed by a cloud provider.
The point is, code shouldn’t know the difference.

This allows great flexibility, so someone from your team could replace a local
instance of Redis with one served by Amazon through Elasticache, and the
code wouldn’t have to change.

This is another case where defining dependencies cleanly keeps the system
flexible and each part is abstracted from the complexities of the others (very
much a core tenet of good architecture).

Importance: High Given the current bindings to services, there’s little reason
not to adhere to this best-practice.

V. Build, release, run — Strictly separate build and run stages

The process of turning the code into a bundle of scripts, assets and binaries
that run the code is the build. In R, the build assembles the package elements
like documentation, unit tests, and binaries. The release sends that code to a
server in a fresh package together with the nicely separated config files for
that environment (See III. above). Then the code is run so the application is
available on those servers.

The idea here is that the build stage does a lot of heavy lifting, and developers
manage it. The run stage should be simple and bullet-proof so that the team
can sleep soundly through the night, knowing that the application is running
well, and that if a machine gets restarted (say, a power failure happens) that
the app will start up again on launch without the need for human intervention.

Importance: Conceptual From a practical perspective, the tools and

framework used will define best-practices for building, deploying, and running
the app. Some do a better job than others of enforcing strict separation, but
you should be okay if you follow your framework’s suggested mechanisms.

VI. Processes — Execute the app as one or more stateless processes

It’s likely the enterprise will have the application running on many different
servers over time, because that makes it more fault tolerant, and because it
can support more traffic. As a rule, we want each of those instances of
running code to be stateless. In other words, the state of the system is
completely defined by the databases and shared storage, and not by each
individual running application instance. Memory or cache cannot be relied on
to be available for a future run.

Let’s say our app has a signup workflow, where a user has to enter 3 screens
of information to create their profile. One (wrong) model would be to store
each intermediate state in the running code, and direct the user back to the
same server until the signup process is complete. The right approach is to
store intermediate data in a database or persistent key-value store, so even if
the web server goes down in the middle of the user’s signup, another web
server can handle the traffic, and the system is none-the-wiser.

Importance: High Not only is a stateless app more robust, but it’s easier to
manage, generally incurs fewer bugs, and scales better.

VII. Port binding — Export services via port binding

This factor is an extension of factor IV. above. The idea is that, just like all the
backing services you are consuming, your application also interfaces to the
world using a simple URL.

Most of the time we get this for free because the application is already
presenting itself through a web-server. But let’s say we have an API that’s
used by both customers in the outside world (untrusted) and an internal
website (trusted). We might create a separate URL to the API that the website
can use which doesn’t go through the same security (firewall and
authentication), so it’s a bit faster for us than for untrusted clients.

Importance: Low Most runtime frameworks will give you this for free. If not,
don’t sweat it. It’s a clean way to work, but it’s generally not hard to change
later.

VIII. Concurrency — Scale out via the process model

When running code, the idea is that lots of little processes are handling
specific needs. So we might have dozens of handlers at the ready to process
web requests, and another dozen to handle API calls for enterprise users. And
still another half-dozen processing background welcome-emails going to new
users, or sending tweets for your users sharing things on your social media
service.

By keeping all these small parts working independently, and running them as
separate processes (in a low-level technical sense), the application will scale
better. In particular, you’ll be able to do more stuff concurrently, by smoothly
adding additional servers, or additional CPU/RAM and taking full advantage of
it through the use of more of these small, independent processes.

Importance: Low Trust the data architect to raise the red flag if this is going
to become an issue.

IX. Disposability — Maximize robustness with fast startup and graceful

shutdown

When deploying new code, we want that new version to launch right away and
start to handle traffic. If an application has to do 20 seconds of work (say,
loading giant mapping files into RAM) before it’s ready to handle real traffic,
we’ve made it harder to rapidly release code, and we’ve introduced more
churn on the system to stop/start independent processes.

With the proliferation of so many 3rd party libraries in today’s software

systems, sub–1-second startup times are less and less common. But beyond
loading code, the application should have everything it needs waiting in high-
speed databases or caches, so it can start up snappily and be ready to serve
requests.

Further, the application should be robust against crashing. Meaning, if it does

crash, it should always be able to start back up cleanly. Never do any
mandatory “cleanup” tasks when the app shuts down that might cause
problems if they failed to run in a crash scenario.

Importance: Medium Depending on how often you are releasing new code
(hopefully many times per day), and how much you have to scale your app
traffic up and down on demand, be sure to understand the implications.
X. Dev/prod parity — Design for continuous deployment by keeping
development, staging, and production all as similar as possible

It has become in vogue in recent years to have a much more rapid cycle
between developing a change to your app and deploying that change into
production. For many companies, this happens in a matter of hours. In order
to facilitate that shorter cycle, and the risk that something breaks when
entering production, it’s desirable to keep each developer’s local environment
as similar as possible to production.

This means using the same backing services, the same configuration
management techniques, the same versions of package libraries, and so on.

This is often accomplished by driving developers to use a tool to manage their

own personal virtual server that’s configured just like production servers.
External consultancies may struggle to work with clients to achieve anything
remotely like continuous deployment independently.

Importance: Medium Developers will feel like taking shortcuts if their local
environment is working “well enough”. Onboarding new personnel is made
much easier if the entire team has nearly identical environments and tools.

XI. Logs — Treat logs as event streams

Log files keep track of a variety of things, from the mundane (your app has
started successfully) to the critical (users are receiving thousands of errors).

In an ideal situation, those logs are viewed by developers in their local

consoles, and in production they are automatically captured as a stream of
events and pushed into a real-time consolidated system for long-term archival
and data-mining like Databricks.

At the very least, the std.out device in the environment should be capturing
errors and sending them to an error reporting service.

Importance: Low If you are relying on logs as a primary forensic tool, you are
probably already missing out on better solutions. Be sure to consolidate your
logs for convenience, but beyond that, don’t worry about being a purist here.
XII. Admin processes — Run admin/management tasks as one-off
processes

You’ll want to do lots of one-off administrative tasks once you have a live app.
For example, doing data cleanup on bad data you discover; running analytics
for a presentation you are putting together, or turning on and off features for
A/B testing.

Usually a developer will run these tasks, and when they do, they should be
doing it from a machine in the production environment that’s running the latest
version of the production code. In other words, run one-off admin tasks from
an identical environment as production. Don’t run updates directly against a
database, don’t run them from a local terminal window.

Importance: High Having console access to a production system is a critical

administrative and debugging tool, and every major language/framework
provides it. No excuses for sloppiness here.

Summary
Some of these items may seem esoteric, as they are rooted in fundamental
systems design debates. But at the heart of a happily running system is an
architecture that is robust, reliable, and surprises us as little as possible.
These 12 factors are being adopted by most major software platforms and
frameworks, and to cut corners against their grain is a bad idea. Discuss
these issues with the dev ops team and investigate whether there are some
quick wins to improve the quality of your application design.

Master Software Architecture Pragmatic
100% (2)
Master Software Architecture Pragmatic
400 pages
Ravi Sethi - Software Engineering - Basic Principles and Best Practices (2023, Cambridge University Pre
No ratings yet
Ravi Sethi - Software Engineering - Basic Principles and Best Practices (2023, Cambridge University Pre
807 pages
12 Factor Principles
No ratings yet
12 Factor Principles
5 pages
12factor 200129214915 PDF
No ratings yet
12factor 200129214915 PDF
269 pages
(Git) (Non HTTP WSDL) : I. Codebase VII. Port Binding
No ratings yet
(Git) (Non HTTP WSDL) : I. Codebase VII. Port Binding
6 pages
FSD M4
No ratings yet
FSD M4
25 pages
Unit 4 Deployment
No ratings yet
Unit 4 Deployment
10 pages
Factor: 12 App Methodology
No ratings yet
Factor: 12 App Methodology
42 pages
Devops-Unit 2
No ratings yet
Devops-Unit 2
15 pages
12 Factor App
No ratings yet
12 Factor App
126 pages
2021-03-11 - 12-Factor Apps Docker
No ratings yet
2021-03-11 - 12-Factor Apps Docker
36 pages
CheatSheet PDF
No ratings yet
CheatSheet PDF
3 pages
CCS112-Reviewer 031344
No ratings yet
CCS112-Reviewer 031344
13 pages
The Twelve-Factor Application
No ratings yet
The Twelve-Factor Application
9 pages
12fa Docker Golang Sample
No ratings yet
12fa Docker Golang Sample
29 pages
12 Factor Applications With Docker and Go (Tit Petric) (Z-Library)
100% (2)
12 Factor Applications With Docker and Go (Tit Petric) (Z-Library)
148 pages
Full Stack Developer
No ratings yet
Full Stack Developer
5 pages
CMPT 276 Cheatsheet
No ratings yet
CMPT 276 Cheatsheet
2 pages
Building High Performance, Scalable Web Applications
No ratings yet
Building High Performance, Scalable Web Applications
1 page
Introduction To Application Development and Emerging Technologies
No ratings yet
Introduction To Application Development and Emerging Technologies
7 pages
Tech Concepts PM
No ratings yet
Tech Concepts PM
1 page
A Developer's Guide To Load Testing: Software Architecture For Developers
No ratings yet
A Developer's Guide To Load Testing: Software Architecture For Developers
61 pages
12 Factor App
100% (1)
12 Factor App
24 pages
Project Title
No ratings yet
Project Title
2 pages
Developing The Front5
No ratings yet
Developing The Front5
3 pages
Software Design Guide
No ratings yet
Software Design Guide
9 pages
Cloud Foundry Certified Developer
No ratings yet
Cloud Foundry Certified Developer
7 pages
Software Design Guide
100% (1)
Software Design Guide
10 pages
Introduction - Software Development at A Crossroads
No ratings yet
Introduction - Software Development at A Crossroads
5 pages
Better Apis Quality Stability Observability
No ratings yet
Better Apis Quality Stability Observability
100 pages
Re Sumos
No ratings yet
Re Sumos
51 pages
7 Se
No ratings yet
7 Se
4 pages
AWS Marketplace Cloud-Native Ebook 2 Development Techniques FINAL
No ratings yet
AWS Marketplace Cloud-Native Ebook 2 Development Techniques FINAL
27 pages
SEmid 1
No ratings yet
SEmid 1
16 pages
Chapter 4 - Building Scalable Web Applications
No ratings yet
Chapter 4 - Building Scalable Web Applications
19 pages
Software Development Models Examples
No ratings yet
Software Development Models Examples
4 pages
Project Documentation - 5
No ratings yet
Project Documentation - 5
5 pages
Enterprise Application Development
No ratings yet
Enterprise Application Development
52 pages
Cloud Native Development and Maintenance Guide
No ratings yet
Cloud Native Development and Maintenance Guide
4 pages
Cloud Comouting - Unit No 4
No ratings yet
Cloud Comouting - Unit No 4
11 pages
Weterings Gijs Building A Scalable Development Cluster at Adyen
No ratings yet
Weterings Gijs Building A Scalable Development Cluster at Adyen
88 pages
Microservice Architecture
No ratings yet
Microservice Architecture
4 pages
The 12-Factor App
No ratings yet
The 12-Factor App
1 page
Master Software Architecture Demo
No ratings yet
Master Software Architecture Demo
86 pages
Slide 1: Title Slide: Software Development and Programming
No ratings yet
Slide 1: Title Slide: Software Development and Programming
9 pages
1 Ipt101
No ratings yet
1 Ipt101
19 pages
02 01 Architectural Design
No ratings yet
02 01 Architectural Design
4 pages
Microservices PDF
No ratings yet
Microservices PDF
6 pages
Software Dev Guide
No ratings yet
Software Dev Guide
2 pages
Serverless Best Practices - Paul Johnston - Medium
No ratings yet
Serverless Best Practices - Paul Johnston - Medium
7 pages
System Analysis and Design
No ratings yet
System Analysis and Design
9 pages
Microservices
No ratings yet
Microservices
14 pages
OOSE Basic Concepts 1
No ratings yet
OOSE Basic Concepts 1
8 pages
7 Default - Ilities - For Software Architecture
No ratings yet
7 Default - Ilities - For Software Architecture
2 pages
SIgma Daddies DocuVault Proposal
No ratings yet
SIgma Daddies DocuVault Proposal
9 pages
Msei 025
No ratings yet
Msei 025
24 pages
Agile Android Software Development Sample
0% (1)
Agile Android Software Development Sample
14 pages
Software Engineering 201
No ratings yet
Software Engineering 201
104 pages
The Philosophy of Fear and Freedom
No ratings yet
The Philosophy of Fear and Freedom
2 pages
01 Guide To Drafting Your Critical Role Letters
No ratings yet
01 Guide To Drafting Your Critical Role Letters
3 pages
Jesse
No ratings yet
Jesse
4 pages
Gcse Ict: by The End of This Session, You Will Be Able To
No ratings yet
Gcse Ict: by The End of This Session, You Will Be Able To
10 pages
Lesson 5 - Site Layout and Design-1
No ratings yet
Lesson 5 - Site Layout and Design-1
7 pages
Infinitiv Ili - Ing
0% (1)
Infinitiv Ili - Ing
4 pages
Translation For University Students - College of Artsdocx
No ratings yet
Translation For University Students - College of Artsdocx
28 pages
Frida Kahlo: By: Maria Jose Castillo, Camila Amaya, Danna Valencia
No ratings yet
Frida Kahlo: By: Maria Jose Castillo, Camila Amaya, Danna Valencia
9 pages
Assignment 2-2023-24
No ratings yet
Assignment 2-2023-24
7 pages
Secondary Market DR S Sreenivasa Murthy
No ratings yet
Secondary Market DR S Sreenivasa Murthy
33 pages
Motion To Disqualify Allen Baddour
No ratings yet
Motion To Disqualify Allen Baddour
12 pages
Modals of Probability 2
No ratings yet
Modals of Probability 2
2 pages
793 Utility Software
No ratings yet
793 Utility Software
126 pages
Throne of Secrets Kerri Maniscalco Instant Download
100% (2)
Throne of Secrets Kerri Maniscalco Instant Download
41 pages
Business Plan For Poultry in Ibadan
No ratings yet
Business Plan For Poultry in Ibadan
6 pages
Lesson 6: Physical Self: Makes Them Beautiful
No ratings yet
Lesson 6: Physical Self: Makes Them Beautiful
2 pages
Working of An Ad Production House
No ratings yet
Working of An Ad Production House
15 pages
Special Web 1 PDF
No ratings yet
Special Web 1 PDF
12 pages
British Ballads From Maine
No ratings yet
British Ballads From Maine
599 pages
PAHS 055 Session 4 Disaster Management - 1
No ratings yet
PAHS 055 Session 4 Disaster Management - 1
27 pages
Mergers and Acquisitions
No ratings yet
Mergers and Acquisitions
123 pages
RCU-75 Remote Controlled Tracked Carriers FAE EN
No ratings yet
RCU-75 Remote Controlled Tracked Carriers FAE EN
2 pages
Shree Cement Reprot 21-22
No ratings yet
Shree Cement Reprot 21-22
292 pages
Zudio Bill 1
No ratings yet
Zudio Bill 1
3 pages
Chap 014
No ratings yet
Chap 014
37 pages
Daily Express Friday April 29 2011
No ratings yet
Daily Express Friday April 29 2011
80 pages
Figurative Speech
No ratings yet
Figurative Speech
9 pages
MB-409 International Marketing All Unit Notes RGPV
No ratings yet
MB-409 International Marketing All Unit Notes RGPV
51 pages
Life of Augustine of Hippo The Donatist Controvers... - (PG 25 - 164) PDF
No ratings yet
Life of Augustine of Hippo The Donatist Controvers... - (PG 25 - 164) PDF
140 pages
Mountain of Fire & Miracles Ministries
No ratings yet
Mountain of Fire & Miracles Ministries
2 pages