0% found this document useful (0 votes)

11 views31 pages

Large Code Bases

Developing for bugger code base

Uploaded by

alyxredmond

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views31 pages

Large Code Bases

Developing for bugger code base

Uploaded by

alyxredmond

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

Coding for Large Code Bases

Andrew Willmott
What is Large?
• Several million lines of C++

• Common for any AAA game these days

• Generally:

• Large shared codebase providing core functionality

• Graphics Engine

• Game code
Why is this different?
• Scale massively affects Software Engineering issues

• Complexity management

• Working within a large team

• Iteration times

• Most C++ references and textbooks are targeted at ~1000-100,000 line

codebases.

• What can make sense at a small scale may be completely unworkable at a large
scale
Large Codebase Concerns
• Understandability

• Majority of code is code you didn’t write

• Learning the codebase, making changes easily

• Compile times

• Single-line-change iteration time, recompile times, relink times.

• Debuggability
API Understandability
• Goal: should be able to easily find and understand any needed API

• Common Issues

• More files: need to skip around a lot to get complete picture

• Implementation in header: harder to pick out the bits that you care about

• Badly formatted, or big ball of mud: hard to find what you need quickly
Recompile Times
• How long does it take to recompile updated code

• How long does it take to recompile affected code

• That one header everything includes

• Implementation leaking into interface

Compile + Link Times
• Number of files opened while compiling a module

• Number of lines parsed while compiling a module

• Note: Cost of file open much bigger than the read

• Number of redundant symbols that must be unified or stripped by the linker

• Sheer symbol count

Causes?
C++
• C++ is broken by design for large codebases

• A class definition must include implementation!

• Still waiting on module support decades later

• But hey we have <wat> from c++1<x>

• Language features are primarily added via complex templated library header
files

• But, almost all large code bases are written in C++

Implementation Detail in Headers
• Changing implementation requires recompile of all client code

• Mixture of public and private makes API less intelligible

• Leads to extending the size of the compilation unit

• Inheritance

• #including other files for implementation detail

Compilation Unit Bloat
• Implicit instantiation: templates

• Implementation repeated in each including module, redundancy removed by

the linker

• Contrast with explicit IntArray class: single implementation in single module

• Global declarations

• Can wind up repeated in all including modules unless care is taken

• constexpr helps these days

Dead Code
• Code that isn’t called by anything

• Code within functional code that is never exercised (future looking?)

• Often kept as a “safety blanket”

• Makes it difficult to reason about how code can be fixed or extended

• Makes it difficult to refactor or clean up code

• Deleting more lines than you add is productive in a large codebase!

A Fix: Implementation Hiding
• Biggest thing: focus on cleanly separating interface (API) and implementation.

• Public header should only include those things clients care about

• Changing implementation should only cause local recompilation

• Compounding positives:

• Fewer includes

• Those includes are smaller

• Fewer public symbols

Stage 1: Forward Declaration
• You should all have heard of this

• Forward declare everything where possible

• Where not possible, consider fixing the issue, whether it's a class-scoped
variable or inability to split out a needed type.

• class BLAH;

• enum BLAH : S32;

• class BLAH { enum COW : S32; class BOB; };

Some Implications
• Always declare storage type for enum. (Doesn’t have to be enum class)

• Don’t use class scope. (Can’t forward declare)

• No nested enums, classes/structs

• Prefer functions if there is no state

• Prefer functions over static methods

• Don’t #include platform/OS headers from public API

Stage 2: Hide the rest
• If you're adding anything that will be commonly used, you need to go beyond
this and also hide as much of the implementation from clients as possible.

• Three possible methods:

• Interfaces

• PIMPL

• Functional APIs (C-style).

Interfaces
• “Abstract Base Class” — all methods pure virtual, no data.

• “Concrete” implementation class inherits from interface, header is private

• Pros: plugin nature, DLLs

• Cons: virtual function call overhead, separate (third) header

Public Header
class I_ANIM_PROCESS : public REF_COUNTED
{
public:
virtual void* as_class(SICORE::IDENTIFIER class_id) const = 0; //!< Returns given class interface if supported, or nullptr i

virtual void add_time( const F32 elapsed_time ) = 0; //!< Add given +ve or -ve time delta to the current animatio
virtual void set_time( const F32 time ) = 0; //!< Jump directly to the given time
virtual void reset() = 0; //!< We want to restore this node back to sensible defaults

virtual bool process() = 0; //!< Updates output according to current state. Should first

virtual S32 get_num_inputs() const = 0; //!< Returns current number of inputs.

virtual void set_input(S32 index, I_ANIM_PROCESS* input ) = 0; //!< Sets the given process input. Unhandled proce
virtual I_ANIM_PROCESS* get_input(S32 index) const = 0; //!< Returns the given process input, or 0 if unset.
virtual void clear_inputs() = 0; //!< Remove all current inputs

virtual const ANIM_OUTPUT& get_output() const = 0; //!< Returns output from last process() call

virtual void get_parameters (SICORE::JSON_VALUE* params) const = 0; //!< Returns JSON version of this process's cu
virtual void update_from_parameters(const SICORE::JSON_VALUE& params) = 0; //!< Update internal state from the given set
};
Private Header
class ANIM_MIRROR : public I_ANIM_PROCESS
{
public:
// I_ANIM_PROCESS interface
void* as_class(SICORE::IDENTIFIER class_id) const override;

void add_time(const F32 elapsed_time) override;

void set_time(const F32 elapsed_time) override;
void reset() override;

bool process() override;

S32 get_num_inputs() const override;

void set_input( S32 index, I_ANIM_PROCESS* input ) override;
I_ANIM_PROCESS* get_input(S32 index) const override;
void clear_inputs() override;

// ANIM_MIRROR methods
void set_active(bool enabled); //!< Can be used to toggle mirroring on and off
bool is_active() const;

protected:
SIMATH::QUAT get_parent_rotation(int node_index); //!< Find modelspace orientation of parent

I_ANIM_PROCESS_REF m_input;
SKELETON_CONST_REF m_skeleton;

U32 m_input_id_hash = 0;
…
More on Interfaces
• Generally idea is to use this only for major manager or subsystem classes.

• Widely used in both OS APIs (including DirectX) and shipped games over
many years.

• Often combined with intrusive reference counting for lifetime management.

PIMPL
• “Private IMPLementation”

• All implementation detail goes into an internal class that is only forward
declared in the header

• blah.h/blah_internal.h/blah.cpp, or simply blah.h/blah.cpp

• Drawbacks:

• Two allocations per class instantiation (can be avoided with wrappers)

• Need either pass-through functions or prefixing internal data references

Public Header
#include "sicore/generic/patterns/pimpl.h"

class UI_FILE_BROWSER
{
public:
PIMPL_DECL(UI_FILE_BROWSER);

void set_location( const SICORE::FILE_PATH& path ); //!< Set starting location of browser
void set_location( const SICORE::FILE_ITEM* item ); //!< Set starting location of browser

void set_file_type ( const SICORE::FILE_TYPE& file_type ); //!< Set the allowed file type.
void set_file_types( S32 num_types, const SICORE::FILE_TYPE types[] ); //!< Set the allowed file types. This may include FIL

void set_show_filtered_items(bool enabled); //!< Sets whether to still show unallowed items as inactive, rather tha
void set_close_on_selection (bool enabled); //!< Sets whether the dialog auto-closes on item selection
void set_sort_mode( SICORE::FILES_SORT_MODE mode ); //!< Set sort mode for container contents

bool show( const C8* label, bool* opened ); //!< Show file browser window. Returns true if the user selects a file (

SICORE::FILE_ITEM_REF get_selected_item() const; //!< Returns selected file (or container if that was included in the all
SICORE::FILE_ITEM_CONST_REF get_location() const; //!< Returns the location we initially set.
};
Source or Private Header
struct SIAPPLICATION::UI_FILE_BROWSER::PRIVATE
{
SISTL::vector<FILE_ITEM_REF> m_pane_items; //!< Left-to-right list of containers for each pane
SISTL::vector<FILE_ITEMS_REF> m_pane_contents; //!< Corresponding container contents

FILE_ITEM_REF m_selection; //!< Currently selected item

FILES_SORT_MODE m_sort_mode = FILES_SORT_MODE::SORT_NAME_ASCENDING_CASE_INSENSITIVE;
SISTL::vector<FILE_TYPE> m_file_types; //!< File types to filter against, if any

bool m_show_filtered = true; //!< If true, files that aren't filtered are still shown, but as inactive selections.
bool m_close_on_selection = true;

bool show(); //!< Show dialog internals. Returns true if a valid selection was double-clicked
void refresh_panes( int start_pane, int num_panes ); //!< Fetch all the file/container items in the specified panes
bool is_supported(const FILE_TYPE& type) const; //!< Returns true if given file type is supported
};

void UI_FILE_BROWSER::set_file_types( S32 num_file_types, const SICORE::FILE_TYPE file_types[] )

{
_.m_file_types.assign(file_types, file_types + num_file_types);
}
Functional API
• Convert the class into data (ideally with a substantial part hidden) and a set of
functions that operate on that data.

class ENTITY;

ENTITY* create_entity();
void manipulate_entity(ENTITY* entity);
int get_entity_value(ENTITY* entity);
void destroy_entity(ENTITY* entity);

• Often use handles rather than pointers

Other Fixes
• Highly recommend using at least one of these approaches

• Larger codebases often use several

• Initial extra effort in setup easily pays for itself in the long run

• However, lots more you can do…

Understandability
• Keep all public methods/functions together, as neatly laid out as possible

• Don’t use inline method declarations

• Don’t mix public and private, and always put public at the top

• Break up into functional groups

• Comment every method with //<! what this does

Be Kind to the Linker
• Put everything possible in the anonymous namespace

• Hides implementation

• But also means the linker can discard those symbols

• Avoid inline templated code

• Either avoid templates, or consider explicit instantiation

Avoid Concrete Inheritance
• One of the biggest failings of OOO

• Threads the implementation of a system through several abstraction layers &

files

• Must understand entire system to change anything!

• In particular, can’t easily change base functionality, as you don’t have a clear
picture of how it’s being used
More Suggestions
• Prefer explicit over implicit code. Clever systems that automagically register classes
or entities obfuscate code. A manual call hierarchy is easier to debug + introspect.

• Don't use "advanced" C++ features. Avoid using new C++1x features unless already
proven, i.e., shown to be practical. (Reduce language complexity/surface area.)

• Keep argument lists small. If they start getting unwieldy, consider creating some form
of FUNC_INFO struct that is passed in instead.

• Don't use in-class method definitions. (Clutters API.)

• Use pointers rather than non-const references for arguments that will be modified.
(Makes argument use obvious from call site, without looking at implementation.)
The Golden Rule
• KISS

• Everything else is noise

• Feature coding is all about managing complexity.

Conclusion
• These may seem like niceties, but being strict about API vs implementation is
key to:

• Getting compile times down below five minutes

• Allowing new coders to get up to speed quickly

• Increasing code productivity in general

C++ For Embedded Systems (PDFDrive)
100% (1)
C++ For Embedded Systems (PDFDrive)
235 pages
Devwin 32
No ratings yet
Devwin 32
2,467 pages
M. Ali Asdar Departement of Pulmonology and Respiratory Medicine Faculty of Medicine University of Indonesia - Persahabatan General Hospital Jakarta
No ratings yet
M. Ali Asdar Departement of Pulmonology and Respiratory Medicine Faculty of Medicine University of Indonesia - Persahabatan General Hospital Jakarta
30 pages
John B. Goodenough
No ratings yet
John B. Goodenough
11 pages
To Issue Swing Door For Entrance To Ac Area (With Overhead Concealed Double Acting Door Closer) Mi006232
No ratings yet
To Issue Swing Door For Entrance To Ac Area (With Overhead Concealed Double Acting Door Closer) Mi006232
2 pages
Abrahams & Millar (2008)
No ratings yet
Abrahams & Millar (2008)
27 pages
OOP CAE3 Answers
No ratings yet
OOP CAE3 Answers
23 pages
Amcas Coursework Video
100% (2)
Amcas Coursework Video
7 pages
Blank 4
No ratings yet
Blank 4
47 pages
Manipulator
No ratings yet
Manipulator
27 pages
Hidden Overhead of A Function API
No ratings yet
Hidden Overhead of A Function API
158 pages
C++ in Huge AAA Games - Nicolas Fleury - CppCon 2014
No ratings yet
C++ in Huge AAA Games - Nicolas Fleury - CppCon 2014
51 pages
File Page No 1663658874765
No ratings yet
File Page No 1663658874765
10 pages
CS3505 Lecture4
No ratings yet
CS3505 Lecture4
33 pages
Regent College London New
No ratings yet
Regent College London New
2 pages
Web of Science Core Collection:: Journal Evaluation Process and Selection Criteria
No ratings yet
Web of Science Core Collection:: Journal Evaluation Process and Selection Criteria
35 pages
IELTS Writing Task 2
No ratings yet
IELTS Writing Task 2
34 pages
Programming Methodology
No ratings yet
Programming Methodology
118 pages
C++ Summary Sheet
No ratings yet
C++ Summary Sheet
2 pages
Making C++ Code Beautiful - Gregory and McNellis - CppCon 2014
No ratings yet
Making C++ Code Beautiful - Gregory and McNellis - CppCon 2014
85 pages
Using-Modern-Cpp-Techniques-To-Enhance-Multicore-Optimizations - Das's Edution
No ratings yet
Using-Modern-Cpp-Techniques-To-Enhance-Multicore-Optimizations - Das's Edution
17 pages
Syllabus
No ratings yet
Syllabus
7 pages
Downward Interfaces
No ratings yet
Downward Interfaces
88 pages
Lec12 (Topic 7 Advanced Topic)
No ratings yet
Lec12 (Topic 7 Advanced Topic)
47 pages
Basic Coding
No ratings yet
Basic Coding
5 pages
A Cyber Security Awareness and Education Framework For South Africa
No ratings yet
A Cyber Security Awareness and Education Framework For South Africa
219 pages
Anastasia Kazakova - Debug C++ W-O Running C++ On Sea
No ratings yet
Anastasia Kazakova - Debug C++ W-O Running C++ On Sea
55 pages
Lecture15 Slides
No ratings yet
Lecture15 Slides
105 pages
Aluminum and Glass Company in Qatar
No ratings yet
Aluminum and Glass Company in Qatar
5 pages
Modernizing Legacy C++ Code - Gregory and McNellis - CppCon 2014
No ratings yet
Modernizing Legacy C++ Code - Gregory and McNellis - CppCon 2014
81 pages
Get Started With Win32 and C++
No ratings yet
Get Started With Win32 and C++
148 pages
52386a80aca05d24a6950fb4a2377bc1
No ratings yet
52386a80aca05d24a6950fb4a2377bc1
147 pages
Risk Assessment Table New Version
No ratings yet
Risk Assessment Table New Version
4 pages
Practical Set-1: The Result Is 600 The Result Is 70
No ratings yet
Practical Set-1: The Result Is 600 The Result Is 70
12 pages
C++ Tutorial Part II - Advanced: Silan Liu
No ratings yet
C++ Tutorial Part II - Advanced: Silan Liu
53 pages
CS 212 OOP Lab Manual PDF
No ratings yet
CS 212 OOP Lab Manual PDF
79 pages
C For C Programmers
No ratings yet
C For C Programmers
41 pages
Winnt H
No ratings yet
Winnt H
62 pages
1. 听力部分SL Mock Examination02-S
No ratings yet
1. 听力部分SL Mock Examination02-S
8 pages
MFC Internals: III III
100% (1)
MFC Internals: III III
13 pages
Assignmt 3
No ratings yet
Assignmt 3
15 pages
06 Introduction To MFC
50% (2)
06 Introduction To MFC
172 pages
Doom 3 C++ Coding Style Conventions
No ratings yet
Doom 3 C++ Coding Style Conventions
7 pages
Turbo C++ FILELIST
No ratings yet
Turbo C++ FILELIST
7 pages
Daniel Science
No ratings yet
Daniel Science
10 pages
Chatgpt
No ratings yet
Chatgpt
6 pages
PG AHC Admissions Policy 2020
No ratings yet
PG AHC Admissions Policy 2020
13 pages
PLC Interview Questions
No ratings yet
PLC Interview Questions
3 pages
09 COM Fundamentals1
No ratings yet
09 COM Fundamentals1
49 pages
Lesson List and Schedule: Visual C++/MFC Tutorial - Lesson 1: Behind The Scenes With Handles and Messages
100% (2)
Lesson List and Schedule: Visual C++/MFC Tutorial - Lesson 1: Behind The Scenes With Handles and Messages
41 pages
Advanced C++ Programming Advanced C++ Programming
100% (2)
Advanced C++ Programming Advanced C++ Programming
319 pages
09 ParallelizationRecap PDF
No ratings yet
09 ParallelizationRecap PDF
62 pages
Lab Manual
No ratings yet
Lab Manual
79 pages
Data-Oriented Design and C++ - Mike Acton - CppCon 2014
No ratings yet
Data-Oriented Design and C++ - Mike Acton - CppCon 2014
201 pages
Advanced C++ Programming
No ratings yet
Advanced C++ Programming
69 pages
C Basics v3 PDF
No ratings yet
C Basics v3 PDF
81 pages
Coding Guidelines: Program
No ratings yet
Coding Guidelines: Program
23 pages
Fischer FBN Anchors
No ratings yet
Fischer FBN Anchors
23 pages
Sains (Kertas 2) PMR Perak
No ratings yet
Sains (Kertas 2) PMR Perak
17 pages
Unit II
No ratings yet
Unit II
57 pages
Krisis Hipertensi
No ratings yet
Krisis Hipertensi
29 pages
Cse225L-Datastructures and Algorithms Lab Lab 01 Dynamic Memory Allocation
No ratings yet
Cse225L-Datastructures and Algorithms Lab Lab 01 Dynamic Memory Allocation
29 pages
MFE C++ Intro
No ratings yet
MFE C++ Intro
37 pages
An Introduction To C++: Dave Klein
No ratings yet
An Introduction To C++: Dave Klein
37 pages
Amine Unit
100% (1)
Amine Unit
69 pages
05-Microsoft Foundation Classes
No ratings yet
05-Microsoft Foundation Classes
22 pages
Syllabus MKCU Semester 2
No ratings yet
Syllabus MKCU Semester 2
3 pages
Windows Programming PDF
No ratings yet
Windows Programming PDF
236 pages
6648 0400 5 PS Pi 0001 - F PDF
100% (1)
6648 0400 5 PS Pi 0001 - F PDF
97 pages
The Complete Windows Programming Guide - MFC Tutorials
100% (1)
The Complete Windows Programming Guide - MFC Tutorials
610 pages
Building Applications Using VC
No ratings yet
Building Applications Using VC
54 pages
Portfolio Grade 1 Math Lesson
No ratings yet
Portfolio Grade 1 Math Lesson
1 page
Conjugate Beam Method SLU
No ratings yet
Conjugate Beam Method SLU
41 pages
Unit II Notes
No ratings yet
Unit II Notes
14 pages
Employee Welfare
No ratings yet
Employee Welfare
44 pages
Days/weeks/months/years? A: When Your Program Should Run For A Long Time
No ratings yet
Days/weeks/months/years? A: When Your Program Should Run For A Long Time
5 pages
Syllabus 2021 Foundation Engineering
No ratings yet
Syllabus 2021 Foundation Engineering
4 pages
Course Contents
No ratings yet
Course Contents
19 pages
6th Sem. CS 1305.-Visual Prog.
No ratings yet
6th Sem. CS 1305.-Visual Prog.
5 pages
How To Draw A Bitmap in A MFC Dialog Window
No ratings yet
How To Draw A Bitmap in A MFC Dialog Window
9 pages
Visual C++
No ratings yet
Visual C++
6 pages
ZT105A (MT) 非公路自卸车技术规格书（中英） 20230818
No ratings yet
ZT105A (MT) 非公路自卸车技术规格书（中英） 20230818
15 pages
Bottom of Form C++ Interview Questions and Answers How Can You Tell What Shell You Are Running On UNIX System?
No ratings yet
Bottom of Form C++ Interview Questions and Answers How Can You Tell What Shell You Are Running On UNIX System?
42 pages
Update to Modern C++
From Everand
Update to Modern C++
James Raynard
No ratings yet
The Beginner’s Guide to Kilo Code
From Everand
The Beginner’s Guide to Kilo Code
Steven Mcananey
No ratings yet
Isd Process V1
100% (1)
Isd Process V1
3 pages
Some Tutorials in Computer Networking Hacking
From Everand
Some Tutorials in Computer Networking Hacking
Dr. Hidaia Mahmood Alassouli
No ratings yet
C Programming for the Pc the Mac and the Arduino Microcontroller System
From Everand
C Programming for the Pc the Mac and the Arduino Microcontroller System
Peter D Minns
No ratings yet
The Little Book of Sitecore® Tips: Volume 1
From Everand
The Little Book of Sitecore® Tips: Volume 1
Neil P Shack
No ratings yet
SRS - How to build a Pen Test and Hacking Platform
From Everand
SRS - How to build a Pen Test and Hacking Platform
alasdair gilchrist
2/5 (1)

Large Code Bases

Uploaded by

Large Code Bases

Uploaded by

Coding for Large Code Bases

• Common for any AAA game these days

• Large shared codebase providing core functionality

• Working within a large team

• Most C++ references and textbooks are targeted at ~1000-100,000 line

• Majority of code is code you didn’t write

• Learning the codebase, making changes easily

• Single-line-change iteration time, recompile times, *relink* times.

• More files: need to skip around a lot to get complete picture

• How long does it take to recompile affected code

• That one header everything includes

• Implementation leaking into interface

• Number of lines parsed while compiling a module

• Note: Cost of file open much bigger than the read

• Number of redundant symbols that must be unified or stripped by the linker

• Sheer symbol count

• A class definition must include implementation!

• Still waiting on module support decades later

• But hey we have <wat> from c++1<x>

• But, almost all large code bases are written in C++

• Mixture of public and private makes API less intelligible

• Leads to extending the size of the compilation unit

• #including other files for implementation detail

• Implementation repeated in each including module, redundancy removed by

• Contrast with explicit IntArray class: single implementation in single module

• Can wind up repeated in all including modules unless care is taken

• constexpr helps these days

• Code within functional code that is never exercised (future looking?)

• Often kept as a “safety blanket”

• Makes it difficult to reason about how code can be fixed or extended

• Makes it difficult to refactor or clean up code

• Deleting more lines than you add is productive in a large codebase!

• Changing implementation should only cause local recompilation

• Those includes are smaller

• Fewer public symbols

• Forward declare everything where possible

• enum BLAH : S32;

• class BLAH { enum COW : S32; class BOB; };

• Don’t use class scope. (Can’t forward declare)

• No nested enums, classes/structs

• Prefer functions if there is no state

• Prefer functions over static methods

• Don’t #include platform/OS headers from public API

• Three possible methods:

• Functional APIs (C-style).

• “Concrete” implementation class inherits from interface, header is private

• Pros: plugin nature, DLLs

• Cons: virtual function call overhead, separate (third) header

virtual S32 get_num_inputs() const = 0; //!< Returns current number of inputs.

void add_time(const F32 elapsed_time) override;

bool process() override;

S32 get_num_inputs() const override;

• Often combined with intrusive reference counting for lifetime management.

• blah.h/blah_internal.h/blah.cpp, or simply blah.h/blah.cpp

• Two allocations per class instantiation (can be avoided with wrappers)

• Need either pass-through functions or prefixing internal data references

FILE_ITEM_REF m_selection; //!< Currently selected item

void UI_FILE_BROWSER::set_file_types( S32 num_file_types, const SICORE::FILE_TYPE file_types[] )

• Often use handles rather than pointers

• Larger codebases often use several

• However, lots more you can do…

• Don’t use inline method declarations

• Break up into functional groups

• Comment every method with //<! what this does

• But also means the linker can discard those symbols

• Avoid inline templated code

• Either avoid templates, or consider explicit instantiation

• Threads the implementation of a system through several abstraction layers &

• Must understand entire system to change anything!

• Don't use in-class method definitions. (Clutters API.)

• Everything else is noise

• Feature coding is all about managing complexity.

• Getting compile times down below five minutes

• Allowing new coders to get up to speed quickly

• Increasing code productivity in general

You might also like

• Single-line-change iteration time, recompile times, relink times.