Exploring Computation Graphs in Rust

The document discusses exploring different ways to model computation graphs in Rust. It begins by describing computation graphs and providing an example. It then explores modeling the graphs using vector indices, nodes as an enum, and finally nodes as a trait. For each approach, it implements getting values, derivatives, and other graph operations. Overall, it finds modeling nodes as a trait to be a good solution to avoid match statements ballooning with different node types.


17 Jun 2018
Paul Kernfeld dot com

Recently, I’ve been trying to figure out a good way to model computation graphs in Rust. In this post, I
explore using a graph with vector indices. I’m not sure if this is the best approach, but writing it out has
helped me to understand the advantages and disadvantages better.

When I say “computation graph,” I mean a representation of a mathematical expression like 2 * a + a * b.
This example contains a constant (2), two variables (a and b), and two functions (addition and
multiplication). This expression can be modeled as a directed acyclic graph:

    2   a   b
     \ / \ /
      *   *
       \ /
        +

In my ASCII diagram above, all edges go downwards. In general, edges don’t really have any interesting
information associated with them so I’m going to pretty much ignore them.
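
To make the diagram concrete, here is a minimal sketch of the same expression stored as a flat list of nodes, where each function node refers to its children by position. The names `Op`, `evaluate`, and `two_a_plus_a_b` are hypothetical and exist only for this illustration; they are not part of the implementations later in the post. Note that `a` appears once in the list but is used by both multiplications, which is what makes this a DAG rather than a tree.

```rust
/// Hypothetical node type, for illustration only. Children are referred to by
/// their position in the node list.
pub enum Op {
    Constant(f64),
    Input(f64),
    Mul(usize, usize),
    Add(usize, usize),
}

/// Computes a value for every node, assuming children appear before parents.
pub fn evaluate(nodes: &[Op]) -> Vec<f64> {
    let mut values: Vec<f64> = Vec::with_capacity(nodes.len());
    for node in nodes {
        let value = match node {
            Op::Constant(c) | Op::Input(c) => *c,
            Op::Mul(l, r) => values[*l] * values[*r],
            Op::Add(l, r) => values[*l] + values[*r],
        };
        values.push(value);
    }
    values
}

/// Builds 2 * a + a * b and returns the value of the final `+` node.
pub fn two_a_plus_a_b(a: f64, b: f64) -> f64 {
    let nodes = [
        Op::Constant(2.0), // 0
        Op::Input(a),      // 1: shared by both multiplications
        Op::Input(b),      // 2
        Op::Mul(0, 1),     // 3: 2 * a
        Op::Mul(1, 2),     // 4: a * b
        Op::Add(3, 4),     // 5: the final node
    ];
    evaluate(&nodes)[5]
}
```

With a = 3 and b = 4, the two products are 6 and 12, so the final node evaluates to 18.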

A homogeneous graph
Below is a homogeneous DAG as a jumping-off point. This is strongly inspired by “Modeling graphs in
Rust using vector indices.” This isn’t a computation graph, but it does let us implement an algorithm that
traverses the graph and memoizes intermediate values. This is important because the number of paths
through a DAG can grow exponentially with the number of nodes, meaning that recursive
implementations can be very slow.

/// Deriving Copy reduces ownership headaches
#[derive(Copy, Clone)]
pub struct Idx(usize);

pub struct Node {
    children: Vec<Idx>,
}

#[derive(Default)]
pub struct Graph {
    nodes: Vec<Node>,
}

/// Graph maintains the invariant that nodes can only be added, never removed. This means that
/// a particular Idx will always be valid as long as it is used with the correct Graph.
impl Graph {
    pub fn push(&mut self, children: Vec<Idx>) -> Idx {
        self.nodes.push(Node { children });
        Idx(self.nodes.len() - 1)
    }

    /// This returns the number of paths between each leaf node and the final node. This
    /// implementation memoizes the number of paths from leaves to each node.
    ///
    /// Note that "final node" is only a meaningful concept in a DAG where there is one node
    /// that is the ancestor of every other node in the graph; I'm using it here for simplicity.
    pub fn count_paths(&self) -> usize {
        let mut path_counts = Vec::new();

        for node in &self.nodes {
            let paths_to_here = if node.children.is_empty() {
                1
            } else {
                node.children
                    .iter()
                    .map(|child_index| path_counts[child_index.0])
                    .sum()
            };
            path_counts.push(paths_to_here);
        }

        path_counts[path_counts.len() - 1]
    }
}

let mut g = Graph::default();

let a = g.push(vec![]);
let b = g.push(vec![a]);
let c = g.push(vec![a, b]);
let d = g.push(vec![a, b, c]);

// All paths are:
// a -> b -> c -> d
// a -> b ------> d
// a ------> c -> d
// a -----------> d
assert_eq!(4, g.count_paths());
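
To illustrate the earlier claim about exponential growth, here is a standalone sketch that does not use the Graph type above. It assumes a DAG given as plain child lists in which every node has all earlier nodes as children, the same shape as the a, b, c, d example. The helper `dense_dag` is hypothetical. In this shape each added node doubles the path count, so a graph of n nodes (n ≥ 2) has 2^(n-2) paths, while the memoized loop still does only one pass over the nodes.

```rust
/// Counts leaf-to-final-node paths in a DAG given as child lists, where
/// children always have smaller indices than their parents.
pub fn count_paths(children: &[Vec<usize>]) -> u64 {
    let mut path_counts: Vec<u64> = Vec::with_capacity(children.len());
    for node_children in children {
        let paths_to_here: u64 = if node_children.is_empty() {
            1
        } else {
            // Each child's count is already memoized because it was pushed earlier.
            node_children.iter().map(|&child| path_counts[child]).sum()
        };
        path_counts.push(paths_to_here);
    }
    // An empty graph has no final node, so report zero paths.
    path_counts.last().copied().unwrap_or(0)
}

/// Hypothetical helper: node k has every earlier node 0..k as a child.
pub fn dense_dag(n: usize) -> Vec<Vec<usize>> {
    (0..n).map(|k| (0..k).collect()).collect()
}
```

For example, `count_paths(&dense_dag(4))` gives 4, matching the assertion above, and `count_paths(&dense_dag(20))` gives 262144 (2^18). A naive recursive count would have to walk each of those paths individually.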

What if Node is an enum?

Next, a graph that can actually do some computation. I’ve installed a few upgrades relative to the
previous implementation:

• There are three different kinds of Node: Constant, Variable, and Sum
• There is a Subgraph type that represents an ordered set of nodes
• Idx implements std::ops::Add for cute graph-building syntax
• There is a derivative method that transforms a subgraph
• Graph implements Index<Idx> for slightly more type safety

use std::collections::{HashMap, HashSet};
use std::ops::{Add, Index};

/// To enable this to be used in HashMap and HashSet, this derives Eq, PartialEq, and Hash
#[derive(Copy, Clone, Eq, Hash, PartialEq)]
pub struct Idx(usize);

impl Add for Idx {
    type Output = Node;

    fn add(self, rhs: Idx) -> Node {
        Node::Sum { children: vec![self, rhs] }
    }
}

pub enum Node {
    Constant(f64),
    Variable,
    Sum { children: Vec<Idx> },
}

impl Node {
    fn get_value(&self, my_index: &Idx, values: &HashMap<Idx, f64>) -> f64 {
        match self {
            Node::Constant(value) => *value,
            Node::Variable => values[my_index],
            Node::Sum { children } => children.iter().map(|child| values[child]).sum(),
        }
    }

    fn derivative(
        &self,
        my_index: &Idx,
        wrt: &HashSet<Idx>,
        derivatives: &HashMap<Idx, Idx>,
    ) -> Node {
        match self {
            Node::Constant(_) => Node::Constant(0.0),
            Node::Variable => {
                if wrt.contains(my_index) {
                    Node::Constant(1.0)
                } else {
                    Node::Constant(0.0)
                }
            }
            Node::Sum { children } => Node::Sum {
                children: children.iter().map(|child| derivatives[child]).collect(),
            },
        }
    }
}

/// This helps us to represent the idea that only a subset of the nodes in a graph might be
/// relevant for a particular computation. The indices in a Subgraph are ordered such that a
/// child always comes before one of its parents.
pub struct Subgraph {
    indices: Vec<Idx>,
}

impl Subgraph {
    fn new(indices_unsorted: impl Iterator<Item = Idx>) -> Self {
        let mut indices: Vec<Idx> = indices_unsorted.collect();

        // This is an easy way to enforce the order condition
        indices.sort_unstable_by_key(|index| index.0);
        Self { indices }
    }
}

#[derive(Default)]
pub struct Graph {
    nodes: Vec<Node>,
}

impl Graph {
    pub fn push(&mut self, node: Node) -> Idx {
        self.nodes.push(node);
        Idx(self.nodes.len() - 1)
    }

    pub fn as_subgraph(&self) -> Subgraph {
        Subgraph { indices: self.nodes.iter().enumerate().map(|(i, _)| Idx(i)).collect() }
    }

    /// Given values for each relevant variable, this computes the value for each node in the
    /// graph.
    pub fn evaluate_subgraph(
        &self,
        subgraph: Subgraph,
        variable_to_value: HashMap<Idx, f64>,
    ) -> HashMap<Idx, f64> {
        let mut result = variable_to_value;

        for index in subgraph.indices.iter() {
            let value = self[*index].get_value(index, &result);
            result.insert(*index, value);
        }

        result
    }

    pub fn evaluate(&self, variable_to_value: HashMap<Idx, f64>) -> HashMap<Idx, f64> {
        self.evaluate_subgraph(self.as_subgraph(), variable_to_value)
    }

    /// This transforms the graph by taking the derivative
    pub fn derivative(&mut self, of: Idx, wrt: HashSet<Idx>) -> (Idx, Subgraph) {
        // Memoize the derivative of each node
        let mut derivatives: HashMap<Idx, Idx> = HashMap::new();

        for old_index in 0..self.nodes.len() {
            let old_index = Idx(old_index);
            let new_node = self[old_index].derivative(&old_index, &wrt, &derivatives);
            let new_index = self.push(new_node);
            derivatives.insert(old_index, new_index);
        }

        // The subgraph contains all the new nodes we just created
        (
            derivatives[&of],
            Subgraph::new(derivatives.values().cloned()),
        )
    }
}

impl Index<Idx> for Graph {
    type Output = Node;

    fn index(&self, index: Idx) -> &Node {
        &self.nodes[index.0]
    }
}

// c = 1 + b
let mut g = Graph::default();
let a = g.push(Node::Constant(1.0));
let b = g.push(Node::Variable);
let c = g.push(a + b);

// 1 + 2 = 3
let variable_to_value = {
    let mut result = HashMap::new();
    result.insert(b, 2.0);
    result
};
assert_eq!(3.0, g.evaluate(variable_to_value)[&c]);

// The derivative of c wrt b is just 1
let wrt = {
    let mut result = HashSet::new();
    result.insert(b);
    result
};
let (d_c_b, subgraph) = g.derivative(c, wrt);
assert_eq!(1.0, g.evaluate_subgraph(subgraph, HashMap::new())[&d_c_b]);

Overall, I’m pretty happy with this implementation. However, adding many different types of node will
cause the match statements to balloon. Additionally, I would rather see all the code for one type of Node
in one place.

What if Node is a trait?

To solve this, I’m going to make Node a trait instead of an enum. I have hidden aspects of the
implementation that are the same as in the previous implementation; the complete implementation is in
the source code for this post.

pub trait Node: 'static {
    /// The input must include values for all variables and for all children of this node.
    fn get_value(&self, my_index: &Idx, values: &HashMap<Idx, f64>) -> f64;

    fn derivative(
        &self,
        my_index: &Idx,
        wrt: &HashSet<Idx>,
        derivatives: &HashMap<Idx, Idx>,
    ) -> Box<dyn Node>;
}

pub struct Constant(f64);

impl Node for Constant {
    fn get_value(&self, _my_index: &Idx, _values: &HashMap<Idx, f64>) -> f64 {
        self.0
    }

    fn derivative(
        &self,
        _my_index: &Idx,
        _wrt: &HashSet<Idx>,
        _derivatives: &HashMap<Idx, Idx>,
    ) -> Box<dyn Node> {
        Box::new(Constant(0.0))
    }
}

pub struct Variable;

impl Node for Variable {
    fn get_value(&self, my_index: &Idx, values: &HashMap<Idx, f64>) -> f64 {
        values[my_index]
    }

    fn derivative(
        &self,
        my_index: &Idx,
        wrt: &HashSet<Idx>,
        _derivatives: &HashMap<Idx, Idx>,
    ) -> Box<dyn Node> {
        if wrt.contains(my_index) {
            Box::new(Constant(1.0))
        } else {
            Box::new(Constant(0.0))
        }
    }
}

pub struct Sum {
    children: Vec<Idx>,
}

impl Node for Sum {
    fn get_value(&self, _my_index: &Idx, values: &HashMap<Idx, f64>) -> f64 {
        self.children.iter().map(|child| values[child]).sum()
    }

    fn derivative(
        &self,
        _my_index: &Idx,
        _wrt: &HashSet<Idx>,
        derivatives: &HashMap<Idx, Idx>,
    ) -> Box<dyn Node> {
        Box::new(Sum {
            children: self.children
                .iter()
                .map(|child| derivatives[child])
                .collect(),
        })
    }
}

/// Since `dyn Node` is not Sized, we need to box it so we can put it into a Vec.
#[derive(Default)]
pub struct Graph {
    nodes: Vec<Box<dyn Node>>,
}

/// This is almost identical to the Graph implementation with the enum, except that push is
/// now generic over any type implementing Node, and I've added push_box for nodes that are
/// already boxed.
impl Graph {
    pub fn push_box(&mut self, box_node: Box<dyn Node>) -> Idx {
        self.nodes.push(box_node);
        Idx(self.nodes.len() - 1)
    }

    pub fn push<N: Node>(&mut self, node: N) -> Idx {
        self.push_box(Box::new(node))
    }
}

// c = 1 + b
let mut g = Graph::default();
let a = g.push(Constant(1.0));
let b = g.push(Variable);
let c = g.push_box(a + b);

// 1 + 2 = 3
let variable_to_value = {
    let mut result = HashMap::new();
    result.insert(b, 2.0);
    result
};
assert_eq!(3.0, g.evaluate(variable_to_value)[&c]);

// The derivative of c wrt b is just 1
let wrt = {
    let mut result = HashSet::new();
    result.insert(b);
    result
};
let (d_c_b, subgraph) = g.derivative(c, wrt);
assert_eq!(1.0, g.evaluate_subgraph(subgraph, HashMap::new())[&d_c_b]);

While implementing this, I noticed that making Node a trait enforces a clean separation of responsibility
between the graph and the node. I actually used the implementation of this version to clean up the
factorization of the enum version.

However, making Node a trait brings with it the ergonomic disadvantage that nodes often need to be
passed around inside a Box, which is slightly annoying. Separately, there is a performance penalty
because we are now using dynamic dispatch instead of static dispatch. I don’t think that I care too much
about this, because I’m interested in using these graphs with large tensors where the cost of the actual
computation will dwarf the cost of traversing the graph.

Advantages
I was happy about several aspects of this experiment:

• I did not need to introduce explicit lifetimes at all.

• I think it will be possible to construct one of these graphs at runtime. This means that a serialized
graph could be loaded in from a file, for example.
• I didn’t need to rely on any external dependencies. This is not usually a goal of mine but it makes
the code easier to understand.
• Having Idx implement Copy makes it more pleasant to work with.

Disadvantages
• The syntax for creating a graph is a bit cumbersome, since we have to write g.push(...) every
time we want to add a node to the graph.
• Having Graph, Subgraph, Node, and Idx is a lot of structs even for this toy implementation.
• I don’t entirely understand why Node needs to have the static lifetime. Hopefully that doesn’t
mean anything bad.
• Since a Node doesn’t know its own index, the index needs to be passed around a lot.
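
On the `'static` question, a likely explanation is Rust's default lifetime for trait objects: a boxed trait object stored in a struct field, written `Box<dyn Node>` in current syntax, is shorthand for `Box<dyn Node + 'static>`. Any concrete type boxed into the graph must therefore contain no short-lived borrows. The sketch below uses a hypothetical, pared-down `Node` trait without the `'static` supertrait, so the requirement shows up on `push` instead. It is an illustration under those assumptions, not the post's actual code.

```rust
/// Hypothetical, pared-down trait without the `'static` supertrait.
pub trait Node {
    fn value(&self) -> f64;
}

pub struct Constant(pub f64);

impl Node for Constant {
    fn value(&self) -> f64 {
        self.0
    }
}

#[derive(Default)]
pub struct Graph {
    // In a struct field, `Box<dyn Node>` means `Box<dyn Node + 'static>`.
    nodes: Vec<Box<dyn Node>>,
}

impl Graph {
    /// Without the `'static` bound on `N`, this fails to compile, because `N`
    /// might contain references that do not live as long as the graph.
    pub fn push<N: Node + 'static>(&mut self, node: N) -> usize {
        self.nodes.push(Box::new(node));
        self.nodes.len() - 1
    }

    pub fn value(&self, index: usize) -> f64 {
        self.nodes[index].value()
    }
}
```

Declaring `trait Node: 'static` moves this bound into the trait, so `push<N: Node>` can stay free of lifetime annotations. The cost is that no node type can hold a non-`'static` reference, which is harmless for owned data like `Constant(f64)`.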

Future directions
I have an unhealthy obsession with building an elegant DSL in vanilla Rust. I would love to be able to
create a graph by writing something like this:

let x = variable();
let y = variable();
let z = x * 2.0 + y;

One somewhat crazy direction of exploration would be to allow different nodes in the graph to implement
different traits.

I would like to be able to save and load graphs.

It would be good to have a way to represent functions that are composed of smaller functions, like
softmax.

These questions aside, the most obvious ways to make this more useful would be to implement many
different functions and to allow computation on data such as tensors.

About
This blog post was produced using cargo-readme to ensure that all of the code actually works. The source
code is here.
