Armadillo: C++ library for linear algebra & scientific computing

Armadillo employs a delayed evaluation approach to combine several operations into one and reduce (or eliminate) the need for temporaries. Where applicable, the order of operations is optimised. Delayed evaluation and optimisation are achieved through recursive templates and template meta-programming.
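To make the idea of delayed evaluation concrete, here is a minimal expression-template sketch. It is not Armadillo's implementation: the names `Vec`, `Sum` and `add_four` are hypothetical, and it handles only element-wise addition of vectors. Each `operator+` returns a lightweight object that records its operands, and the arithmetic runs only when the expression is assigned, in a single loop with no intermediate vectors.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical types for illustration only; not Armadillo's classes.

// Expression node for "lhs + rhs". It stores references and computes nothing
// until an element is requested through operator[].
template <typename L, typename R>
struct Sum {
    const L& lhs;
    const R& rhs;
    double operator[](std::size_t i) const { return lhs[i] + rhs[i]; }
    std::size_t size() const { return lhs.size(); }
};

struct Vec {
    std::vector<double> data;

    explicit Vec(std::size_t n, double value = 0.0) : data(n, value) {}

    double operator[](std::size_t i) const { return data[i]; }
    std::size_t size() const { return data.size(); }

    // Assigning from an expression evaluates the whole chain in one loop.
    template <typename L, typename R>
    Vec& operator=(const Sum<L, R>& expr) {
        assert(expr.size() == size());
        for (std::size_t i = 0; i < size(); ++i) data[i] = expr[i];
        return *this;
    }
};

// Vec + Vec builds an expression node instead of a new Vec.
inline Sum<Vec, Vec> operator+(const Vec& a, const Vec& b) {
    assert(a.size() == b.size());
    return {a, b};
}

// (expression) + Vec extends the chain, again without computing anything.
template <typename L, typename R>
Sum<Sum<L, R>, Vec> operator+(const Sum<L, R>& a, const Vec& b) {
    assert(a.size() == b.size());
    return {a, b};
}

// Usage: four operands are summed in a single pass over the elements.
inline Vec add_four(const Vec& a, const Vec& b, const Vec& c, const Vec& d) {
    Vec z(a.size());
    z = a + b + c + d;  // type: Sum<Sum<Sum<Vec,Vec>,Vec>,Vec>
    return z;
}
```

The expression nodes hold references to temporaries, so in this sketch an expression must be assigned within the statement that builds it, as `add_four` does.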

While chained operations such as addition, subtraction and multiplication (matrix and element-wise) are the primary targets for speed-up, other operations, such as manipulation of submatrices, can also be optimised. Care was taken to maintain efficiency for both "small" and "big" matrices.
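One situation where the order of operations matters is a chain of matrix multiplications: the result is the same whichever product is formed first, but the amount of work can differ greatly. The sketch below is an assumption about the kind of decision involved, not Armadillo's actual code. The names `Dims`, `mul_cost`, `cost_left_first` and `cost_right_first` are hypothetical, and cost is counted simply as the number of scalar multiplications.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helpers for illustration only; not part of Armadillo.

struct Dims {
    std::size_t rows;
    std::size_t cols;
};

// Scalar multiplications needed for an (r x k) * (k x c) product.
inline std::size_t mul_cost(std::size_t r, std::size_t k, std::size_t c) {
    return r * k * c;
}

// Cost of evaluating the chain as (A*B)*C.
inline std::size_t cost_left_first(Dims a, Dims b, Dims c) {
    assert(a.cols == b.rows && b.cols == c.rows);
    return mul_cost(a.rows, a.cols, b.cols)    // A*B   -> a.rows x b.cols
         + mul_cost(a.rows, b.cols, c.cols);   // (AB)*C
}

// Cost of evaluating the chain as A*(B*C).
inline std::size_t cost_right_first(Dims a, Dims b, Dims c) {
    assert(a.cols == b.rows && b.cols == c.rows);
    return mul_cost(b.rows, b.cols, c.cols)    // B*C   -> b.rows x c.cols
         + mul_cost(a.rows, a.cols, c.cols);   // A*(BC)
}

// True when forming A*B first needs no more multiplications than B*C first.
inline bool left_first_is_cheaper(Dims a, Dims b, Dims c) {
    return cost_left_first(a, b, c) <= cost_right_first(a, b, c);
}
```

For example, with A of size 100x2, B of size 2x100 and C of size 100x2, forming A*B first costs 40000 multiplications, while forming B*C first costs 800.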

See also the Questions page for more information about speed enhancements.

Below is a set of timing comparisons against two other C++ matrix libraries (IT++ and Newmat) which have comparable functionality. The comparisons were done using an Intel Core2 Duo CPU (2 GHz, 2 MB cache), Linux kernel 2.6.26, GCC 4.3.0. In each case the value of N (see the code below) was chosen empirically so that each test took at least 5 seconds.

Code extract

// size and N are specified by the user on the command line
mat A = randu(size,size);
mat B = randu(size,size);
...
mat Z = zeros(size,size);

wall_clock timer;  // Armadillo's timer class
timer.tic();

for(int i=0; i<N; ++i)
  {
  Z = A+B;  //  or Z = A+B+C ... etc
  }

// average time per iteration, in seconds
cout << "time taken = " << timer.toc() / double(N) << endl;
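The text says only that N was found empirically so that each test ran for at least 5 seconds. One possible way to automate that, shown here as an assumption and not as the method used for these comparisons, is to double N until a timed run reaches the target duration. The names `time_run` and `find_iterations` are hypothetical, and the sketch uses only the standard library so that it does not depend on any matrix library.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>

// Hypothetical helper: wall-clock seconds taken by n calls of work().
template <typename Work>
double time_run(Work work, std::size_t n) {
    const auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < n; ++i) work();
    const std::chrono::duration<double> elapsed =
        std::chrono::steady_clock::now() - start;
    return elapsed.count();
}

// Hypothetical helper: doubles n until measure(n) reports at least
// min_seconds, or until max_n is reached. measure is any callable that
// takes an iteration count and returns the elapsed time in seconds.
template <typename Measure>
std::size_t find_iterations(Measure measure, double min_seconds,
                            std::size_t max_n = std::size_t(1) << 30) {
    assert(min_seconds > 0.0);
    std::size_t n = 1;
    while (n < max_n && measure(n) < min_seconds) n *= 2;
    return n;
}
```

A real run would pass something like `[&](std::size_t n) { return time_run(work, n); }` as `measure`, where `work` performs one evaluation of the expression under test. Taking the measurement as a parameter keeps the search itself easy to check with a synthetic timing function.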

Add two matrices

Z = A+B

Matrix size: 4x4

Approximate speed-up relative to

IT++:	15 times
Newmat:	10 times

Matrix size: 100x100

Approximate speed-up relative to

IT++:	3.5 times
Newmat:	same speed

Add four matrices

Z = A+B+C+D

Matrix size: 4x4

Approximate speed-up relative to

IT++:	15 times
Newmat:	10 times

Matrix size: 100x100

Approximate speed-up relative to

IT++:	6 times
Newmat:	1.5 times

Multiply four matrices

Z = A*B*C*D