Also, a pure MATLAB implementation should be simple to write by following the algorithm in the paper, and it would be as fast as a C++ implementation with MKL, since the computational bottleneck of the ...