Abstract: Performance in modern embedded systems, particularly those executing computation-intensive signal/image processing and machine learning algorithms, is critically dependent on the efficiency ...