e-PAL (epsilon-Pareto Active Learning) is an algorithm designed to quickly predict the Pareto-optimal solutions in a multi-objective design space. This is particularly useful when measuring the target objectives is expensive, and therefore only a few samples of the design space should be drawn for measurement.

e-PAL predicts a Pareto front of the design space with the level of accuracy or granularity desired by the user. This granularity is specified with a vector parameter epsilon.

Assume that two objective functions are to be maximized simultaneously. The figure below shows an example of the true Pareto front of a design space contrasted with an epsilon-accurate Pareto front that could be predicted by e-PAL, with epsilon=(3,3).

As shown in the figure, certain amount of redundancy can be removed when epsilon is greater than zero.

All our work on learning for optimization and autotuning.

The challenge of multi-objective design spaces is that rather than having a single optimal solution, several solutions may simultaneously optimize the target objective functions.

A concrete example in the domain of hardware design, are the design spaces generated by the Spiral DFT and sorting network hardware generators, when exploring the different settings provided by the tool. The user needs to choose between different candidate designs that perform the same task but that trade differently throughput and area. This challenge is aggravated by the fact that evaluating each design is extremely expensive, since complex synthesis processes must be executed. Thus, it is impractical and sometimes infeasible to evaluate every possible design in order to find the Pareto-optimal solutions that maximize throughput and minimize area at the same time.

e-PAL has several key features. Having drawn a few samples of the design space, it builds models using Gaussian processes to predict the objective functions for the rest of the space. Further, it uses the predictive uncertainty associated with these nonparametric models in order to guide the iterative sampling. Specifically, e-PAL's sampling strategy aims to maximize progress on designs that are likely to be Pareto-optimal. e-PAL iteratively discards points that are either redundant or suboptimal, and it terminates when no more points can be removed in order to guarantee that, with high probability, the remaining points define an epsilon-accurate Pareto set of the given design space.

We have derived theoretical bounds on the number of iterations required by the algorithm to generate a prediction with the level of accuracy (epsilon) and confidence (delta) specified by the user.e-PAL is an improved version of PAL[1]. Unlike PAL, e-PAL returns an epsilon-accurate Pareto front of the given design space. In addition, e-PAL reduces the number of computations required in each iteration of the algorithm, making it more suitable for large design spaces.

The plots below show the results of our experiments, performed based on data sets obtained in [3], [4] and [5].

In the y-axis, we show the accuracy of the Pareto front predicted when the algorithm has terminated. Accuracy is measured as the average maximum distance found between a point in the true pareto front and the closest point in the predicted Pareto front. The x-axis shows the number of designs that had to be evaluated in order to generate the prediction.

The results show that different accuracy/training-cost tradeoffs can be obtained by varying the value of epsilon. A larger epsilon causes e-PAL to stop earlier.

We compare the efficiency of e-PAL with those obtained with PAL and with the evolutionary algorithm ParEGO [6]. These results show that e-PAL in almost all cases significantly improves over PAL and ParEGO.

More details on these experiments can be found in [1].

This implementation is written in Matlab, and uses the GPML libraries written by Carl Edward Rasmussen and Chris Williams [2]. e-PAL can be used for any number of objective functions, however this implementation is limited to two objective functions.

- Marcela Zuluaga, Andreas Krause, Guillaume Sergent and Markus Püschel
**Active Learning for Multi-Objective Optimization**

Proc. International Conference on Machine Learning (ICML), 2013 - Marcela Zuluaga, Peter A. Milder, and Markus Püschel

**Computer Generation of Streaming Sorting Networks**

Proc. Design Automation Conference (DAC), 2012 - Carl Edward Rasmussen and Christopher K. I. Williams

**Gaussian Processes for Machine Learning**

The MIT Press, 2006 - Oscar Almer, Nigel Topham, and Björn Franke

**Learning-Based Approach to the Automated Design of MPSoC Networks**

Proc. Architecture of Computing Systems (ARCS), 2011 - N. Siegmund, S. S. Kolesnikov, C. Kastner, S. Apel, D. Batoryx, M. Rosenmuller, and G. Saake

**Predicting Performance via Automated Feature-Interaction Detection**

Proc. Int'l Conference on Software Engineering (ICSE), 2012 - J. Knowles

**ParEGO: a Hybrid Algorithm with Online Landscape Approximation for Expensive Multiobjective Optimization Problems**

IEEE Trans. on Evolutionary Computation, 2006

Contact: Marcela Zuluaga <firstname.lastname@inf.ethz.ch>