Professional Documents
Culture Documents
DOI 10.1007/s10287-010-0127-2
ORIGINAL PAPER
Robust portfolio optimization with a hybrid heuristic
algorithm
Bjrn Fastrich Peter Winker
Received: 24 September 2009 / Accepted: 8 December 2010 / Published online: 25 December 2010
Springer-Verlag 2010
Abstract Estimation errors in both the expected returns and the covariance ma-
trix hamper the construction of reliable portfolios within the Markowitz framework.
Robust techniques that incorporate the uncertainty about the unknown parameters are
suggested in the literature. We propose a modication as well as an extension of such
a technique and compare both with another robust approach. In order to eliminate
oversimplications of Markowitz portfolio theory, we generalize the optimization
framework to better emulate a more realistic investment environment. Because the
adjusted optimization problemis no longer solvable with standard algorithms, we em-
ploy a hybrid heuristic to tackle this problem. Our empirical analysis is conducted with
a moving time window for returns of the German stock index DAX100. The results
of all three robust approaches yield more stable portfolio compositions than those of
the original Markowitz framework. Moreover, the out-of-sample risk of the robust
approaches is lower and less volatile while their returns are not necessarily smaller.
Keywords Hybrid heuristic Algorithm Markowitz Robust optimization
Uncertainty sets
B. Fastrich P. Winker (B)
Department of Statistics and Econometrics, University of Giessen,
Licher Strasse 64, 35394 Giessen, Germany
e-mail: Peter.Winker@wirtschaft.uni-giessen.de
B. Fastrich
e-mail: Bjoern.Fastrich@wirtschaft.uni-giessen.de
P. Winker
Center for European Economic Research (ZEW),
L7,1, 68161 Mannheim, Germany
123
64 B. Fastrich, P. Winker
1 Introduction
An investors primary objective is to optimally allocate his nancial resources among
a given set of assets. This allocation is traditionally modeled following Markowitz
(1952), the rst to explicitly quantify the trade-off between risk and return within the
process of portfolio selection. The model assumes a market of K assets with multi-
variate normally distributed expected returns, given in the (K 1)-vector , and a
(K K)-covariance matrix of returns . Efcient portfolios can be constructed if the
weights w
i
of the assets i = 1, . . . , K, are chosen such that the following problem is
solved:
max
w
(1 )w
_
w
w (1)
subject to
w
l
K
= 1
where l
K
= (1, . . . , 1)
and w = (w
1
, . . . , w
K
)
i I
p
n
i
P
i
(1 +
i
)
i I
p
I
prev C
i
_
+ R
p
V
1
__
w
p
w
p
_
(2)
subject to
X = {I
p
{1, . . . , K} | #I
p
K
max
} set of feasible asset combinations
w
i
=
n
i
P
i
V
i I
p
discrete portfolio weights
C
i
= c
f
z
i
+ c
v
|n
i
| P
i
i I
p
I
prev
transaction costs
z
i
=
_
1 if |n
i
| > 0 i I
p
I
prev
0 otherwise
indicator variable
R
p
= V
i I
p
n
i
P
i
i I
p
I
prev C
i
residual cash holdings
w
l
w
i
w
u
i I
p
no short-selling
Due to its complexity, a discrete search space, and multiple optima, problem (2)
will be optimized with a modied hybrid heuristic algorithm (HHA) introduced by
Maringer and Kellerer (2003). Heuristic algorithms have been employed in the eld
of portfolio optimization since Dueck and Winker (1992). An overview of heuristic
optimization techniques is provided by Gilli and Winker (2009) and for applications
in nance by Maringer and Winker (2007).
In order to solve either of the problems with real world data, estimators
i
and
4: for t = 1 to t hresh do
5: Determine the step size U
t
= U
max
U (t 1)
6: for l = 1 to i t er do
7: g = (t 1) i t er +l
8: Modication Phase (Algorithm 2)
9: Evaluation Phase (Algorithm 3)
10: Replacement Phase (Algorithm 4)
11: end for
12: end for
13: terminate algorithm and report current elitist I
In the Modication Phase, shown in more detail in Algorithm 2, the agents develop
independently from each other over st eps search steps. In each of these steps, an
amount of n
i, p
pieces of a randomly picked asset i I
g
p
, amounting to the step
sizes cash equivalent, is sold (3:). Of course, n
i, p
= h() is a (discrete) func-
tion of various parameters of the problem. If the disposition leads to a drop of the
assets weight below the lower bound, or if it leads to a complete clearance of asset
i I
g
p
, two proceedings exist: (1) with probability p
replace
asset i gets substituted
by a random asset k / I
g
p
, or (2) with probability 1 p
replace
asset i is kept with
quantity n
i, p
= w
i, p
= 0 and a random other portfolio constituent j I
g
p
, j = i ,
is bought (4: to 6:). In contrast, if the disposition does not violate the lower bound on
the weights, i is kept in the reduced (but still positive) quantity and, again, a random
other portfolio constituent j I
g
p
, j = i , is bought (7: to 9:). Under consideration
of all constraints, the assets j and k respectively are bought from the cash that was
generated by selling asset i . Hence, an altered version of portfolio I
g
p
denoted by I
g
p
is generated. This new solution has to be evaluated (10:) and compared to the current
solution I
g
p
. Whenever the objective function value (the tness) is greater than that of
Algorithm 2 Modication Phase (HHA).
1: Initialize p
replace
{T
t
},; import {I
g
p
}, V, U
t
, P
2: for s = 1 to st eps (parallel p) do
3: Sell n
i, p
= h(U
t
, V, P
i, p
. . .) pieces of a random assets i I
4: if n
i, p
= 0 w
i, p
< w
l
then
5: With p
replace
: buy random asset k / I
g
p
versus i I
g
p
6: With 1 p
replace
: buy a random asset j I
g
p
, j = i , i I
g
p
n
i, p
= w
i, p
= 0
7: else
8: Buy random asset j I
g
p
, j = i
9: end if
10: Evaluate F(I
g
p
)
11: if F(I
g
p
) [F(I
g
p
) T
t
] then
12: I
g
p
= I
g
p
& update I
g
if necessary
13: end if
14: end for
123
Robust portfolio optimization with a hybrid heuristic algorithm 69
the current solution I
g
p
, or whenever the impairment is not greater than the threshold
value T
t
, the solution I
g
p
is accepted as the new current solution I
g
p
(11: to 12:). This
possibility to accept impairments in the objective function value ultimately enables
the agents to escape local extrema. Hence, the threshold value can be interpreted as
the tolerance towards impairments; its value decreases with an increasing t to a nal
value of T
t hresh
= 0. If an accepted solutions objective function value is also larger
than that of the best portfolio found, the so-called elitist I
g
is updated (12:).
After the individuals developed independently from each other, the Evaluation
Phase, which is presented in Algorithm 3, applies. In that, the Pop individuals get
sorted in an descending order according to their objective function values (2:). A
selection of promising tendencies that will be reinforced as well as the selection of
unpromising tendencies that might be excluded, i.e., the evolutionary component of the
HHA, relies on this order as follows. The < 0.5Pop best individuals, the so-called
prodigies I
g
x
{I
g
p
}, x = 1, . . . , , are dened to be idols for the remaining pop-
ulation(3:).
4
This groupof idols, denotedby
g
, is enlargedbythe current generations
elitist I
g
}
5: Assign the prodigies linearly decreasing amplifying factors a
g
<x>
, ranging from + 1 down to 1; the
elitists factor is
6: Dene underdogs I
g
<Pop(x1)>
{I
g
p
}, merged in the set
g
The last phase in the life of a populations generation is the Replacement Phase,
shown in Algorithm 4, in which the set of idols, in cooperation with the amplifying
factors, is used to (possibly) replace the underdogs I
g
<Pop(x1)>
.
An underdog is replaced with probability p
clone
by an exact copy, i.e., by a Clone,
of a prodigy. Therefore, each prodigy within
g
gets assigned a selection probability
p(I
g
<x>
), that increases with the prodigys tness, i.e.,
p(I
g
<x>
) =
F(I
g
<x>
)a
g
<x>
x=1
F(I
g
<x>
)a
g
<x>
, x (3)
The resulting probability distribution is then used to randomly choose a prodigy to
replace the underdog (2: to 5:). Starting from a promising point in the search space,
4
The index z represents the zth-best position in this sorted order.
123
70 B. Fastrich, P. Winker
Algorithm 4 Replacement Phase (HHA).
1: Import {I
g
p
},
g
,
g
, , {a
g
<x>
}, , T
t
2: if Replacement by Clone (with p
clone
) (parallel p
g
) then
3: Compute selection probabilities according to Eq. (3)
4: Randomly choose a prodigy I
g
pi cked
g
based on p(I
g
<x>
)
5: I
g
<Pop(x1)>
= I
g
pi cked
6: else
7: Compute averaged weights according to Eq. (4)
8: Randomly choose K
max
assets i I
g
based on w
g
i
and normalize
9: Make I
g
i dol
a valid solution and evaluate F(I
g
i dol
)
10: if F(I
g
i dol
) [F(I
g
<Pop+(x1)>
) T
t
] then
11: I
g
<Pop(x1)>
= I
g
i dol
& update I
g
if necessary
12: end if
13: end if
during the Modication Phase, a Clone will ultimately develop a different portfolio
structure than its twin.
In contrast, an underdog is replaced by a so-called Averaged Idol (instead of by a
Clone) with probability 1 p
clone
(6:). An Averaged Idol is a solution that is created
based on the idols pool of successful asset combinations and portfolio weights. The
different indices of assets that are heldbythe groupof idols
g
are collectedina set I
g
.
For each of these assets i I
g
an averaged weight w
g
i
is computed as follows:
w
g
i
=
n={x:x=1,...,|i I
g
<x>
}
w
g
i,n
a
g
<n>
+ bw
g
i,
i I
g
n={x:x=1,...,|i I
g
<x>
}
w
g
i,n
a
g
<n>
+ bw
g
i,
_, i I
g
(4)
where
b =
_
1 if i I
g
0 else
In Eq. 4 an averaged weight of asset i I
g
and summed up
over all #I
g
assets in order to, thirdly, normalize (7:). The number of different assets
held by the idols, #I
g
and S
T K
s
of these bootstrap samples are
computed. Thus, 1,001 point estimators are gained. Thirdly, based on the 1,001 mean
vectors, S
T K
s
denes S
T K
.
With the objective of constructing a portfolio that exhibits good characteristics for
many possible scenarios of the point estimators, the portfolio is optimized under the
uncertainty sets worst-case-scenarios. Due to the no-short-selling constraint in (2),
the worst-case-expected return
T K
W
possible in S
T K
T K
W
. Because short-selling is forbidden, the worst-case for
each (co)variance is given by the largest value in S
T K
s
.
Due to the picking of single components from different covariance matrices, there is
no assurance that
T K
W
is positive definite.
The constructed parameters
T K
W
and
T K
W
are then used as the inputs for
problem (2), which is optimized with the HHA. The outcome is a stochastic approxi-
mation of the global maximum represented by the elitists (robust) portfolio I
T K
, i.e.
the best (robust) portfolio found.
2.3 Robustification: Ceria and Stubbs approach
Ceria and Stubbs (2006) (CS) only consider the uncertainties regarding the expected
returns and neglect the uncertainties regarding the covariance matrix. Their reasoning
for this procedure is the nding from Chopra and Ziemba (1993) that cash-equivalent
losses due to errors in estimates of expected returns are an order of magnitude greater
than those for errors in estimates of variances or covariances. S
CS
is constructed as a
K-dimensional ellipsoid that denes a region which envelopes the joint deviation of
the estimator from its true value with a given condence level 1 :
( )
1
( )
2
(1),K
(5)
In expression (5), represents the (K K)-covariance matrix of the expected returns
and
2
(1),K
the inverse cumulative distribution function value of a Chi-squared distri-
bution with K degrees of freedomand level of significance . The worst-case-scenario
in the CS-approach is dened by the maximum joint deviation, i.e. by the maximum
deviation of the true return from its estimator that theoretically can occur within ellip-
soid (5). Thus, it is given by solving the Lagrangian:
CS
W
= arg max
L(, )
= arg max
_
w
2
_
( )
1
( )
2
(1),K
_
_
=
_
2
(1),K
w
w
w (6)
Equation 6 shows that the (K 1)-mean vector is component wise penalized in
such a way that the larger an assets portfolio weight w is, the greater the assets
penalty becomes. Due to the penaltys positive dependence on w it (partly) com-
123
Robust portfolio optimization with a hybrid heuristic algorithm 73
pensates the error-maximizing characteristic of MVO-portfolios. Moreover, consid-
ering the expected returns correlations through when constructing the worst-case-
scenario is reasonable.
7
Of course, is not known and has to be estimated with the
given data sample. For a few types of return estimators , Stubbs and Vance (2005)
give suggestions. In the case of stationary returns and historical means for one can
use
CS
=T
1
.
8
Assigning a probability 1 with which the ellipsoid envelopes
the true expected return-vector is only valid if the return distribution is elliptical (e.g.
Fang et al. 1990), so that
2
(1),K
actually has its attributed meaning.
Due to the constraint that limits the number of different assets held to K
max
, Eq. 6
has to be adjusted to the dimensions of the considered individual I
p
, p = 1, . . . , Pop.
As before, this is denoted with index p:
CS
p,W
=
0, p
2
(1),K
p
w
CS
p
w
p
CS
p
w
p
(7)
During optimization of problem (2) with the HHA, the penalized return (7) is applied
for each of the Pop individuals whenever their tness is computed. The elitist I
CS
is also con-
structed with an ellipsoid similar to (5), its components are computed in a rened way.
Firstly, as is shown in Algorithm5, the estimator for the matrix is generated using the
MBB-technique as follows: the historical return sample is bootstrapped 5,000 times
to allow for the computation of an identical number of -vectors (1: to 3:). The gained
sequences are then used to calculate
ECS
(4:) out of which only those components
are written into
ECS
p
that correspond to the assets held by individual I
p
(5:).
Algorithm 5 ECS-Covariance Matrix for Expected Returns.
1: Generate 5,000 return samples s with the MBB-technique
2: Compute the expected return vectors
s
, s = 1, . . . , 5,000
3: Build K expected return sequences {
1,s
}, . . . , {
K,s
}, each of length 5,000
4: Estimate a covariance matrix
ECS
from these sequences
5:
ECS
p
consists of only those components that correspond to the assets held by I
p
7
Assets with larger (co)variances will c.p. be penalized stronger and vice versa. Therefore, unlike in the
TK-approach, an independent simultaneous occurrence of the worst-case-situation for all assets will be
prevented.
8
T denotes the number of return observations used to estimate
.
123
74 B. Fastrich, P. Winker
Secondly, as is shown in Algorithm 6, the ellipsoids size is also determined using
the MBB-technique. To this end the historical return sample (with mean-vector
0
)
is bootstrapped 10,000 times to allow for the computation of an identical number of
mean-vectors
s
, s = 1, . . . , 10, 000 (1: to 2:).
9
Out of each (K 1)-vector
s
, K
max
assets are randomly picked and written into
s,K
max
(4:). These
s,K
max
can be inter-
preted as the true expected return vectors and must be inserted together with their
corresponding estimators in
0
and in
ECS
into an ellipsoid such as (5) to obtain
10,000 joint deviations
s
(5: to 6:). The (1 )100% quantile of the generated dis-
tribution function f () is then used to replace
2
(1),K
p
. Thus, the worst-case-return
is given by:
ECS
p,W
=
0, p
_
f
1
()
w
ECS
p
w
p
ECS
p
w
p
(8)
The usage of f
1
() rather than
2
(1),K
p
has the advantage of not having to make
assumptions regarding the asset return distribution. However, randomly choosing
K
max
out of K assets as well as the application of the MBB-technique for gener-
ating both
ECS
and
s
lead to stochastic quantiles f
1
().
10
Algorithm 6 Size of the ECS-Ellipsoid for Expected Returns.
1: Generate 10,000 return samples s with the MBB-technique
2: Compute mean vectors
s
, s = 1, . . . , 10,000
3: for s = 1 to 10,000 do
4: Choose randomly K
max
assets from the (K 1)-vector
s
that dene
s,K
max
5: Insert
s,K
max
with the corresponding components in
ECS
and
0
into ellipsoid (5)
6: Obtain the joint deviation
s
7: end for
8: Determine the (1 )100% quantile f
1
()
The crucial extension in the ECS-approach is the construction of an uncertainty
set for . Analogous to the returns, the portfolio risks worst-case is dened by the
maximum joint deviation of the portfolios true variance w
w that theoretically can occur within an ellipsoid. By exploiting the symmetry of
and by using the definitions
p
= vec(
p
), W
p
= 2(w
p
w
p
) dg(w
p
w
p
) as well
as
p
= vec(W
p
),
11
alike in (6), this ellipsoid is considered in a Lagrangian. Hence,
9
Algorithm 6 generates a larger number of bootstrap samples compared to Algorithm 5, due to the latter
algorithms memory requirements of the procedures in line (4:).
10
Empirical tests show that the variation in the distribution functions and their quantiles f
1
() is neg-
ligible when they are based on 10,000 bootstrapped values
s
.
11
Here, the vec()-operator is columnwise stacking the components within the lower triangle of a matrix
to a vector. A (K K)-matrix A results in the ([K
2
+ K]/2 1)-vector vec( A). The vec
1
-operator
reverses this operation and restores the original matrix A. The dg() operator sets all elements off the main
diagonal to a value of zero.
123
Robust portfolio optimization with a hybrid heuristic algorithm 75
the portfolios worst-case covariance matrix is given by
12
:
ECS
p,W
= vec
1
_
arg max
p
_
p
p
2
_
(
p
p
)
1
p
(
p
p
)
_
__
= vec
1
_
arg max
p
L(
p
, )
_
=
0, p
+ vec
1
__
p
_
(9)
In Eq. 9,
p
represents the covariance matrix of the components in vector
p
, i.e., the
covariance matrix of the [(K
p
)
2
+ K
p
]/2 asset returns (co)variances. The estimator
ECS
(see
Algorithm 5). Also, the ellipsoids size has to be determined via MBB, since the
distribution of
p
is unknown. To this end, as depicted in Algorithm 7, the historical
return sample (which yielded the stacked covariance matrix
0
) is bootstrapped 10,000
times and the same number of stacked covariance matrices
s
, s = 1, . . . , 10,000, is
computed (1: to 2:).
Algorithm 7 Size of the ECS-Ellipsoid for Covariance Matrices.
1: Generate 10,000 return samples s with the MBB-technique
2: Compute the stacked covariance matrices
s
, s = 1, . . . , 10,000
3: for s = 1 to 10,000 do
4: Dene
s,K
max
by only those components from
s
that correspond to K
max
randomly chosen assets
5: Insert
s,K
max
with the corresponding components in
0
and in
into an ellipsoid as appears in (9)
6: Obtain the joint deviation
s
7: end for
8: Determine the (1 )100% quantile f
1
()
Out of each
s
-vector those components are written into
s,K
max
that correspond
to K
max
randomly chosen assets (4:). Together with their corresponding components
in
0
as well as in
these 10,000
s,K
max
are inserted into the ellipsoid to compute
10,000 joint deviations
s
(5: to 6:). The (1 )100% quantile of the generated dis-
tribution function f () is then used as the ellipsoids size, i.e., = f
1
() (8:).
13
Unlike in Eqs. 7 and 8, it cannot be assumed that the weighted sum of the components
in a covariance matrix row is positive.
14
In order to ensure
ECS
p,W
to be a penalized
12
The expression already considers the dimensionality of an individual p.
13
Empirical tests indicated that the number of bootstrap samples was sufcient to ensure a small enough
variation.
14
In
EC
p
and
ECS
p
the variances can be assumed to be greater than the (absolute) covariances, since in
the empirical application daily stock returns were used.
123
76 B. Fastrich, P. Winker
covariance matrix, only the main diagonal components of
p
, i.e., the variances of
the returns (co)variances, are used. Thus, the penalized covariance matrix is given
by
15
:
ECS
p,W
=
0, p
+ vec
1
_
_
f
1
()
p
dg(
p
)
p
_
(10)
The interpretation of Eq. 10 is as follows: the greater a (co)variances variance, the
more the historically estimated (co)variance will be penalized. Also, due to a positive
dependence of the penalization on the weights, the error-maximization is (partly)
compensated. Both the penalized return (8) and the covariance matrix (10) are applied
for each of the Pop individuals whenever their tness is computed. The elitist I
ECS
0
are based on 250 daily log-returns.
The results are transformed into monthly values, since the investor is assumed to rebal-
ance the portfolio once a month. The value determines the most extreme parameter
values that are still included in the uncertainty sets. The smaller is, the greater an
uncertainty set will be, and thus the greater the worst-case estimation errors will be.
Hence, can be interpreted as a parameter that captures the investors tolerance for
estimation errors. We assume the investors utility function to correspond to = 0.05.
Secondly, the HHA has to be parameterized. The minimum step size is set
to U
mi n
= 0.0004 corresponding to 400 Euro. The maximum step size is
set to U
max
= 0.3 to ensure the trial of enough asset combinations for the
case K
max
= 7. Through extensive empirical testing the following parameter-
izations have been found to result in a sufciently reliable stochastic outcome:
Pop =40, t hresh =30, i t er =5, st eps =50, =15, = 10, p
c
= 0.7, and
p
r
= 1.
17
Thus, 300,000 objective function values are computed per optimization.
18
15
Due to using only the main diagonal components the second addend is a matrix consisting of only
positive components, which is why
ECS
p,W
can be guaranteed to be positive semidefinite.
16
Two rms were removed from the sample due to missing data.
17
Tens of thousands of portfolios were optimized testing a large spectrum of possible parameter settings
in order to nd an appropriate parameterization (results not reported, but available on request).
18
Section 3.2 provides a convergence analysis for the HHA that supports the use of 300,000 objective
function evaluations per optimization.
123
Robust portfolio optimization with a hybrid heuristic algorithm 77
Table 1 Out-of-sample performance
wi n Objective function value Portfolio return Portfolio risk
MVO TK CS ECS MVO TK CS ECS MVO TK CS ECS
1 2.77 2.94 2.89 3.61 0.10 3.72 0.49 5.23 4.68 2.41 4.50 2.53
2 9.63 5.78 9.86 8.26 6.91 7.32 8.35 8.12 11.45 4.76 10.87 8.35
3 11.56 6.09 9.27 7.72 3.53 3.73 4.07 4.58 16.92 7.66 12.74 9.81
4 3.81 2.41 2.30 1.89 3.67 1.39 0.39 0.41 3.90 3.08 3.57 3.42
5 0.56 0.31 0.36 0.24 4.33 3.60 4.80 4.19 3.82 2.92 3.80 3.19
6 1.21 1.07 0.84 0.84 0.35 0.07 0.66 0.76 1.79 1.73 1.84 1.91
7 0.69 0.76 0.69 0.19 1.57 0.98 1.65 4.12 2.20 1.92 2.25 2.43
8 0.66 0.51 0.62 0.18 1.51 1.67 1.61 3.25 2.11 1.96 2.12 2.46
9 0.16 0.05 0.04 0.12 3.53 2.76 3.37 3.19 2.09 1.92 2.27 2.30
10 1.73 0.26 1.71 0.28 1.60 2.95 1.72 4.91 3.96 2.40 3.99 2.80
11 1.80 1.62 1.74 1.64 0.10 0.24 0.22 0.21 2.93 2.55 2.66 2.58
12 3.49 2.62 3.53 3.70 2.65 2.21 2.96 2.95 4.05 2.89 3.68 3.91
13 0.04 0.12 0.25 0.21 4.73 4.26 4.67 4.94 3.09 3.04 2.70 2.95
14 2.49 2.76 3.00 2.37 2.62 3.11 2.60 3.87 2.40 6.68 6.74 6.54
15 1.80 1.57 1.79 1.68 0.44 0.22 0.54 1.12 2.71 2.46 2.62 3.55
16 1.55 1.15 1.45 0.77 2.33 0.05 0.43 2.71 4.14 1.88 2.70 3.09
17 7.60 6.18 6.36 6.33 6.53 8.25 6.70 5.15 8.31 4.81 6.13 7.11
18 3.53 1.93 1.57 1.14 0.86 0.33 1.75 3.33 6.45 3.43 3.79 4.12
19 1.17 1.01 1.11 1.05 6.52 5.30 5.96 6.39 2.40 1.85 2.13 2.51
20 2.42 3.38 3.20 2.87 2.54 3.74 1.70 0.24 5.72 3.15 4.20 4.62
21 9.13 4.27 7.62 7.13 11.81 2.94 9.67 9.11 7.34 5.15 6.26 5.81
22 3.67 2.92 3.29 3.29 2.94 2.73 3.18 3.78 4.15 3.05 3.36 2.96
Mean 3.12 2.17 2.75 2.35 0.54 0.53 0.41 0.18 4.85 3.26 4.31 4.04
SD 3.39 2.02 2.97 2.76 4.30 3.58 4.07 4.51 3.59 1.60 2.80 2.14
Table 1 summarizes for all approaches the actual objective function value, portfolio return, and portfolio
risk, measured as the return volatility. For each column the mean and the standard deviation are shown
Thirdly, to evaluate a portfolios performance with respect to its robustness, a mov-
ing time window procedure is implemented. After the portfolio is optimized and held
for 21 subsequent out-of-sample trading days, the window of 250 trading days is
moved forward by 21days. Then a new optimization is run and the resulting portfolio
is again held for 21 out-of-sample trading days before the window is moved again and
so forth.
19
The sample spans from March 17th, 2005 to January 21st, 2008, allowing
for the construction of 23 portfolios.
20
Table 1 shows the out-of-sample performance of the MVO and all robust
approaches, in which the former serves as a benchmark. It can be seen that the mean
19
Whenever a window is moved so that the samples observations are updated, new threshold sequences,
parameters, and distributions (when applicable) must be computed for each approach.
20
All used return series were shown to be stationary according to the ADF-test as well as the KPSS-test.
123
78 B. Fastrich, P. Winker
risks are lower and the mean returns are higher for all three robust approaches com-
pared to the MVO-approach. In addition, smaller variations around these better mean
values can be observed. Among the robust approaches, on average TK- and ECS-
portfolios exhibit the lowest risks, and the ECS-portfolios clearly exhibit the highest
returns. The fact that also the highest return is on a rather low absolute level is mainly
caused by using historical means as estimators and by the conservative choice of the
transaction costs.
21
However, since our aim is to examine portfolios regarding their
robustness-properties, the absolute level of returns might be of minor importance. The
robust approaches advantages become most apparent when investigating actual bad-
case-scenarios, e.g., periods 2, 3, and 21. While, e.g., in period 3, the TK-approach
achieves a return that is only slightly lower than that of the MVO-approach, its risk is
less than half as high as that of the MVO-approach. Similarly, in period 21, all robust
approaches exhibit a higher return compared to the MVO-approach plus a lower risk.
In most good-case-scenarios, e.g., period 19, the robust approaches exhibit only a
slightly lower, in some periods (e.g. 8 and 10) even higher returns. Especially the
ECS-approach performs well also in normal periods, which is why it exhibits in
12 (3) of 22 periods the highest (lowest) return. Consequently, it achieves the only
positive mean return, which makes the approach especially interesting.
To gain further meaningful insight, Table 2 exhibits the portfolio compositions as
well as the forecast errors. There, TraS (traded stocks) shows the traded volume
measured in pieces of stocks and ToR (turnover ratio) is the priced traded volume
relative to the twofold capital endowment, both in period wi n compared to wi n 1;
the number of identical stocks in these subsequent periods is given by I dS (K
p
is
shown in parenthesis) and RoF is their averaged range of uctuation. It is obvious
that the MVO-approach exhibits a remarkable uctuation in the portfolio composition.
On average 14,777 pieces of stocks are traded to rebalance the portfolio in each period.
It is, however, noticeable that in periods wi n [7; 12], the average turnover ratio is
only about half as high as over the total period and TraS exhibits an average value
of only 10,465 in that period for which reason this might be considered a more stable
period.
The last three columns of Table 2 show the differences between actually realized
and predicted objective function values (multiplied by 100), portfolio returns, and
portfolio volatilities. An examination of the portfolio returns forecast errors shows
that the actual (out-of-samples) performance is worse than its expectation in 18 out
of 22 periods. The return was on average overestimated by 4.81% points, as is shown
by the mean forecast error. A mean overestimation of 6.26% points (as shown by
the mean negative forecast error) as well as the root mean negative squared forecast
error of 4.95 provide more detail about the extent of the return overestimation. The
corresponding results for the portfolio risk are, as expected, better. Nevertheless, the
mean forecast error also points in the unfavorable direction, i.e., the portfolio risk is on
average underestimated by almost one percentage point. Due to = 0.6 the objective
functions mean forecast error is between that of the return and the risk. However, since
21
The construction of portfolios based on estimators with such a limited forecast quality apparently contrib-
utes only little to good out of sample properties. Instead, this procedure solely contributes to a construction
of portfolios that exhibit good in-sample risk-return characteristics.
123
Robust portfolio optimization with a hybrid heuristic algorithm 79
Table 2 MVO-approach
wi n Fluctuation of composition Portfolio weights of stock no. Forecast errors
TraS ToR IdS RoF 7 30 34 48 95 e
F
e