Professional Documents
Culture Documents
CITATIONS READS
5 47
4 authors, including:
All content following this page was uploaded by Jane Reis Coimbra on 27 November 2014.
The user has requested enhancement of the downloaded file. All in-text references underlined in blue are added to the original docum
and are linked to publications on ResearchGate, letting you access and read them immediately.
THERMAL PROCESS CALCULATION USING ARTIFICIAL
NEURAL NETWORKS AND OTHER TRADITIONAL METHODS
ABSTRACT
INTRODUCTION
The existing methods for thermal processing calculation have been exten-
sively applied in the food industry, which aim to estimate the lethality of a
given process, or alternatively, to determine an appropriate process time based
on a set of thermal conditions.
Bigelow et al. (1920) introduced a graphic procedure to evaluate the
efficiency of food thermal processing. This technique was the basis for a group
1
Corresponding author. TEL: +55-31-3899-1617; FAX: +55-31-3899-2208; EMAIL: lminim@ufv.br
(k 1)
x 1(k 1) x0
x 2(k 1)
w jk0
w jk1
w jk2
x 3(k 1)
w jk3 x jk
w jkn
x (nk 1)
x kj = f ( w kji xik 1 ) = f ( I jk ) (1)
j
where x kj is the current output state of jth neuron in layer k, wijk is the weight on
connection joining the ith neuron in layer (k - 1) to jth neuron in layer k, I jk is
the weighted summation of inputs to jth neuron in layer k and f is a differen-
tiable function, such as sigmoid or hyperbolic tangent. The aim of the learning
processes is to minimize the global error E of the system by modifying the
weights. Given the current set of weights w kji, we need to determine how to
change them in order to decrease the global error. This can be done using the
gradient descent as follows:
THERMAL PROCESS CALCULATION BY ANN AND OTHER METHODS 165
E
w kji = h (2)
w kji
where h is the learning coefficient. The critical parameter d that is passed back
through the layers is defined by:
E
d jk = k (3)
Ij
E E I jk ( k 1)
= = d j xi
k
(4)
w kji I jk w kji
E = (1 2) (d j o j )
2
(6)
j
where d is the desired output specified by a teacher, and o is the actual output
produced by the network with the current set of weights. It can be shown that
the error d is obtained for the output layer accordingly to Eq. (7) and for any
hidden layer accordingly to Eq. (8):
d j = ( d j o j ) f ( I j ) (7)
d j = d l w jl f ( I j ) (8)
l
As described in Eq. (6), E defines the global error of the network for a
particular inputoutput pattern. An overall global error function can be defined
as the sum of all pattern-specific error function.
166 E.C. GONALVES ET AL.
Data Generation
The product used in the thermal processing was a paste of liver, composed
of 20% pork liver, 60% pork meat and 20% water, prepared in a cutter
(Mainca, Barcelona, Spain).
Product-containing cylindrical cans of 340 g were processed in a non-
continuous vertical retort, using steam as the heat medium. In order to monitor
the cold point temperature during the process, the cans were prepared by
inserting a type-T thermocouple in their geometrical centers. After filling and
sealing, the cans were kept in a thermal bath so that the initial temperature
could be homogeneously adjusted before the process began. A proportional-
integral controller (Spirax Sarco, Cheltenham, England) was used to control
the process temperature. In order to record the time and temperature, a data
acquisition system (Field Logger, Novus, Porto Alegre, Brazil) was used, with
an accuracy of 0.5C. At the end of the sterilization, cooling was initiated with
air pressurization to avoid inflation of the cans, until reaching a temperature
below 100C. The cooling process continued until the cold point reached 40C.
In order to obtain a set of experimental data, a factorial design was applied
using four levels of retort temperature (113, 117, 121 and 125C), three levels
of initial product temperature (25, 35 and 45C) and three levels of time elapsed
after the beginning of the process to apply disturbances (10, 30 and 50 min).
Disturbances applied to the retorts temperature were kept for three minutes
THERMAL PROCESS CALCULATION BY ANN AND OTHER METHODS 167
TABLE 1.
SOME OPERATIONAL CONDITIONS USED IN THE GENERALIZATION OF THE NETWORK
A 113 25 10
B 117 45 50
C 121 45 50
and performed with an interruption of the steam feed. This was made to
simulate faults that may happen in a real process. The time and temperature at
the center of the product were recorded every 20 s.
ANN Modeling
NeuralWorks Professional II/Plus (NeuralWare Inc., Pittsburg, PA) was
employed for ANN modeling. A BPTT network with logistic sigmoid trans-
fer function was applied, along with variable learning rate and momentum
coefficient. This algorithm represents an extension of the standard back-
propagation, in which the output yj,t+1 is reefed through recurrent loops to the
input layer yj,t. In order to predict the cold point temperature along the time,
the retorts temperature, process time and current and past temperature of the
cold point (TR, ti, Ti-1 and Ti-2, respectively) were used as an input variable.
Although there are some techniques to optimize the architecture of a neural
network (constructive, pruning, genetic algorithms), it is still a trial and error
task. We have used the constructive method because it is simple. The soft-
ware we have used permits us to apply pruning, but the results were con-
fused. A set of 25,112 data obtained within the products thermal processing
for network training was selected. Simultaneously, the test step was per-
formed using a set of 8968 data. This is an important stage, because the
overparameterization of the network can be avoided. To verify its generali-
zation capacity, the trained networks were used to predict the temperature
profile of the cold point and, consequently, the process lethality (F)-value of
the process, using a set of 4298 data, which were not used in the training and
test stages (see Table 1).
The sterilizing lethality of the process (F) was calculated accordingly to
Eq. (9), with the reference temperature TREF = 121.1C and z = 10.0C:
(Ti TREF )
n
F = 10 z
t (9)
i =1
168 E.C. GONALVES ET AL.
ti
TR
Ti T i+1
T i1
T i2
The efficiency of the model developed from the neural network technique
was tested and compared with the results obtained by the Ball and Stumbo
methods.
ANN Development
For developing the ANN, the number of hidden layers and PEs was
investigated by the trial and error method. The networks topology that pre-
sented the lowest mean squared error (MSE) is shown in Fig. 2 and it con-
sisted of two hidden layers with eight and nine nodes on the first and second
layer, respectively. As observed by Cybenko (1989), a network with a single
hidden layer is enough to approximate any continuous function, while a
network with two or more hidden layers is enough to approximate any math-
ematical function. It must be observed that two hidden layers can make the
network training easier and can lead to a better performance. The lethality is
an exponential function of temperature; therefore, even a slight deviation
of the predicted temperature causes a considerable effect on the lethality,
mostly when the cold point temperature is higher than 100C. Thus, using
two hidden layers resulted in a superior predictive model as compared to
using one hidden layer, and it also permitted to calculate lethality with a
very low error. Figure 3 presents the predicted and observed temperature at
the cold point along the time. A good agreement is observed in all phases of
the process, indicating the high performance of ANN in extracting the main
characteristics of the process.
THERMAL PROCESS CALCULATION BY ANN AND OTHER METHODS 169
Figure 4 shows the temperature profile at cold point over the time, for the
processes A, B and C (see Table 1). A good similarity between the experimen-
tal values and those predicted by the neural networks can be observed. A
critical point in the prediction of cold point temperature is when heating is
finished and the cooling phase starts. Great changes take place in the process,
because the retort temperature changes abruptly. At this moment, a slight
deviation in the predicted temperature may occur. However, as destruction
kinetics is an exponential function of temperature, the effect on lethality will
be enhanced. This can be observed in Fig. 5, which presents the accumulated
lethality along time. In this case, a slight deviation of the F-value was observed
for the process A, B and C, but with an error less than 2.3% (see Table 2). Of
course, this is a small error, which represents a difference of a few seconds in
the total heating time.
Table 2 presents the F-value calculated with the observed time
temperature data and using the Ball, Stumbo and ANN methods. It was
observed that the ANN developed in this work presented deviations below
2.3% when compared to the F-value determined using the experimental
timetemperature data. On the other hand, Balls and Stumbos methods
170 E.C. GONALVES ET AL.
Temperature (C)
Temperature (C)
Time (min)
FIG. 4. AUTOCLAVE AND COLD POINT TEMPERATURE OF THE PRODUCT ALONG TIME
, autoclave; , observed; , predicted.
were found to present superior errors, as they are not appropriated for
process calculation using variable retort temperature conditions. These
results show the great applicability of ANN model in routine calculations
of process time, as variable retort temperature always prevails. This of
course is because of faults in the steam temperature that frequently happen,
and also because of the activity of a process control system. Another inter-
esting feature of the ANN model is that the trained model is simple and
easily implemented in any commercial software for control of thermal
sterilization.
THERMAL PROCESS CALCULATION BY ANN AND OTHER METHODS 171
TABLE 2.
COMPARISON BETWEEN THE CALCULATED F-VALUES, USING THE EXPERIMENTAL
TIMETEMPERATURE DATA AND THE METHODS OF BALL, STUMBO AND ANN
CONCLUSION
ACKNOWLEDGMENT
The authors wish to thank FAPEMIG (CAG 7722002) for the financial
support.
REFERENCES