Professional Documents
Culture Documents
A COST/BENEFIT ANALYSIS
by
Lynn E. Webb
BA, San Francisco State University, 1992
Submitted in partial satisfaction of the requirements for
the Degree of
MASTER OF ARTS
in
Translation of German
Graduate Division
Monterey Institute of International Studies
Monterey, California
Copyright 1998-2000 by Lynn E. Webb
TABLE OF CONTENTS
ACKNOWLEDGEMENTS
INTRODUCTION
8
12
12
13
13
14
14
14
15
16
16
5
KEY CONSIDERATIONS FOR DETERMINING THE COST-EFFECTIVENESS OF TRANSLATION
MEMORY
18
5.1
5.2
5.3
5.4
5.5
5.6
5.7
5.8
6
EXAMPLES
6.1
6.2
6.3
6.4
6.5
24
INITIAL INVESTMENT
THE CLIENT
THE TRANSLATION AGENCY
THE FREELANCE TRANSLATOR
COMPANIES WITH IN-HOUSE TRANSLATION DIVISIONS
SURVEY/CASE STUDIES
7.1
7.2
19
19
20
20
21
23
23
23
25
26
30
34
37
40
SURVEY
CASE STUDIES
40
47
8.
TM DATABASE OWNERSHIP
47
49
10
50
10.1
10.2
10.3
51
51
52
11
52
12
FUTURE TRENDS
53
TABLE OF CONTENTS
13
CONCLUSION
56
REFERENCES
57
ACKNOWLEDGEMENTS
When I first considered writing my thesis on translation memory, I wasnt exactly
sure on what aspect I should focus. I would like to thank Chris Langewis for helping me
to narrow my topics down to the one presented in this thesis. I would also like to thank
him for his valuable input as my thesis advisor and as an expert in the field.
My survey would not have been a success without responses from the members
of LANTRA-L, CompuServes Foreign Language Education Forum (FLEFO), Interlang
and from fellow translators who responded to the various personal e-mails that I sent
out. Gerald Dennett, Mark Berry and Jeff Allen all deserve a special thanks for
contributing valuable information. I would also like to thank the various translation
memory software manufacturers, especially TRADOS Corporation, for their input and
responses to my questions.
Finally, I would like to thank William Webb for reading and editing and David
Sawyer and Frank Austermhl for reading and approving my final thesis. After all, if it
werent for this audience, I wouldnt be able to share it with the wider audienceall of
you.
The figures and tables that are not displayed in this electronic document can be
found in the HTML pages (figures) and Excel spreadsheets (tables) included with this
document in the archive zip file.
INTRODUCTION
Many articles have been written about translation memory in the last few years.
Most of the material is provided by the producers of translation memory systems and
only covers a specific product or lists specific features of the technology. Some articles
even magnify the negative aspects of translation memory. Despite all that has been
written, not much has been said about the actual costs or potential savings involved
when using translation memory. It is becoming increasingly clear that translation
memory is here to stay and that it is serving a useful purpose, but just exactly how is
this technology affecting the translation industry? Who can profit from it? Are there any
"losers? This thesis will attempt to determine the applicability of translation memory
technology and illustrate the advantages and disadvantages of translation memory in
the form of a cost/benefit analysis from the point of view of the end-user.
For the purpose of this thesis, the end-user comprises freelance translators,
companies with in-house translation divisions, translation agencies and direct clients.
The key considerations for determining the cost-effectiveness of translation memory
and the cost/benefit analysis will be covered later in this text.
against various search conditions.1 In other words, translation memory (also known as
sentence memory) consists of a database that stores source and target language pairs
of text segments that can be retrieved for use with present texts and texts to be
translated in the future. The translator, a different translation memory system or a
machine translation system provide the target text segments that are paired with the
source text segments so that the end product is a quality translation.
What distinguishes TM from other computer-assisted translation (CAT) tools?
There are many CAT tools available to assist the translator, such as bilingual and
multilingual dictionaries, grammar and spell checkers and terminology software, but TM
goes one step further by making use of these other CAT tools while at the same
matching up the original source document stored in its database with the updated or
revised document through exact and fuzzy matching. Normally, the basic unit of text in
a TM database is a sentence; however, the TM user can define what the unit will be.
The basic unit might even be a sentence fragment or a paragraph. The translator does
not have to re-translate work he or she has already completed. Figure 1 illustrates the
basic translation memory process for creating a target language translation.
EAGLES <http://issco-www.unige.ch/ewg95/node152.htm#spptmdesign>.
updated version of the text. Only the segments of the new text that do not match the old
one would have to be translated. The alternative would be to use an MT system or a
different TM system to translate the original. The new TM system could then be used by
a translator to translate the revision or update by aligning the texts produced by the MT
system or other TM system and storing them in the TM database for present and future
work. The translator could then proceed to translate only the segments of the new text,
using TM as described above.
3.1
question, we must have an idea of what takes place during this process. This section
will briefly touch on the general translation process and the features of TM used in this
process.
Figures 2 and 3 illustrate the conventional translation process and the translation
process using TM respectively. If it is possible to analyze quickly what type of text one
is translating and if the text suits translation memory, there are about the same number
of steps involved in both processes. One of the greatest differences, however, is that
once a translation has been performed using TM, not only is there a glossary of terms
stored for future recall, individual sentences will also be stored, thus cutting down
considerably on the time required for a future translation, update or revision.
Figure 4 illustrates the conventional translation process for a text that is being
revised. Figure 5 illustrates the same text using TM. Note that when using TM, fuzzy
and exact matching are performed using the translation memory program, allowing for
quick access to sections that have changed and permitting the user to focus on
translating only those changed sections.
Exact matching is the process by which the TM program pairs text segments in a
revised source text that match the original source text exactly; however, any text in the
document that does not exactly match the original will not be translated. Fuzzy
matching is the process by which the TM program pairs text segments in a revised
source text with similar text segments from a previously stored translation based on the
original source text. Fuzzy matching will find segments that are very similar to the
original and suggest the original translation. This function can be set to different levels
of sensitivity, allowing the translator to match source text segments that may differ
only slightly or segments that vary greatly, but still have some similarities. After exact
and fuzzy matching, the translator can modify the remaining segments that reflect the
changes between the original and revised texts without having to retranslate the entire
document (see Figure 6).
10
In addition to matching source text segments, fuzzy matching can also be used
to find terminology in the terminology database that is very similar to terminology being
used for a translation. For example, if the term communicate is in the terminology
database, the translation of communicate will be suggested whenever the terms
communicated or communication appear in the original text. The translator can then
enter the correct form of the word accordingly.
Although fuzzy matching is quite useful, the user must also be aware of
problems that may arise during post-editing of matched text segments. Gerald Dennett
explains in his thesis entitled Translation Memory: Concepts, products, impact and
prospects:
Take the German sentence pairs:
1. Ein Messer ist im Schrank. Er mit Elektrizitt.
2. Ein Messer ist im Schrank. Es ist sehr scharf.
Imagine that the translator has translated a document containing sentence
pair 1 and has thus stored in his Translation Memory the two segments:
A meter is in the cabinet. And It measures electricity. The syntactical and
contextual information supplied by the second sentence indicates to the
translator that the word Messer here refers to a meter. The translator then runs
a text containing sentence pair 2 through the pre-translation routine in his
Translation Memory software. The Translation Memory software will recognise a
100% match in the first part of the pair, and insert A meter is in the cabinet. in
the translation. A human translator would immediately realise from the
syntactical and contextual information supplied in the second part of the pair that
here in German word Messer is of neuter gender, and hence means knife.
The translator must hope that he can pick up such mistranslations in his proof2
reading.
On the other hand, the likelihood that the above sentences would appear in the same
document is probably quite low, especially since they would probably be used in
completely different domains or in different types of text.
Dennett 33.
11
3.2
user in managing projects, coordinating team efforts and building glossaries and
dictionaries. Following are some additional features of TM that allow the translator or
other user to manage translation projects more efficiently.
12
by project, client, etc. when performing fuzzy matching, or a project manager may have
more control over accountability for translated texts by filtering for creation date or the
name of the creator of the translated segments. The latter is particularly useful when a
number of translators are working on one large project, especially when the translators
are all working with the same language pair.
13
or project manager to assess whether or not translation memory will be useful for the
project and also helps him or her determine how much time may be involved in
translating the document, depending on the amount of repetition, the word count, etc.
The user may also use the analysis function to compare different documents for
similarities. Analysis can reveal if one document that has been translated previously
and a newer document are in any way similar. Depending on how similar the two
documents are, the user can estimate the time required for translation.
REUSABILITY
The most important characteristic of a text that is conducive to translation
memory is that the text will be reused in one way or another. Following are examples of
how texts can be reused and how translation memory becomes involved in the process.
4.1.1 UPDATES
A not uncommon occurrence during the translation process is when an update of
the text being translated is suddenly made available to the translator. An update is a
change in a source text that occurs while the translation is still in progress. Receiving an
updated text can cause major difficulties for the translator if the text is large and
changes have been made throughout the entire document. Figures 3 and 4 illustrate
the update/revision process with and without translation memory. Making updates using
translation memory has the advantage over the conventional update process in that the
translator does not have to physically search through the entire document for changes.
Instead, the translator only has to run the updated source text through the translation
14
memory program to identify new or changed segments and any new terminology. New
terminology can be entered into the terminology database by the translator for future
use.
Keep in mind that in order for translation memory to be effective, all work must
be done in TM and saved in TM format. Anything done outside of TM will not be stored
in the memory database and therefore will not be a translation that can be manipulated
in the future, unless one has access to an alignment tool. The best way to approach TM
is to think about it as being an integral part of the main word processor, just like the
word processors spell checker. If the TM system is a stand-alone product, always keep
a copy of the text file that retains the TM products file format.
A translator can even begin the translation process before the final original
document is completed. If the translator is given drafts of the original document in its
early stages of development, the text can be translated and stored in the TM database.
Then, as updated sections of the text are made available, the translator can perform
fuzzy and exact matching, thus isolating the new parts from the parts that have already
been translated or that are similar to the original. Section 6.5 is an example of this
process.
4.1.2 REVISIONS
Many translators find that they continually receive revisions from the same
clients. A revision is a new project amending a prior translation, reflecting changes
made to a prior source text. Often a translator is asked by a client to revise the
translation of a manual for the current product model that will be released within a short
period of time. The client wants the translated manual to be available at the same time
15
that the product is launched on the market. If the translator were to use the
conventional translation process, it could take months before a very large document
would be ready, and the client might not have that much patience or time. If, however,
the translator uses translation memory, he or she can analyze what has changed within
the document and can provide the revised translation of the manual within a shorter
period of time than if he or she had used the conventional process. Section 6.4
illustrates this process.
REPETITIVE CONTENT
Another important factor is whether or not there is repetitive content within a text.
The higher the percentage of repetitive content within a text, the more desirable it is to
use translation memory. Repetitive content may include words, phrases or entire
paragraphs. There are a number of different text types, but some tend to have more
16
repetitive content than others. The majority of translatable texts fall into the following
categories3:
Correspondence
Journalism/Communication
Business/Commercial
Marketing
Advertising
Administration
Legal
Scientific
Technical
Culture
Literature
The types of texts that are usually suited for translation memory are marked with
the "" symbol. Interestingly, according to the Telecom Observer, each year 450
million pages of scientific, technical, and commercial materials are translated worldwide.4 Some examples of the type of texts that fall into these categories include:
Patents (Legal)
Contracts (Legal, Business/Commercial)
User manuals (Technical)
Annual reports (Business/Commercial)
3
4
EAGLES <http://issco-www.unige.ch/ewg95/node162.html>.
Telecom Observer <http://telobs.com:8011/TelecomObserver/Spotlight/5/Welcome.E.html>.
17
Reusability of a text and the amount of repetitive content can help determine
whether TM should be used on a given text; however, reusability is the most important
factor. Regardless of text type, if a document will be updated, revised or recycled and is
fairly large, the use of translation memory is worthwhile. Figure 8 best illustrates this
concept.5
Repetitive Content,
Recyclability and
Frequency of Updates
High
USE TM
DO NOT
USE TM
Low
Small
Size of Document
Large
Langewis, Diagram.
18
paper will provide more detailed examples of how these factors influence the decision
to use TM.
5.1
5.2
memory is of little use; however, we are quickly becoming a paperless society. From the
recent survey I conducted for this thesis (see section 7.1), 70% of the 37 respondents
reported that half of the documents they translated were in electronic format. Out of the
25 freelancers who responded, 60% performed at least half of their translations on
electronic documents. Almost all of the agencies and companies with in-house
translation that responded worked primarily with electronic files. The high percentage of
electronic documentation within translation agencies and companies suggests that
freelance translators could probably receive a higher percentage of work in electronic
format if they were to ask their clients for electronic files. At some point, almost
everyone will be translating only electronic documents. As the number of electronic
documents increases, the use of translation memory will be all that much more justified.
19
5.3
reused and when working with certain text types. If a translator only translates literature
(having very little repetitive content) or translates such a variety of texts that he or she
will most likely never encounter similar documents again, then translation memory is not
an option. On the other hand, if a translator mainly translates software manuals or other
material with repetitive content and which the translator knows will be revised in the
future, then translation memory is an excellent choice.
5.4
same or may even take a little longer than when using the conventional translation
process, especially when there is a learning curve involved. The respondents in my
survey estimated that it took them one to two weeks of regular TM use to feel
comfortable using it. Entering a new translation into TM may also take longer than using
the conventional translation process because the first draft is so critical. Any future work
done in the TM database will depend on what was entered initially. Post-editing is also
affected by the quality of the first draft. A terminology database may also require
additional time to develop.
Once the original translation has been stored, however, subsequent revisions
and/or updates of the same text will take considerably less time to translate using TM.
Overall productivity using translation memory with machine translation is said to be
20
5.5
COMPARING RATES
Translators must be made aware of the fact that some agencies or customers
may request a discount or may want to pay by the hour instead of by the word if they
know that a translation was completed using TM. The translation agencies responding
to my survey point out that some clients already request discounts for repetitive content
within a text. If this is the case, then one is clearly better off using TM to take care of
repetitive content so that more time can be spent focussing on the overall quality of the
translation. If a translator is able to complete more work in less time, he or she could
also potentially earn much more than what was initially expected over a certain amount
of time just from the sheer volume of work that he or she is able to produce.
Customers or agencies using translators are increasingly taking advantage of the
lower cost of translation when using translators with access to TM. As a result, the price
of translation workregardless of whether or not one uses translation memorymay
eventually be driven down by customers and agencies requesting its use. In the near
future, freelance translators may find themselves scrambling to acquire a translation
memory product so that they can increase their productivity and remain competitive.
21
One project manager who responded to the survey works for a company
providing TM software. He cited various theories on how agencies and freelancers view
the rate situation:
1. Using TM adds consistency to the translation, therefore improving the
quality of the translation. The customer should pay for the extra
quality.
2. One should charge less per word/line/page, but base the rate on the
word count of the entire project. Any repetition would be money in the
translators/agencys pocket.
3. Charge standard rates for the parts that require special attention. Full
translation rates would apply to sections requiring translation. Full
proofreading/editing rates would apply to pre-translated sections. Full
layout work rates would apply for any desktop publishing, plus a flat
overhead fee would be required for processing the job in TM.
4. The translator receives part of the pay from an agency for segments
that are derived from fuzzy matching. This is predetermined, based on
the idea that if 70% is similar, we pay only 30%.
At least two freelance respondents mentioned that they were paid by the hour
when they used TM. In addition, the responses received from LANTRA subscribers
revealed that most of them were paid by the hour when pre-translating text using TM.7
22
5.6
past. Depending on the degree of similarity between the prior translation and the
current project and whether or not the prior work is in electronic format, the translator
may decide that it is worth aligning the old source text and old translation to assist him
or her in the present translation assignment.
5.7
she may be working on a revision of the project at some point in the future. It is
important to note, however, that two documents for completely different projects very
rarely resemble one another enough to use TM. Translation memory is therefore almost
exclusively employed for updates and revisions. When the translator receives a new
document and determines that it has similar terminology or sentences that match an
older translation done in TM, the translator simply translates the new text in TM, using
exact and fuzzy matching to identify previously translated terms (from the terminology
database) and/or sentences that either exactly or partially match the current
terms/sentences. The more terms and/or sentences that match or are similar, the
greater the increase in the translators productivity.
5.8
makes to implement TM. In fact, even if a translator finds that he or she only receives
23
one substantial update or revision a year, translation memory may pay for itself. For
more information, see section 6 of this paper.
5.9
EXAMPLES
This section includes various examples of how clients, corporations with in-
house translation divisions, translation agencies and freelance translators can estimate
the cost-effectiveness of TM for their specific translation needs. The examples will start
24
from the client side, move to the agency and freelance translator, and end with an inhouse translation department. This section is by no means all-encompassing, since
every companys or translators situation is unique. The scenarios below are restricted
to certain factors and therefore should be viewed only as theoretical applications of
translation memory; however, they provide an effective way of illustrating the benefits of
translation memory in the working environment.
6.1
INITIAL INVESTMENT
The initial translation memory investment for a translator or company may vary.
Translation memory (with a terminology database) costs on average about $2,000. The
alignment tool may cost another $2,000. Of course, a translator could create a template
or macro to facilitate the integration of prior work, but depending on how much text
needs to be integrated, the time required to create the template or macro might not be
worth the effort. A 17" monitor or larger is recommended ($400 - $1,300) due to the fact
that the translation memory software usually runs using a split screen. If the translator
has hard copy documents that he or she would like to integrate into translation memory,
then a scanner ($90 - $1,000) and OCR (Optical Character Recognition) software ($100
- $500) are required. The translator may, however, determine that using OCR software
to create electronic documents is not worth the effort. Translation memory also uses
much of the computer's available memory, so the computers should have between 32
and 64 MB of RAM ($60 - $200).
It is not inexpensive to own translation memory. An individual translator could
pay $4,650 to $7,000 if he or she were to purchase the above items all at once. A
company or translation agency would pay even more for multiple software licenses, etc.
25
Most translators, however, already own some of these products. Not only that,
translation memory can pay for itself within a year or two, depending on what type and
size of documents are translated. The following examples depict how translation
memory can help save companies or translators money and even help provide them
with more work over the long term.
6.2
THE CLIENT
In this section, the term client is considered to be a larger corporation. In order
26
The clients project manager (CPM) for product information knows that a new
line of coffeemakers will be coming out in two months. The coffeemakers are
marketed in France, Germany, Scandinavia, the U.K., the Netherlands and North
and South America. The company is located in the U.S., and the American English
users manual has just been completed. CPM realizes that when the coffeemaker
hits the shelves, the manual that comes with it must be available in at least eight
different languages in addition to American English. The manuals must be ready for
packaging in fifteen days. Ideally, CPM would like to have the translation finished in
five days so that the printing division will have plenty of time to publish the manuals.
CPM has a budget of $8,000 for the translation of the 20 page/5,000 word manual.
CPM contacts the companys regular translation agency (TA) to find out how much it
would cost and how long it would take for the translations. TA informs CPM that the
agency has recently implemented translation memory, but will not have time to align
the new manual to the previous one for this project. The translation agencys project
manager (TPM) assures CPM that once the new manual has been integrated into
TM, any similar manual in the future will be translated much more quickly.
After TPM examines the manual, it is estimated that the project will cost
approximately $8,400 for the initial translation (at $.21 per word). TPM also
estimates the turn-around time to be four to five days (three days for translating and
editing and about one to two for post-editing and administration, using one translator
per language). CPM accepts the conditions, although the cost is slightly above
budget, and places an order for the project.
Now let us take a look at the same corporate client dealing with the same agency
six months from this last transaction.
27
CPM knows that a new coffeemaker will be launched on the market in two months.
The manual for this coffeemaker is very similar to the manual translated six months ago,
except for some changes involving the coffeemaker name and features. CPMs staff has
analyzed the differences in a desktop publishing program and has estimated that 20%
of the text has changed.
CPM calls TA and lets TPM know there know that a 20-page manual needs to be
translated into eight languages. CPM informs TPM that the manual is very similar to the
manual translated six months before, and therefore the company would like a discount
for the repetitive content. TPM asks to see the manual to analyze it for the changes.
CPM sends the electronic copy to the agency.
TPM finds the last manual translated for this client. TPM then analyzes the English
manuals using TM to determine what differences there are between the two. CPM was
accurate in the estimate. TPM informs CPM that, indeed, only about 20% of the text had
changed. As a result, the estimated turn-around time for the project would be
approximately three days (one day for pre-translation using TM, one day for translating
and editing the remaining text and one day for post-editing, using one translator per
language). TPM also offers a 50% discount to CPM for the 80% repetitive content.
TPM has the two in-house translators perform the pre-translation work in TM. The inhouse translators finish the work in less than half an hour. The two translators earn
about $35,000 per year, or about $17 an hour/$.08 per word. TPM then has the two inhouse translators work on two documents and sends the remaining six documents out
to six freelance translators. Each translator must still translate the remaining 1,000
words, plus post-edit both the pre-translated work and the new translation. The
freelance translators earn approximately $.12 per word for new translation work. They
are paid about $30 per hour for post-editing the pre-translated work. All of the
28
translators require one day (8 hours) for post-editing the pre-translated work and one for
translating and editing the remaining 1,000 words.
The cost for the freelance translators would total approximately $2,160 ($720 for
6,000 words and $1,440 for post-editing). The in-house translator cost would be about
$449 ($160 for 2,000 words, $17 for pre-translation work and $272 for post-editing). The
cost to TA for the entire project would be about $2,609. The cost to CPM would be
about $3,360 for the pre-translated text and $1,680 for the remainder, totaling $5,040 for
the entire project. CPM has saved about $3,360 on this project.
Number of
words in project
Number of days
for turn-around
Total translation
cost
Cost per word
Without TM
40,000
Using TM
40,000
Savings
N/A
4-5
1-2
$8,400
$5,040
$3,360
$.21
$.13
$.08
If the above project had been for a larger manual, e.g., a 200 page/50,000 word
manual, and the rate were the same ($.21 per word), the savings would be even more
apparent. The first time the project was translated, it would take approximately 33 days
to complete (29 days for translating and four days for post-editing) and cost the client
$84,000, assuming no volume discount. If 80% remained unchanged in the next version
of the manual, the project would take approximately 9 to 10 days to complete
(approximately half of a day for the pre-translation using TM, 6 days for the remaining
translation work and 3 days for post-editing, using one translator per language). The
client would only have to pay $50,400 (assuming the same 50% discount from the
previous example)a savings of $33,600.
29
Number of
words in project
Number of days
for turn-around
Total translation
cost
Cost per word
Without TM
400,000
Using TM
400,000
Savings
N/A
33
9-10
23-24
$84,000
$50,400
$33,600
$.21
$.13
$.08
Using translation memory has fulfilled most of the clients needs. CPM has saved
the company money and has received the translation back sooner than originally
expected. As a result of using TM, CPM will also be better able to plan future budgets
and estimate the time necessary for projects. All of this will help CPM streamline the
process on the companys end and may even help prevent the company from losing
money due to the late release of manuals for products that are already ready to hit the
market.
The other benefit of translation memory is that the client will see consistent use
of company-specific terminology throughout the translations. Depending on the
companys agreement with the translation agency, CPM may also have access to the
translation memory database file in the future, giving the company the additional
flexibility to decide if it wants to start an in-house translation division or even to choose
another agency that uses TM.
6.3
30
After the initial project is completed, the agency now has a translation memory
database for this clients manual. When the client comes back in six months requesting
the manual revision, the agency will see a substantial increase in profit as a result of a
31
decrease in overhead costs. The following calculations demonstrate how the agency is
able to realize the profit.
TPM estimates that only 20% of the original text has changed. Since 80% remains
unchanged, one of the options TPM has is to employ the agencys in-house translators
to pre-translate the text, using the previous translation as the template. It would take
less than half an hour for two in-house translators to pre-translate eight 5,000-word texts
in TM. The cost for this work would be approximately $17 (at $8.50 per half hour). The
rest of the translation work comprises about 8,000 words, or 1,000 words per translator.
The translators can accomplish this work in one day, including editing and some postediting. Post-editing the text already entered into translation memory by the in-house
translators along with the newly translated segments would an additional day. The total
cost for the translation work and post-editing would be about $2,592 ($432 [translation
and post-editing] for the in-house translators and $720 [translation] plus $1,440 [postediting] for the freelance translators). The entire project would cost approximately
$2,609 (excluding other costs of doing business) and would take about three days
(including administration). By employing only the in-house translators for the pretranslation work, the agency was able to save approximately $3,503 to $5,215 in
overhead expenses and speed up the translation process.
Number of
words in project
Number of days
for turn-around
Total translation
cost
Cost per word
Total profit
Without TM
40,000
Using TM
40,000
Savings
N/A
4-5
1-2
$6,112 to $7,824
$2,609
$3,503 to $5,215
$.15 to $.19
$576 to $2,312
$.07
$5,936 to
$7,556*
$.08 to $.12
* Includes savings, although some of the savings may be used to cover any additional costs of using TM.
32
After the initial project is completed, the agency now has a translation memory
database for this clients manual. When the client comes back in six months requesting
the manual revision, the agency will see a substantial increase in profit as a result of a
decrease in overhead costs. The following calculations demonstrate how the agency is
able to realize the profit.
TPM estimates that only 20% of the original text has changed. Since 80% remains
unchanged, one of the options TPM has is to employ the agencys in-house translators
to pre-translate the text, using the previous translation as the template. It would take
less than half an hour for two in-house translators to pre-translate eight 5,000-word texts
in TM. The cost for this work would be approximately $17 (at $8.50 per half hour). The
rest of the translation work comprises about 8,000 words, or 1,000 words per translator.
The translators can accomplish this work in one day, including editing and some postediting. Post-editing the text already entered into translation memory by the in-house
translators along with the newly translated segments would take an additional day. The
total cost for the translation work and post-editing would be about $2,592 ($432
[translation and post-editing] for the in-house translators and $720 [translation] plus
$1,440 [post-editing] for the freelance translators). The entire project would cost
approximately $2,609 (excluding other costs of doing business) and would take about
three days (including administration). By employing only the in-house translators for the
pre-translation work, the agency was able to save approximately $3,503 to $5,215 in
overhead expenses and speed up the translation process.
33
Number of
words in project
Number of days
for turn-around
Total translation
cost
Cost per word
Total profit
Without TM
400,000
Using TM
400,000
Savings
N/A
33
10
22
$50,848
$15,784
$35,064
$.13
$33,152
$.04
$69,680*
$.09
* Includes savings, although some of the savings may be used to cover any additional costs of using TM.
The agency was able to do something it could not have done without using TM. It
managed to keep its costs down, increase its profit margin, speed up the translation
process and manage the project much more efficiently.
6.4
are the needs of a freelance translator? The freelance translator is not really much
different from a translation agency when it comes to his or her requirements. The
translator wants to:
1. Keep costs down
2. Increase income
3. Speed up translation process while maintaining quality
4. Eliminate repetitive translation tasks
5. Plan his or her budget
34
Let us take the last example from the agencys point of view and look at it from
the freelance translators perspective. We will first look at a translator who did not
personally own translation memory.
Translator 1 (T1) receives a call from TA. TPM asks if T1 would be interested in
translating a manual similar to one T1 had worked on in the past. T1 does not own
translation memory, but has used it on a past project, so T1 is familiar with it. TPM
informs T1 that the agency will lend T1 a copy of TAs translation memory software and
the terminology database for the project.
T1 charges $.12 per word to translate the text into French. It takes T1 approximately
three days to translate and edit about 5,000 words. T1 charges the agency $600 for the
translation. T1 has made about $200 per day, or $25 per hour. Since T1 does not own a
copy of the translation memory software, T1 must return the software and TM database
that has been created for this project. The agency allows T1 to keep a copy of the
terminology database that has been developed, but the database will be of no use to T1
unless it is exported into another format or unless T1 eventually buys a copy of the
terminology database software.
Six months later, T1 receives another call from the agency asking for a translation of
1,000 words from the revised manual and for post-editing of the 4,000 words in the
translation memory database. T1 is again given a copy of the TM software so the postediting can be completed. T1 charges $30 per hour for post-editing. It takes T1 one day
for the translation and some editing and about eight hours for post-editing the entire
document. T1 has made about $360 on this project and has finished the work in two
days.
Now let us look at another translator (T2), a German freelance translator who
has been using translation memory for at least a year and owns TM software.
35
T2 receives a similar call from TA. T2s rates happen to be the same as T1s. TPM
supplies T2 with the CMPs terminology database, although T2 already has a copy from
the last translation project. T2 has also already integrated the last project into a TM
database. TPM knows that T2 owns a TM system, but does not know that T2 has
already integrated the previous manual. TPM requests that T2 provide the agency with a
TM database at the end of the project. Since T2 already has a similar manual in a TM
database, T2 analyzes the new text against the old and estimates that about 50% of the
text is the same or very similar. It takes T2 less than half an hour to pre-translate the
similar or identical text. T2 translates the remainder and edits the completed translation
over the next two days. T2 has earned $300 per day, and has an extra day in which to
take on new translation work.
Six months later, T2 receives another call from TPM to translate the 1,000 words
and post-edit the remainder. T2 accepts the job and makes $120 for the 1,000 words
that need to be translated. The rest of the text is identical to what is already in the TM
database. Post-editing takes about eight hours, costing the agency another $240. T2
has earned a total of $360 in about two days.
If we look at the two examples just described, we see how the freelancer can
benefit from translation memory just as much as a client or translation agency can. T2
initially has a clear advantage over T1 as a result of the investment in TM. T2 is able to
complete more work in less time, thus being able to realize an increase in income by
taking on more jobs. T2 is also more valuable to the translation agency for the simple
reason that T2 is faster and does not need to rely on the agency for translation memory
software. These advantages make it more likely that T2 will receive more work from the
agency in the future. Coupled with the fact that T2 is now able to maintain client
databases and budget time more easily, T2 appears to be on the road to success.
36
Project 2
Project 1
T1
(using
TM)
Postediting
T1
(no TM)
$600
$200
$240
5,000
Project 2
T2
(using TM)
T2
(using TM)
1,000
T2
(using
TM)
5,000
Post-editing
1,000
$600
$300
$240
$120
$180
$120
$180
W hat if these two translators received work from the 400,000-word project? Their
6.5
50,000
Project 2
T1
T1
(using
(no TM
TM)
Post10,000
editing
30
5-6
$6,000
$960
$1,200
$200
$240
$200$240
Project 1
T2
(using TM)
Project 2
T2
(using TM)
T2
(using TM)
50,000
Postediting
10,000
16 (trans.)
plus 3 (pretrans. & postedit.)
$6,000
5-6
$960
$1,200
$316
$240
$200-$240
translation agency with in-house translators. An in-house translation division fulfills the
37
role of a company, translation agency and translator all rolled into one; its needs are
therefore similar to all three. An in-house translation division seeks to:
1. Increase income
2. Keep translation costs down
3. Produce a quality translation within a short amount of time
4. Produce the translation in the format required
5. Have the ability to plan translation budgets and estimate time-to-market
6. Eliminate repetitive translation tasks
7. Efficiently manage the translation project
In order to demonstrate how translation memory affects an in-house translation
environment, we must look at a different model. In this scenario, we will look at the inhouse translation division of an automobile manufacturer.
The in-house translation division has a technical department comprised of 40
translators, two per language. The technical department is involved with translating
all of the different user manuals for each make and model of automobile. Translation
memory is a standard part of the divisions translation process.
The company is coming out with a new model of its five standard cars in six
months. The 50-page/12,500-word manuals for these automobiles must be ready in
five months. The translations of the five manuals from the previous models are
already stored in translation memory.
The project manager receives the first draft of the five English manuals and
distributes them to the 20 translation teams. The translation teams compare the new
drafts to the older manuals and determine that 45% of the new drafts is exactly the
same as the previous manuals and 20% is very similar. The teams research new
38
terminology and add the new terms to the terminology databases, which takes a
couple of days. One of the team members then pre-translates two manuals and the
other pre-translates the remaining three using fuzzy and exact matching. The pretranslation work takes less than one half hour per manual.
After the pre-translation work is completed, the translators post-edit what was
entered and translate the remaining 35% of the text in each manual. The entire
process takes twelve days (two for research and database entry, seven to pretranslate and translate and three to post-edit).
Two months later, the translators receive updates of the English manuals. The
translators compare the updated manuals to the earlier drafts and determine that
80% of the text is exactly the same and 10% is very similar. The pre-translation work
is completed in one day, but this time it only takes the translators approximately two
days for translation and editing work and two days for post-editing the document
(remember, more of the text matches exactly).
Two and a half months later, the translators receive last-minute updates to the
manuals. The manuals are due in less than one month. This time, the translators
analyze the manuals and determine that 90% of the text is the same, 7% is similar
and only 3% is new. The pre-translation work takes one day per team. Translating
the remaining text and post-editing only takes about one and one-half days, and final
editing takes about half a day. The final version of all five manuals in twenty different
languages is ready at least twelve days before the due date even when the
translators were given the last update with only about 15 days to spare.
39
Number of
words in project
Number of days
for turn-around
First Draft
62,500
Second Draft
62,500
Final
62,500
12
What has the automobile company gained from translation memory? The answer
is pretty clear. The company will have its manuals in time for shipping. There was no
need to send any translation work outside, thus theoretically saving the company
money. The teams of translators worked in tandem over a network to complete the
translations, ensuring consistency throughout each document. The project manager
was able to manage the large projects with ease by starting earlier in the year. In short,
the company was able to accomplish all of the tasks required to streamline the
translation process and thereby make the company more efficient.
SURVEY/CASE STUDIES
This section discusses the results of a survey conducted for this thesis and case
7.1
SURVEY
The following questions were asked in the survey for this thesis entitled
40
1.
2.
3.
4.
5.
6.
7.
Do you also use machine translation? How does this influence the
translation process when combined with TM?
8.
41
9.
10.
11.
12.
There were 37 respondents in total. Only one of my six personal contacts did not
respond to the survey. Due to the fact that the survey was sent over electronic forums,
it is not possible to determine the total number of people who actually received and
read it.
The respondents varied by occupation: five were agencies, seven were in-house
translators, twenty were freelancers, two were freelancers who also worked in-house at
one time or another and three were freelance translators who also worked for agencies.
Eleven of the freelancers used TM, although two of them only used it when it was
supplied by an agency. Another freelancer who did not own TM was planning on buying
a TM product in the near future. Four of the agencies and six of the in-house translators
also used TM. Both freelancers who were also working as in-house translators used
TM. The three respondents who considered themselves freelancers and agencies did
not use TM. One of the last group of respondents had never heard of TM.
42
The majority (92%) of agencies and in-house translators performed at least half
of their work in teams. The freelancers performed an average of about 86% of their
work alone.
The majority of the respondents (70%) received at least half of their work in
electronic format. The translation agencies handled about 84% of translations in
electronic format. In-house translators performed practically all of their translation work
(99%) in electronic format. The freelance translators averaged about 56% of their
translation work in electronic format.
Virtually all of the respondents who had worked with TM reported that they were
impressed with the ability to maintain consistency throughout texts and/or were
impressed with the ability to increase the amount of text translated in a day. Productivity
varied from a 10% to 60% increase, depending on text type and amount of repetition in
updated or revised texts.
A few of the respondents noted that when a text was translated for the first time
using TM, it actually took longer to translate. Two respondents mentioned technical
problems were encountered at some point during their use of TM. One respondent
noted a change in work style to adapt to TM.
About 81% of the respondents translated technical or software manuals.
Approximately 35% of the respondents reported a moderate to high rate in frequency of
revisions and updates.
About 43% of respondents had a need to align prior work for integration into
present and future work. One respondent noted that there was no longer any need to
43
align prior work because the work had already been integrated into TM at an earlier
time.
Only two respondents mentioned that they had been requested to use TM, and
only two stated they were working on integrating machine translation into the translation
process alongside TM.
Most of the respondents did not have to purchase any additional hardware or
software as a result of their purchase of TM. At least two of the freelance translators
mentioned that they did not personally own a TM package, but that they used TM
provided to them when working on projects for a specific client. Only two respondents
mentioned having to update their operating systems, one mentioned upgrading system
memory and another mentioned having to purchase a second dongle for the TM
system, two mentioned purchasing scanners and OCR software, and one mentioned
upgrading to new computers and purchasing technical support for the TM system. It is
very likely that most of the respondents from translation agencies and companies with
in-house translation departments were unaware of their companies' extra purchases
resulting from the integration of TM.
In terms of rates of pay, most of the freelance respondents had not encountered
a situation in which they were paid less when using TM. One of them reported that he
actually charged more for TM because he was the one who mentioned to his client that
he could use it to improve consistency. Two freelance translators mentioned that when
they use TM (supplied by an agency), they are paid by the hour instead of by the word.
In a separate survey sent over the LANTRA-L Internet mailing list regarding this
subject, most of the respondents were paid by the hour when they used TM. At least
44
two of the respondents mentioned that they do not tell their clients or agencies that they
use TM for fear of a decrease in their current rates. Here is what one freelance
respondent had to say:
"The agencies pay less when they ask you to do the translation
using TM. I don't tell the agencies that I use TM when it isn't part of the
agreement (Subject 21).
When asked what they liked or disliked about using TM, responses varied:
"I don't like that you basically have to stick to segments. In a Word
document, a translator can rearrange a paragraph if it makes more sense
in the target language. With [TM], it takes a considerable amount of time
to merge the segments, and that defeats the purpose of the memory
(Subject 1)."
Subject 1 does have a point. In certain cases, a translator may want to combine
or break up sentences or text segments. The difficulty comes when combining more
than one sentence or segment. Most translation memory tools allow access to one
sentence or segment at a time. For example, when a translator wants to combine two
sentences into one, he or she will most likely translate both sentences when presented
with the first original sentence and will skip entering a translation when presented with
the second sentence. This technique might not work the way the translator would like,
since the current translation would appear correct, but the sentence pairs stored in the
translation memory would not be correct. Instead of combining the two sentences into
one translated sentence, the translator would be better off making the changes outside
of the translation memory.
45
46
7.2
CASE STUDIES
TRADOS is a German company that is well known for its translation tools. The
company has recently begun conducting case studies among users of its TM product
Translator's Workbench. One of the companies with in-house translation, Bernhard
Beumer Maschinenfabrik KG, reports that the use of the TRADOS product has
"improved quality through standardization of terminology used for translations,
"increased efficiency through reduced turn-around time, and the company has
witnessed a 30% translation cost reduction."8
8.
TM DATABASE OWNERSHIP
One important area of discussion in recent years is the question of who owns a
translation memory database. The question is not an easy one to answer. Everyone
has an opinion. The issue is relatively new and consequently there are no known legal
precedents covering TM database ownership. Alison Rowles, business manager for
47
LISA, says: Based on our previous experience with similar legal issues in this industry,
this area is one that is normally defined specifically in client/vendor contracts. Since
the localization industry is so new there is not a great body of law and precedents for
reference.9
Mark Berry from MCB Systems states that a TM database is a sentence-bysentence log of the work, ... translation memories are arguably a by-product of the
translation processthe by-product argument cuts both ways. The translation vendor
may maintain, Translation memory is simply a by-product of the work we do. Therefore
we own it. But the client may argue, Translation memory is just a by-product of the
work we commissioned, so it belongs to us.10 He believes that the best solution is to
make translation memory ownership a specific point in contract negotiations, so that
both sides come away knowing that their respective investments in this asset are both
protected and compensated.11
On the other hand, when it comes to compiling the terminology database for a
specific project, using TM may actually work to the translators advantage, as Suzanne
Falcone points out in her article on translation aid software in the Translators Journal:
for once, the client is technically forced to provide some terminology. The glossaries
of the projects may not be that big, but they still help, as well as the text possibly
contained in translation memories or repetition files.12
Donald Plumley, senior vice president of marketing for Bowne Global Solutions
states: From our point of view, the Client owns the [database] if they are paying for it to
48
be developed and maintained. However, there are really tricky issues if [there is a]
mixing of company resources (which could be domain specific) and client-specific
[databases].13
If a TM database has been created for a specific client, then it is best for the
translator to assume that the database is the clients property, unless an agreement has
been made otherwise. This also means that the translator should not use this clients
database on projects for other clients. The bottom line appears to be that the translator
and client need to agree on who owns the database by making it part of the business
contract.
2.
13
49
translation. Many translators find that when they use TM, they are less
likely to fine-tune the translation. They may feel that the TM system is
less flexible than a word processing program. It may be that once the
translator uses the TM system more often, the less likely this will continue
to be an issue. It is also possible that these translators may tend to work
in fields that award speed over accuracy.
3.
Post-editing of pre-translations performed in TM can be quite timeconsuming and may take longer than if the translator had used the
conventional translation process to translate and edit the document. This
issue is largely related to instances in which many text segments appear
to be exact or fuzzy matches, but in reality are expressing completely
different ideas (see Geral Dennetts example in section 3.1).
4.
Learning how to use the TM program thoroughly may take some time,
which the translator may not have, especially if the translator is borrowing
the TM program for a specific project. Suzanne Falcone comments:
One problem with these programs is that they usually come at the
same time as the first job, and the deadline for delivery obviously
doesnt take training into account. The translator rarely has time to
read the users manual, and sometimes cant even install the
program properly on the first (and even second) attempt This is one
point to take into account before accepting work with a program that
14
the client is offering to provide.
10
14
Falcone <http://accurapid.com/journal/03TM2.htm>.
50
as part of a localization or machine translation product. Following are lists of the various
types of products in which translation memory technology can be found. See Table 3 in
the Appendices for more detailed information on standard translation memory software.
10.1
their main function is to create translation memory databases. They do not provide
machine translations and are less frequently used for localization than the products in
section 10.2.
Dj Vu from Atril Software
Joust (TSS) from Alpnet International
Eurolang Optimizer from LANT Technology
Translation Manager (TM/2) from IBM
TRANSIT from STAR AG
Translator's Workbench from TRADOS Corporation
10.2
51
10.3
TM/MT HYBRIDS
These products consist of translation memory and machine translation systems.
The first is a true hybrid of the two technologies. The second currently works with
localization tools, but will soon have translation memory integrated into the software.
T1 translation memory from Langenscheidt
LOGOS (with XL8) from Logos Corporation
11
15
16
LISA <http://www.lisa.unige.ch/tmx/>.
Langewis, Discussions.
52
If there were a universal file format, end-users could easily upgrade to a different
TM system. Not only that, but then anyone could develop a translation memory system,
knowing that they would only have to conform to one specific format instead of being
forced to create filters to accommodate other TM products. These two issues are also
reasons why TM developers do not condone the development of export utilities.
Software developers prefer the idea of filtering TM databases or developing a converter
to allow users to integrate TM databases from other products into their product.
12
FUTURE TRENDS
The future looks bright for all kinds of CAT tools. In fact, the future of translation
is heading toward automated solutions. This doesnt necessarily mean that machine
translation will dominate the translation industry. While machine translation is
improving, it is only part of the bigger picture. The trend is to integrate all of the different
CAT tools and elements into one continuous process.
Translation memory will have a strong role in this process. Some companies
such as Caterpillar are already using CAT tools and MT together in the translation
process.17 These companies send translations through TM for exact matches, then on
to MT to fill in the blanks and then back to TM for integration into the TM database.
From that point, the translators take over to post-edit and perfect the translation.
Other tools that fit into this process of integration include terminology databases,
Web site management systems, word processing programs, graphics programs, etc.
There may be a time when most of the non-translation related work that a translator
does is no longer a major issue. Imagine a world in which one program automatically
53
separates graphics and other non-text elements from the actual text. The graphics are
sent off to a graphics program. Translation memory and machine translation programs
work seamlessly with one another on the source text to produce a decent translation.
The translator perfects the output and then reincorporates all of the non-text elements
back into the fully translated, localized and formatted document.
Going one step further, imagine a program on the Internet that acts like a virtual
translation management program. The translator or customer provides the source text
for review. The virtual manager analyzes the document, determines what resources
are best for translating the project in a cost-effective, efficient manner, provides the
translator or customer with information on costs, Internet security and estimated
translation time, and arranges for the document to be translated by various resources
available over the Internet at the click of a button. Alis Technologies has already
developed a very similar type of program called Alis Translator for Lotus Domino.18
This on-line translation program offers a translation solution for Web and Lotus Notes
applications. This solution can automatically publish Web pages in multiple languages
and translate e-mail and other documents quickly using integrated machine translation
engines. Other optional tools include dictionaries, accent tools, HTML filters and preediting tools. At some point in time, products like this will probably also include
translation memory.
Some other solutions are being developed as we speak. One of these solutions
is the TransRouter (or Translation Router) project. The TransRouter project aims to
build on existing translation technology and standard integration techniques in order to
17
18
54
develop a tool which will help decision makers in translation agencies, service providers
and other prospective user categories to make the most effective and appropriate use
of translation technology tools and the best mix of human and computer-aided
resources for a given set of documents. The tool will partly rely on data supplied by
users, but will do much of its work automatically, based on computer analysis of
characteristics of the text.19 This project is being coordinated by Berlitz Ireland Ltd. and
includes the involvement of the University College Dublin, the Gesellschaft fr
Multilinguale Systeme, the University of Edinburgh, the University of Regensburg, the
Center for Sprogteknologi and Suissetra (Institut Dalle Molle pour les tudes
smantiques et cognitives).
Does this mean that translators will be out of a job? According to Jeff Allen, a
research linguist and translation lab supervisor at the Center for Machine Translation at
Carnegie Mellon University, it is true that many large companies are looking into
automated translation solutions; however, this does not necessarily mean that the
companies are trying to replace their valuable and experienced translation personnel
with computers, but rather that the companies are trying to reconsider if they are using
the time of their translation specialists most effectively and how they can improve it.20
Translators, agencies and corporations will have to adapt to the new technologies and
solutions in order to remain competitive over the long term.
19
20
TransRouter <http://www2.echo.lu/langeng/en/le4/trouter/summary.html>.
Allen, TM and repetitive content (part A).
55
13
CONCLUSION
Translation memory is a technology that is here to stay. It has proven to be
useful, especially in technical fields where electronic documents are constantly being
updated and revised. Individual translators, translation agencies, clients, and
companies with in-house translation divisions can all benefit from this technology.
Translation memory saves the user time and money under the appropriate
circumstances. The key for success is knowing when, where and how to use it.
Translation memory should be considered one of the many tools of the
translation trade. A painter has a brush and palette; a pianist has sheet music. How
painters and pianists use their brushes, palettes and sheet music is what counts. The
same can be said of translators and translation memory.
Unlike the brush, palette and sheet music, however, not everyone recognizes
that translation memory is an invaluable tool for the translator and that the translator,
like the artist, is the one who must fit all the pieces together to form a work of
perfection. The more that is written about this technology, the more translation
professionals and their clients will be made aware of its important role in the translation
process.
56
REFERENCES
[FLEFO] Foreign Language Education Forum. CompuServe.
[Interlang]. Internet language mailing list.
<majordomo@netacc.net>.
[LANTRA-L] Internet Languages Translation mailing list.
<LANTRA-L@SEGATE.SUNET.SE>.
Alis. Alis Translator for Lotus Domino.
<http://www.alis.com/altd/index.html?AlisFramesTgtDoc> (10 Oct. 1998).
Allen, Jeff. <translators@juno.com> ISSUE: hour vs. word rates for repetitive work.
(2 Sept. 1998) E-mail in response to my e-mail sent to
<LANTRA-L@SEGATE.SUNET.SE>.
Allen, Jeff. <translators@juno.com> ISSUE: transit software. (17 Sept. 1998)
Forwarded e-mail from his response to <LANTRA-L@SEGATE.SUNET.SE> (30
May 1998) regarding Transit software.
Allen, Jeff. <translators@juno.com>, <LANTRA-L@SEGATE.SUNET.SE> TM and
repetitive content (parts A and B). (1 Sept. 1998) E-mail in response to my email sent to <LANTRA-L@SEGATE.SUNET.SE>.
Atril. Whats Dj Vu.
<http://www.atril.com/whatsdv.html> (26 Sept. 1998).
Benis, Michael. <michaelb@pavilion.co.uk> Re: translation memory productivity.
(13 July 1998) Personal e-mail.
Benito, Emilio. <ebenito@atril.com> Re Dj Vu specs.
(7 Oct. 1998) Personal e-mail.
Bennett-Hofstadt, Valerie. <vbh@star-ag.ch> Re: Transit questions.
(6 Oct. 1998) Personal e-mail.
Berry, Mark. Who Owns Translation Memory? MCB News No. 10, Fall, 1995.
San Diego (CA): MCB Systems (1995).
Boitet, Christian. Machine-aided Human Translation. Survey of the State of the Art in
Human Language Technology. (21 November 1995) Grenoble (France).
57
58
59
60