Translation Technology
Translation Technology
Most vendors provide some form of support through forums and chat
rooms, and details of many of the available training courses and
events are given under the Education tab of the website of proz.com -
an online marketplace for translators.
Users of translation tools will testify that taking some basic training is
the most effective way of getting up and running with a new CAT
package.
A typical TMS consists of two parts: a Translation Memory (TM) and a
search engine. The TMS is a type of CAT tool that is able to store the
previously translated text and the corresponding source text in a
database, and allows the translators to reuse them or part of them in
a new translation task. The TM is in its nature a database that store
translated texts together with the corresponding original texts. The
user can make use of the previously translated text and the source
text to set up one or more TM database(s). Then, in a new translation
task, the system can use the search engine to automatically search in
the existing TM database for the same or similar translation material
(such as a sentence, a paragraph, etc.) and provide a translation
suggestion to the translator. As a result, the translator does not have
to do any repetitive work and can focus his/her energy and time on
the translation of the new materials. At the same time, the TM
database is going on storing newly translated materials and
expanding the volume of the database. As for the translation
suggestion, the translator can choose to follow it, edit it or discard it.
Some basic concepts in the technology of the TMS are briefly
explained as follows.
1. Segmentation
When texts are added into a TM database, they have to be first
divided into some small chunks that are called segments. A segment is
the smallest unit that can be processed by the TMS, and is also the
base of the alignment. Generally speaking, a segment is a sentence or
a sentence-like structure and is usually ended with a punctuation
mark (such as a period or a question mark). However, according to
Bowker (2002a), it is also possible for the user to specify what should
or should not represent the end of a segment.
Alignment
The alignment is the second step in storing and organizing the texts.
Aligning texts means matching the source text segment with the
corresponding target text segment, and storing them in the database
as an integrated unit for future processing. The correct alignment is
the key to the successful reuse of the stored information.
Retrieval
Once the aligned texts have been stored in the TM database, they can
be reused by the translator in a new translation task via the automatic
search and retrieval of the system. When a new text is opened in the
translation editor of a TMS, the system first divides it into segments
as was introduced before. Then the system takes the first segment
and compares it with those segments that have been stored in the TM
database. The comparison is carried out on the basis of some preset
rules. The establishment of these rules is also an important topic of
researches and studies on the TMS. Once a match is found out in the
comparison, the system retrieves the relevant information (a unit of
the aligned texts) and presents it as a translation suggestion to the
translator. There are a number of types of matches that a TMS can
retrieve, so as to find not only identical translation units but also
similar ones.
Perfect or Exact Match
A perfect or exact match means that the new segment in the source
text is exactly the same as a segment that is already stored in the TM
database (100 percent match). According to Austermuhl (2001), there
is another type of match that is similar to the exact match called a full
match, which means the new segment is identical to a stored
segment in all the other aspects except for some variable elements
such as dates, numbers, time or measurements. In most TM tools
such as the Trados, these elements can be automatically treated by
the system by replacing the old dates, numbers, time or
measurements with new ones.
Fuzzy Match
Fuzzy match is an important and powerful function of the TMS,
meaning that the new segment is not identical to but similar to a
stored segment. It is more often encountered by translators in the
real practice than the exact match. Translators can edit the fuzzy
match so as to produce a valid translation for the new source
segment. In most TM tools, different parts between the new segment
and the stored segment are highlighted with a different color to
indicate which elements in current translation need to be changed or
adjusted to meet the source text. Another important thing in using
fuzzy match is the degree of fuzziness, or on the other hand, the
degree of similarity, which respectively means to what degree a
stored segment found by the searching engine is different from or
similar to the new source segment that needs to be translated. In
most TM tools, there is a user- defined minimum match value which
can be adjusted by the translator before translation according to the
desired degree of fuzziness or similarity. Generally speaking, the
higher the match value is, the stricter the retrieval will be, and vice
versa. However, according to Bowker (2002b), the translator must be
very cautious in selecting the degree of fuzziness so as to avoid the
problem of silence and noise. That is to say, if the match value is set
too high (e.g., 95%), the system may not retrieve matches that could
be potentially helpful to a translator. In information retrieval, this
type of problem is known as "silence." In contrast, if the match value
is set too low (e.g., 10%), then too many segments, or segments that
are not useful, could appear leading to a problem known as "noise,"
which may actually hinder the efficiency of the translation process.
There has been a significant increase in the volume of text that needs
to be translated into a wide variety of languages largely as a result of
globalization. In addition, new types of texts, such as web pages, have
appeared and require translation. These demands of our fast-paced,
globalized knowledge society have left translators struggling to keep
pace with the increasing number of requests for high-quality
translation into many languages on short deadlines. In today's
translation market therefore, the use of technology by translators is
no longer a luxury but a necessity if they are to meet rising market
demands for the quick delivery of high-quality texts in many
languages.
Translation Technology
The invention of the computer led very quickly to attempts to use it
for translation of natural languages. In the period immediately
following World War II.
Again, with the advent of new age software tools such as Google
Translate and Translation Memory tools like Trados, translation is
becoming much easier to manage. This situation has led to the
creation of on-line machine translation services such as AltaVista,
which offer rapid e-mail services, web pages, etc in the desired
language, as well as to the availability of multilingual dictionaries,
encyclopedias and free, direct- access terminology databases. These
tools do not pose a threat to the translators and interpreters to do
their job better and aren not in any way replacement for them but are
to help them to function better. We believe in the power of
technology for progress and acknowledge that when misused it can
bring about evil, but then, that is not the problem of technology but
that of users and abusers of it.
machine translation (MT) and computer- assisted translation (CAT).
MT which in the 1980s held great promises has been steadily losing
ground to computer-assisted translation because CAT responds more
realistically to actual needs since it has a human touch.
Computer-Aided Translation
CAT systems are systems which actually perform the task of
translation but rely on the intervention of the human translator at
various stages in the translation process. CAT uses a number of tools
to help the translator work accurately and quickly, the most
important of which are terminology databases and translation
memories. recent decades have witnessed considerable changes in
our society in general and in the translation market in particular.
Most texts are now produced in a digital format, which means they
can be processed by computer tools.
Types of Modern Translation Technologies:
Blatt et al, (1985), distinguish three types of computerized
approaches to the translation process: “machine aids for translators”,
“machine-aided translation” and “machine translation”.
Translating with the help of the computer is definitely not the same
as working exclusively on paper and with paper products such as
conventional dictionaries, because computer tools provide us with a
relationship to the text which is more flexible than a purely lineal
reading.
We also share the conviction that translation has not become a new
profession with the new technologies, but the changes have come to
stay and will continue to evolve. Translators need to acquire totally
new skills, to join the digital revolution (not fight it)...to understand
mobile technologies (not discard them)...to use the tools of the trade
in the 21. century (and learn what they are and what they can do for
them). There is no stopping technology.
How Translation Technologies Work:
In reality, the translation process comprises two stages:
First, the production of a rough text or preliminary version in the
target language, in which most of the translation problems are solved
but which is far from being perfect; and second, the revision state,
varying from merely re-reading the text while making minor
adjustments to the implementation of radical changes. It could be
said that MT aims at performing the first stage of this process in an
automatic way so that the human translator can then proceed directly
to the second, carrying out the meticulous and outstanding task of
revision.
Machine Translation Strategies
Machine translation is an autonomous operating system with
strategies and approaches that can be classified as follows:
the direct strategy
the transfer strategy
the pivot strategy
Computer-assisted Translation
Computer-assisted translation is in practice a complex process
involving specific tools and technology adaptable to the needs of the
translator, who is involved in the whole process and not just in the
editing stage.
CONCLUSION
It did not take long following the development of the first computers
for researchers to turn their attention to applying these computers to
natural language processing tasks. The translation market has changed
dramatically in the last decade because there has been an increase in
demand of translation services. The sharing of information is at every
level of the human experience today, Skype, face book, create video,
etc. "According to UNICEF more people die from lack of knowledge
than from diseases. People in poor countries are simply unable to
access global knowledge in a language they understand. modern
translation technology may be bringing this knowledge closer to the
bottom three billion....."
The costs of adapting software products for other locales seemed like a
small price to pay given the sizable international markets and potential
revenues to which localized products could enable access. Software
publishers approached localization in different ways: some performed
the work using in-house teams and some outsourced the work to
specialized service providers, whereas others assigned responsibility for
localization to in-country subsidiaries or distributors (Esselink 2003b: 4).
Despite the ostensible differences between these approaches, they all
shared one fundamental characteristic: in each case localization was
performed apart from, and subsequent to, the development of the
original, domestic- market products. 'This separation of development
and localization proved troublesome in many respects,' observes
Esselink (2003b: 4).
"SDL Trados Studio 2014" works with the source and target languages
displayed alongside each other. For each new segment (usually a
sentence) in the source text, proposed translations are displayed in a
separate pane, allowing the translator to select the most appropriate
translation which is then inserted in the editor window alongside the
source sentence.
What is the “pre-translate” feature?
The "pre-translate" feature can help the translator by translating the
document by first retrieving 100% matches or fuzzy matches from the
selected translation memories. but it works once the translator has
built up or obtained a large enough translation memory, thus he or she
will achieve significant time savings by automating part of the
translation process with the software's "pre-translate" feature.
Define machine translation (=automatic translation or computer-
generated translation)
Machine translation is the branch of language technology in which a
computer program translates without any human intervention.
What are the factors that the usefulness of machine translation
depends on?
The usefulness of machine translation depends on various factors
including language pair (= the equivalence), the complex ambiguity of
the source text, and the system's familiarity with the terminology and
subject matter of the original.
What is the first big boost that led to the development of machine
translation?
Research and development in machine translation had its first big boost
in the Cold War period.
What are the reasons behind research and development in machine
translation?
1. The expansion of the European institutions
2. International conflicts and the need for better intelligence
3. The need to translate many pieces of content generated every
minute on the Internet
What is the statistical model and what is syntactic?
statistical models are what the translations of the Statistical Machine
are generated on.
when has Statistical Machine Translation become the predominantly
used approach?
Since the mid-1990's Statistical Machine Translation has become the
predominantly used approach.
Are Google Translate and Microsoft Translator statistical or syntactic
systems?
Both Google Translate and Microsoft Translator are fundamentally
statistical systems.
what is the successful use of statistical systems depending on?
The successful use of these systems is broadly dependent on the
quantity and quality of the bilingual data used to build the translation
model.
What is GUI stands for?
What is CAT Tools stands for?
How can translators working on the same project make a consistent
use of terminology?
What is TAUS? What does it do?