Academia.eduAcademia.edu

Translation Memory

description245 papers
group1,109 followers
lightbulbAbout this topic
Translation Memory (TM) is a technology used in computer-assisted translation that stores previously translated source texts and their corresponding target texts. It enables translators to reuse existing translations, improving consistency and efficiency in the translation process by suggesting matches for new content based on stored data.
lightbulbAbout this topic
Translation Memory (TM) is a technology used in computer-assisted translation that stores previously translated source texts and their corresponding target texts. It enables translators to reuse existing translations, improving consistency and efficiency in the translation process by suggesting matches for new content based on stored data.

Key research themes

1. How can paraphrasing enhance the effectiveness and accuracy of Translation Memory matching and retrieval?

This research theme investigates the integration of semantic knowledge, specifically paraphrasing, into Translation Memory (TM) systems to improve fuzzy matching and retrieval of translation segments. Traditional TM systems rely heavily on string-based, syntactic similarity measures such as edit-distance, which often miss semantically equivalent but lexically different segments. Exploring paraphrasing allows the system to capture semantic similarity, reducing post-editing effort, improving translation quality, and enhancing translator productivity.

Key finding: This paper presents a novel dynamic programming and greedy approximation algorithm to integrate paraphrasing into the edit-distance metric for TM matching. The approach effectively computes fuzzy match scores that reflect... Read more
Key finding: Building on earlier algorithmic advances, this study provides an extensive human-centered evaluation of paraphrase-enhanced TM matching and retrieval. Measurements include post-editing duration, keystroke save, HTER and... Read more
Key finding: This work refines prior paraphrasing approaches by eliminating greedy approximations, employing an exclusively dynamic programming-based method that globally optimizes edit distance with paraphrasing. Tested on... Read more
Key finding: This paper examines the contribution of combining linguistically aware matching techniques (e.g., part-of-speech tags, semantic roles, paraphrase resources) with traditional string-based fuzzy matching in TM. Their hybrid... Read more

2. What effects do translation memory match types and their presentation order have on translators’ cognitive and technical effort?

This theme explores how various TM match types (exact, fuzzy, no match, post-editing suggestions) and their sequential order during the translation process influence translator behavior, cognitive load, and editing effort. Recognizing the temporal and dynamic nature of translation sessions, the studies analyze whether order effects or 'first impression' biases impact translator engagement with TM suggestions and cognitive segmentation strategies, which is crucial for designing TM systems and workflows that better support user cognition and efficiency.

Key finding: Analyzing time-series data from prior TM studies, this paper identifies significant order effects during sequential translation tasks, particularly in technical and cognitive effort correlated with the sequence of TM match... Read more
Key finding: This empirical study operationalizes temporal, physical, cognitive, and subjective effort measures across different fuzzy match rates in TM segments. Results indicate at the participant level, translator effort did not... Read more
Key finding: Utilizing pause-word ratios and Activity Units derived from keystroke and gaze data, this study quantifies cognitive effort during translation and post-editing tasks involving TM suggestions. The work reveals that translation... Read more

3. How does the integration of Translation Memories influence translator training and the development of translation competence?

This cluster of research focuses on the educational role of TM systems and associated technologies (including parallel corpora and CAT tools) in translator training programs. It investigates the pedagogical utility of TM for fostering translation competence, terminology consistency, and real-world workflow familiarity among students. The studies emphasize the importance of incorporating TM technology literacy in curricula to better prepare students for evolving market demands and to enhance their practical translation skills.

Key finding: This study highlights the transformational role of aligned bilingual parallel corpora and Translation Memories in translator education. It argues for their integration as pedagogical tools to improve students' terminology... Read more
Key finding: Examining pedagogical frameworks, this paper advocates for the systematic inclusion of TM technology and computer-aided translation (CAT) tools in translation curricula. The author discusses different learning theories... Read more
Key finding: Through pre- and post-training attitude surveys, this study demonstrates that translation students' perceptions of CAT tools, including TMs, become more positive following hands-on exposure. It identifies prior technology... Read more
Key finding: This collective work presents interdisciplinary perspectives on the impact of AI and translation technologies, including TMs, on translation education and professional practice. It underscores that adaptation to technological... Read more

All papers in Translation Memory

This dissertation aims to shed light on the impact of artificial intelligence and machine learning on translators. The current hype surrounding the recent progress in the field of artificial intelligence in this era of technology... more
This dissertation aims to shed light on the impact of artificial intelligence and machine learning on translators. The current hype surrounding the recent progress in the field of artificial intelligence in this era of technology... more
que estima esfuerzo y tiempo necesarios de posedición, y determina si es factible procesar un texto con TA.
This thesis presents a study of the impact a translation memory system may have over the translation process in terms of cognitive segmentation, lexicogrammatical variation (agnation) and explicitation, when professional translators... more
Artificial Intelligence-generated translation tools are proliferating at an unprecedented pace. Translation job markets are changing and changing the required skills of graduates. The debate about replacing human translators by machines... more
This project describes the design and implementation of a modular, filebased encryption and decryption system developed using modern C++ (C++17). The system is designed to securely encrypt and decrypt files stored within a directory by... more
Machine translation outputs are not correct enough to be used as it is, except for the very simplest translations. They only give the general meaning of a sentence not the exact translation. As Machine Translation (MT) is gaining a... more
The growing use of computer technologies as well as the amount of available information has posed new challenges to humanmachine interaction. Applying natural language processing techniques to information systems has brought advantages to... more
In the literature and discussion about information retrieval (IR) the words "string", "word", "term" and "concept" are used inconsistently. There is no common agreement on the concepts named by these words. Thus the communication in IR is... more
Unités de traduction et identité du traducteur : l'impact des outils d'aide à la traduction Résumé Les mémoires de traduction (MT) sont bien implantées dans le monde de la traduction spécialisées et font partie intégrante de la formation... more
The poor quality of user-generated content (UGC) found in forums hinders both readability and machine-translatability. To improve these two aspects, we have developed human-and machine-oriented pre-editing rules, which correct or... more
This book series tackles prominent issues in Machine Translation (MT) at a depth which will allow these books to reflect the current state-of-the-art, while simultaneously standing the test of time. Each book topic will be introduced so... more
Writers and translators have difficulties treating proper nouns correctly. These designations represent concepts that are very likely not common knowledge. While humans can research, machines can only apply data provided. It is therefore... more
The present study investigates routing of Iranian translator training procedures based on 'Dick and Carey model'. Since the development of psychology texts in theoretical and practical grounds is considered as an urgent need, translators... more
This paper evaluates the impact of ma-chine translation on the software localiza-tion process and the daily work of profes-sional translators when SMT is applied to low-resourced languages with rich mor-phology. Translation from English... more
This paper reports the latest development of The Halliday Centre Tagger (the Tagger), an online platform provided with semi-automatic features to facilitate text annotation and analysis. The Tagger is featured for its web-based... more
Semantic knowledge has been adopted recently for SMT preprocessing, decoding and evaluation, in order to be able to compare sentences based on their meaning rather than on mere lexical and syntactic similarity. Little attention has been... more
The current dynamic, rapidly changing labour market is influenced by the constant development of new technologies, globalisation, the changing nature of the economy, and changed demands on employees. In this context, university graduates... more
This paper reports the results of two studies carried out with two different group of professional translators to find out how professionals perceive and accept NMT in comparison with TM. The first group translated and post-edited... more
This paper highlights the results and trends on post-editing and machine translation from the recent AMTA and SDL Automated Translation Survey. Then Continental Airlines and SDL share their experiences, and the benefits and challenges of... more
This paper describes a database of translation memory, TotalRecall, developed to encourage authentic and idiomatic use in second language writing. TotalRecall is a bilingual concordancer that support search query in English or Chinese for... more
Download research papers for free!