Committee login






Small thumbnail

Mathematics for Modeling and Scientific Computing

Small thumbnail

From Prognostics and Health Systems Management to Predictive Maintenance 1

Monitoring and Prognostics

Small thumbnail

Reliability in Biomechanics

Reliability of Multiphysical Systems Set - Volume 3

Small thumbnail

Production and Maintenance Optimization Problems

Logistic Constraints and Leasing Warranty Services

Small thumbnail

Digital Electronics 3

Finite-state Machines

Small thumbnail

Transformation of Collective Intelligences

Perspective of Transhumanism

Small thumbnail

Simulation of Transport in Nanodevices

Small thumbnail

Heat Transfer in the Chemical, Food and Pharmaceutical Industries

Industrial Equipment for Chemical Engineering Set

Small thumbnail

Simulation of Stochastic Processes with Given Accuracy and Reliability

Small thumbnail

Energy Autonomy of Real-Time Systems

Energy Management in Embedded Systems Set

Small thumbnail

Comparable Corpora and Computer-assisted Translation

Estelle Maryline Delpech, University of Nantes, France

ISBN: 9781848216891

Publication Date: June 2014   Hardback   304 pp.

145.00 USD

Add to cart


Ebook Ebook


Computer-assisted translation (CAT) has always used translation memories, which require the translator to have a corpus of previous translations that the CAT software can use to generate bilingual lexicons. This can be problematic when the translator does not have such a corpus, for instance, when the text belongs to an emerging field. To solve this issue, CAT research has looked into the leveraging of comparable corpora, i.e. a set of texts, in two or more languages, which deal with the same topic but are not translations of one another.
This work had two primary objectives. The first is to assess the input of lexicons extracted from comparable corpora in the context of a specialized human translation task. The second objective is to identify bilingual-lexicon-extraction methods which best match the translators’ needs, determining the current limits of these techniques and suggesting improvements. The author focuses, in particular, on the identification of fertile translations, the management of multiple morphological structures, and the ranking of candidate translations.
The experiments are carried out on two language pairs (English–French and English–German) and on specialized texts dealing with breast cancer. This research puts significant emphasis on applicability – methodological choices are guided by the needs of the final users. This book is organized in two parts: the first part presents the applicative and scientific context of the research, and the second part is given over to efforts to improve compositional translation.
The research work presented in this book received the PhD Thesis award 2014 from the French association for natural language processing (ATALA).


Part 1. Applicative and scientific context
1. Leveraging comparable corpora for computer-assisted translation.
2. User-centered evaluation of lexicons extracted from comparable corpora.
3. Automatic Generation of Term Translations.
Part 2. Contributions to compositional translation
4. Morph-Compositional Translation: Methodological Framework.
5. Experimental data.
6. Formalization and Evaluation of Candidate Translation Generation.

About the Authors

Estelle Maryline Delpech holds a PhD in Computer Science from the University of Nantes in France, where she specialized in natural language processing and computer-aided translation. She is currently Chief Scientist at Nomao, a web and mobile app search engine company. Her research interests include multilingualism, computational linguistics, information extraction and data integration.


DownloadTable of Contents - PDF File - 56 Kb

0.03398 s.