Building and Using Comparable Corpora

Reinhard Rapp editor Serge Sharoff editor Pierre Zweigenbaum editor Pascale Fung editor

Format:Paperback

Publisher:Springer-Verlag Berlin and Heidelberg GmbH & Co. KG

Published:23rd Aug '16

Should be back in stock very soon

This paperback is available in another edition too:

Building and Using Comparable Corpora cover

The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field.

The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.

“I would like to recommend ‘Building and Using Comparable … to those who are working with or are interested in multilingual and monolingual comparable corpora. … it is easy to say that the notion of comparable corpora was not only visionary, long-sighted, and productive. It is also easy to say that this volume remains the optimal starting point for any research or for any applications in Language Technology leveraging on comparable corpora.” (Marina Santini, forum.santini.se, February, 2017)

ISBN: 9783662520062

Dimensions: unknown

Weight: unknown

335 pages

Softcover reprint of the original 1st ed. 2013