NotesFAQContact Us
Search Tips
ERIC Number: ED377711
Record Type: Non-Journal
Publication Date: 1994
Pages: 6
Abstractor: N/A
Towards Automatically Aligning German Compounds with English Word Groups in an Example-Based Translation System.
Jones, Daniel; Alexa, Melina
As part of the development of a completely sub-symbolic machine translation system, a method for automatically identifying German compounds was developed. Given a parallel bilingual corpus, German compounds are identified along with their English word groupings by statistical processing alone. The underlying principles and the design process are described here. Design began with small-scale word-alignment experiments, using 2,543 English words and 1,898 German words that yielded unique lexical items in each language. A technique for decreasing reliance on one-to-one word correspondences was then applied, resulting in a distinct ability to capture relationships between compounds and non-compounded expressions. Statistical analysis of these relationships provides data on which to base machine translation operations. It is concluded that the method used is effective on identifying cross-language lexical fertility to establish translation units for re-combination within the example-based translation process. (MSE)
Publication Type: Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A