|
With each passing day communication between two people who don't speak the same language becomes more important. In this regard, the importance of the automatic translation systems has been increasing, too. In this thesis, we developed an example based machine translation and translation memory system that can do translation between Turkish and English.
Example-Based Machine Translation (EBMT) is a translation technique that leans on machine learning paradigm. Basically, EBMT is a corpus based approach that utilizes the translation by analogy concept. In this sense, according to our approach the translation templates between two languages are inferred from similarities and differences of the given translation examples by using machine learning techniques.
These inferred translation templates are used in the translation of other texts. The similarities and the differences between English sentences of two translation examples must correspond to the similarities and the differences between Turkish sentences of those translation examples. By using this information, the translation templates are inferred from the given translation examples.
Besides, when doing translation, helper programs which are named as translation memory systems are used. Translation memory is a storage environment which keeps previously translated sentences or phrases. When doing translation with translation memory system, the system retrieves the most similar translation examples to the sentence which translators want to translate. With the help of retrieved translation examples, translation of new sentence can be achieved more quickly.
In this thesis, in addition to developing a complete example based machine translation system, we aimed at developing a translation memory system and merging these two systems. We give importance of scalability of systems according to size of datasets which they used. |