国产日韩欧美一区二区三区三州_亚洲少妇熟女av_久久久久亚洲av国产精品_波多野结衣网站一区二区_亚洲欧美色片在线91_国产亚洲精品精品国产优播av_日本一区二区三区波多野结衣 _久久国产av不卡

?

AnAnalysisonComputer—AssistedTranslationthroughGoogleTranslator Toolkit

2018-05-04 09:18:06張嵩松
校園英語(yǔ)·上旬 2018年1期
關(guān)鍵詞:三星集團(tuán)飛利浦簡(jiǎn)介

【Abstract】With the burgeoning of artificial intelligence, computer-assisted translation (CAT) has become more popular than ever before. This paper describes problems and pitfalls encountered within the process of translating on the basis of Google Translator Toolkit platform. In addition, it provides some solutions for those problems and pitfalls as well as some suggestions to how to improve CAT system.

【Key words】Analysis; Computer-Assisted; Translation

【作者簡(jiǎn)介】張嵩松,貴州省農(nóng)業(yè)對(duì)外經(jīng)濟(jì)合作中心。

1. General Information

The paper makes use of Google Translator Toolkit platform to help translate some legal documents and regulations form English to Chinese. It is universally acknowledged that legal documents and regulations in different industries or fields are quietly similar in their grammatical structure, sentence structure so that machine translation are perfect to aid in translating in these documents (either form English to Chinese or Chinese to English) if an appropriate size of translation memory and glossary is available. Therefore, the paper collects some bilingual legal documents and regulations (Chinese and English) from some official website of famous companies such as Samsung, Apple and Philips. The collected text is produced into translation memory, which the number of English characters is 2, 848 and Chinese is 4.068. What is more, it also selects some frequently used words or terms to make a glossary.

2. Translation Memory

In order to achieve the goal of the paper which assists translating in legal documents and regulations, the translation memory of the CAT containing 100 translation units, mainly are collected from some legal documents and regulations. Moreover, the segment size is sentence in each translation unite. And the translation quality of each segment is very professional because all translation units in these documents are collected from official translation. On the other hand, the English text and Chinese text are fully matched. There provide some examples form our translation memory to illustrate.

(1)Eng:All information, documents, products and services, trademarks, logos, graphics, and images (“Materials”) provided on this site are copyrighted or trademarked and are the property of Samsung Group, Samsung Electronics and its listed subsidiaries.

CHN:本網(wǎng)中所提供的所有信息、文件、產(chǎn)品, 以及服務(wù)、商標(biāo)、logo、圖形,以及圖片(以上涉及內(nèi)容以下簡(jiǎn)稱為“資料”)都是具有版權(quán)或已經(jīng)注冊(cè)的商標(biāo),是三星集團(tuán)、三星電子及其子公司的財(cái)產(chǎn)。

Example 1, the segment size is a sentence from the official website of Samsung, and its quality of translation is very faithful. In addition, the English text and Chinese text is nearly perfect matched.

(2)ENG:In addition, you may not distribute an End User Product the purpose of which is to replay the courseware, presentations, interactive multimedia material, interactive entertainment products and the like of others.

CHN:另外, 您不得為播放課件、演示文稿、交互式多媒體資料、交互式娛樂(lè)產(chǎn)品等目的而分發(fā)“最終用戶產(chǎn)品”。

Example 2 shows that the Chinese sentence structure (target language) is different from English structure because there is a clause in the English sentence (source language). However, the quality of translation is very professional even some different in grammatical structure.

The data in the translation memory is from official website of some first-class enterprises. Therefore, the source language (English) and target language (Chinese) are accurate and the quality of translation can be guaranteed.

3. Glossary

It is well-known that there many terminology and set translation in legal and regulation translation. Hence, colleting some frequently used words and terms in English and Chinese are needed to assist machine translating in the legal documents.

The glossary contains 104 frequently used words and terms, which are selected from the translation memory. The quality of translation in the glossary is also professional and reliable because the source of the glossary is the same as that of the translation memory. The paper would like to give some examples as following:

copyright laws—版權(quán)法, trademark law—商標(biāo)法, laws of privacy and communications statutes—通信條例, patent laws—專利法. These terms are all name of law, so their translations are fixed. Therefore, collecting these words can improve the efficiency and accuracy of the translating in the legal documents and regulations.

4. Evaluation of the System

The purpose of this evaluation is to assess the translation result of the CAT system. It is clear that the performance of the CAT system is affected by the data collected in translation memory.

For a test of the performance of the CAT system on legal document, an appropriate evaluation legal textual content is needed. The chosen sample for this evaluation is extracted from the official website of Philips (http://www.philips.com.cn/), including 25 translation units and total 801 English words. Logging in Google Translator Toolkit then uploading the translation memory and glossary respectively which are multilingual texts of a very high quality in both Chinese and English language translated by the professionals. And then upload the sample text to the Google Translator Toolkit to get the machine translation result and compare the MT result with professional translation version.

The quality of MT output is not ideal. Only15% of the pre-translation is from human translation mainly from the glossary uploaded, and 85% (709 words) are from machine translation, in addition, there are only eight words in TM 100% matches, no in context matches, ‘High fuzzy TM matches, Repeated text. If texts with unambiguous vocabularies, easy sentence structures and grammar often lead to understandable translation rendered by the machine, allowing readers to understand the general idea of the source language. Nevertheless, texts with terminology, long and complex sentence structure and different punctuation can cause the text to be translated wrongly. The paper set “sentence” as the translation unit, so the translation result is not good enough. However, if the paper set “word” as the translation unit, the translation quality might be largely improved.

In the paper, none of pre-translation form professional translation can be found because the translation memory is not big enough. And the paper selects the following good translations:

(1)“Philips is a registered trademark of Philips electronics.” is the source text and the machine translation result is “飛利浦是飛利浦電子公司的注冊(cè)商標(biāo)?!盩he machine translation result is pretty advanced not only in words but also in the sentence structure which can be easily accepted by the readers.

(2)“Please contact your local Philips business contact for further information.” is translated to”請(qǐng)聯(lián)系您當(dāng)?shù)氐娘w利浦進(jìn)一步信息的業(yè)務(wù)聯(lián)系?!?Although the translation result seems unnatural, the meaning of the source text cannot be confused.

Actually, these suggested good translations in the Google Translator Toolkit are not good enough. Reasons will be greatly confirmed by the following discussion:

Any perfect match or even fussy match in the global shared translation memory and uploaded translation memory cannot be found because the segment size is sentence and the genre of the sample is legal documents and regulations in which repletion rate is nearly zero. In addition, the size of the translation memory is not big enough to find some previous translations. In addition, Legal documents and regulations involve many grammatically complex and extraordinarily long sentence, a slew of terminologies so that the Google Translator Toolkit are considered unsuitable for legal translation if they are not equipped with adequate legal translation memory and glossary. Some short sentence with few terms or unambiguous words can be properly translated by the toolkit and the quality of translation is acceptable and readable. However, if the sentence is too long and complex, the translator toolkit cannot be translated properly even its quality of translation is not readable. For example:

(1)The source language “In such case, such exclusions or limitations shall be limited to the greatest extent permitted by applicable law.” The machine translation is “這種情況下,這種排除或限制,應(yīng)僅限于適用法律所允許的最大程度?!?and the suggested translation is “在此情況下,此類(lèi)例外或限制僅限于適用法律所要求的范圍?!?In this case, the machine translation is readable but uncompressible, which makes any sense in Chinese and meaning is far from the source language.

(2)The source language is “Philips is in no way responsible for the content of any site owned by a third party that may be linked to the web site via hyperlink, whether or not such hyperlink is provided by the web site or by a third party in accordance with the terms of use.” The machine translation is “飛利浦是絕不可能通過(guò)鏈接的網(wǎng)站鏈接到由第三方擁有的任何網(wǎng)站的內(nèi)容負(fù)責(zé),不論這種超鏈接網(wǎng)站或由第三方提供,按照條款使用?!?and the suggested translation is “飛利浦對(duì)通過(guò)超鏈接連接到本網(wǎng)站的任何第三方所屬站點(diǎn)的內(nèi)容概不負(fù)責(zé),無(wú)論此類(lèi)超鏈接是由本網(wǎng)站還是由第三方根據(jù)使用條款提供。” In the example, the structure and grammar of the sentence is more complex than that of example one and machine translation is a simply word-to-word translation which can not acceptable and readable at all in Chinese.

Therefore, the quality of machine translation is constrained by that of the source text input. A text with proper grammar and unambiguous wording often leads to unreliable translation by the machine, especially in a text with slang, misspelled or ambiguous words and complex or lengthy sentences can easily cause the text to be translated incorrectly.

5. Conclusion

Legal documents and regulation texts are conceived to be suitable for machine translation in regard to its use of standard and formal grammar and its non-ambiguous language style and set words and terminology. However, legal translation requires the highest translation quality. The translation quality in the paper is unsatisfactory due to the following three reasons:1. inadequate translation memory and glossary, 2. inappropriate segment size in the translation memory, 3.the limitations in the Google Translator Toolkit.

Therefore, improving CAT system should firstly reset the segment size from sentence to word so that can help machine translation become more accurate and precise, and enhance translation memory by adding more terminologies and words which can offer more references for the machine. The Google Translator Toolkit cannot translate long and complex sentences properly, however, the toolkit does help improve translation efficiency and its sharing system is quietly useful and convenient for translators.

猜你喜歡
三星集團(tuán)飛利浦簡(jiǎn)介
飛利浦Hue Play漸變光帶發(fā)布
日用電器(2020年9期)2020-12-09 23:06:58
Research on Guidance Mechanism of Public Opinion in Colleges and Universities in Micro Era
Book review on “Educating Elites”
Hometown
飛利浦在二戰(zhàn)中如何存活
飛利浦“不達(dá)標(biāo)”,羽博有效放電量低
飛利浦開(kāi)啟全新“汽車(chē)生活”
克拉玛依市| 武清区| 观塘区| 潞西市| 咸宁市| 怀安县| 若尔盖县| 巴彦淖尔市| 高雄县| 开化县| 泾阳县| 五峰| 秦皇岛市| 凤凰县| 仁布县| 临朐县| 凤冈县| 习水县| 西昌市| 台山市| 武穴市| 无极县| 岳普湖县| 呼图壁县| 商水县| 阳城县| 盈江县| 龙南县| 斗六市| 杭州市| 隆昌县| 大连市| 长泰县| 郸城县| 永平县| 鸡泽县| 牡丹江市| 全椒县| 迁安市| 通州区| 循化|