Within the translation industry, there are a lot of terms and concepts to keep track of. We understand that it may be difficult to remember all of them. For this reason, we have created a technical knowledge bank, where you can find definitions of some concepts related to translation and our other language services. Examples include concepts within translation, machine translation or the translation technology we use. Find out what a fuzzy match is, what localisation means and what a Translation Memory does.
100 % Matches are sentences that are identical with sentences that have been translated previously and saved in the TM. The translator still has to check that the existing translation matches the context.
You can use alignment to reuse old translations. By using the alignment tool, previously translated text and its original source can be aligned and saved in the TM. In other words, a segment from the original text is aligned with the corresponding segment of text it was translated into. This is repeated until every source and target language segment is aligned.
API is the acronym for Application Programming Interface. API integration allows one software program to “talk” with another software program. Using API integration, the interface between your content and document management system (CMS, PIM and DMS) and our translation tools can be integrated, enabling the automatic exchange of data. Learn more about our API solution.
CAT is the abbreviation of computer-aided translation. When translators use a CAT tool, it means they use a computer program to help them work much more efficiently while they translate, by among other things, reusing previously translated text where possible (TM). However, CAT should not be mistaken for machine translation. Examples of CAT tools: SDL Trados Studio, STAR Transit and Across.
Context Matches are sentences that are not only identical with previous translated sentences in the TM, they also occur in the same context as when they were first translated.
DTP is the abbreviation of Desktop Publishing. When translating with CAT tools, often the layout of the translated document needs to be worked on, where the text needs to be adjusted because of the specific language (e.g. the changing of font for Eastern European languages) and format adaptations (hyphenation, line breaks).
A file format states what kind of file you are using. Not all software programmes can deal with all file formats. Our translation tools allow us to work with every kind of editable file format, including:
Word files (.docx)
InDesign files (.indd)
Photoshop files (.psd)
Framemaker files (.fm)
Excel files (.xls)
Text files (.txt)
XML files (.xml)
HTML files (.html)
However, we prefer not work with PDF files, since generally PDF files are in a non-editable format and a “copy” of the original document. This means for instance, that the text can be “locked” in the format. However, if a file is only available in a PDF format, we can convert it to another format.
World Translation has its own network of freelance translators. A freelance translator is allocated work on a task-by-task basis. Our extensive network of freelance translators means that we can provide a great deal of flexibility and a large number of different language combinations. All of our translators are highly trained and qualified native language translators, who are evaluated on an ongoing basis.
Fuzzy Matches are sentences, which partially match sentences in the TM. This means that the translator can reuse part of a sentence from the database, i.e. a portion of a previously translated sentence. The degree of matching is stated as a percentage.
Hard/soft line change
A hard line change is used for a new paragraph. For example, you press the Enter key to do this in WORD.
A soft line change is used for a new line in the same paragraph. For example, you press Shift + Enter to do this in WORD.
HTML is the acronym for Hyper Text Markup Language. HTML is designed to show data and is used to create web documents and websites. HTML uses pre-defined tags, which determine what your web document or website will look like in a browser.
Example of HTML:
IDML files (see also InDesign)
IDML stands for InDesign Markup file. It is a file that has been created in InDesign and contains a hierarchy of different XML files, which together represent the InDesign file.
InDesign is a software program from Adobe for creating page layouts and designs. The aim of the software is to arrange the layout of text and illustrations. InDesign is primarily intended for tasks where creativity and visual design have top priority, e.g. when creating brochures and other marketing material.
International SEO covers different aspects: the translation of the website, extraction of keywords, establishment of keywords in the target language, translation of AdWords, etc.
International SEO involves language experts using different techniques to establish the most correct and most frequently used keywords in the target language. Only experienced translators who translate into their native language and who have SEO experience and training are involved. Keywords cannot simply be translated from one language to another, because search behaviour and search engine use changes from country to country. Learn more about our international SEO.
Localisation is the linguistic, cultural and stylistic adaptation of a text for a target group. This ensures that all of the cultural requirements are met. In addition to the linguistic adaptations, this can also include the adaptation of units of measurement, date formatting and possibly layout.
Machine translation uses professional translation software to automatically translate text. Machine translation is usually used when a lot of content needs to be translated in a short period of time, where manual translation is not a realistic option. After the text has been automatically translated, usually a translator will edit the text (post editing).
PDF is the acronym for Portable Document Format. The PDF format is used to present documents independent from what system they were created in, e.g. Word or InDesign. Since PDF files have limited editing options, the content must be extracted, so that the text can be translated using CAT tools. In addition, if the intention is for the translated files to have the same layout as the original, it can require a lot of time to set up the text and images into a file format that can be edited.
Perfect Matches are sentences that are identical with previous sentences in the selected reference file in Trados format.
Typically, when machine translation has automatically translated a text, a translator will then edit the text, this step is known as post editing. How much post editing is required shall be agreed with the customer and is dependent on the desired quality of the translated text. There are two levels of post editing – light or full.
When a text is proofread, it is checked for errors, which are corrected before it is published. At World Translation, we can proofread texts in all languages, which means we also help you to localise the texts. Learn more about our proofreading.
When a document is translated using a CAT tool, it is split into smaller units, which are called segments. The segments are then compared in a TM for the purpose of reusing previous translations. A segment is usually a whole sentence, it can however also be a whole paragraph. The CAT tool automatically segments the text, based on punctuation and formatting in the document. The segmentation settings in the CAT tool can be adapted as required.
SEO is the acronym for Search Engine Optimisation. SEO is the term used to describe the work to make a website more visible in a search engine’s organic search results. SEO is the optimisation of the page’s code, the selection of the correct keywords, content on the website and hyperlinks to/from other pages. All of this combined, helps to ensure that your website is better placed in search results compared to your competitors.
Software localisation is the translation of a software user interface and local adaptation. Local adaptation can be the adjustment of a currency, units of measurement and adaptation to suit cultural differences.
When a document is translated using a CAT tool, the translator does not see the document’s formatting and structure (e.g. font types, images). Instead, this information is shown as tags. Since the tags are specific to each file format, this information is different from format to format. However, they share general characteristics. A CAT tool like SDL Trados Studio defines tags in accordance with their function and in that way presents the translator with more information about the content of the segment. To ensure that the translated document retains its original formatting and structure, the tags must not be deleted in the translation.
A termbase contains the defined terminology for the desired language. It may be that you want to use different terms depending on the type of text in question. In that case, different termbases for different types of texts is the smart choice. The termbases are confidential and belong to you.
Terminology management ensures the terms and phrases specific to your company are translated consistently and according to your rules. Terminology management identifies industry-specific and company-specific terms, acronyms and abbreviations. You can also stipulate terminology that must not be used in your translations. Learn more about terminology management.
A Translation Memory (TM) is a specific customer database containing all of the previously translated sentences. Using a TM, if you are translating text that is similar to text that has already been translated, you can reuse the translated text. This increases translation speed and ensures uniform quality. A TM is the most important element in a CAT tool.
XBench is a quality assurance and terminology management program, and it can work with many different CAT formats. The programme can be used to carry out quality assurance checks, for example in SDL Trados Studio or STAR Transit, and it checks translations for consistency, completion, concordance with numbers and much more.
XML is the acronym for Extensible Markup Language. XML is a file format that is used to save and move data from one location to another.
A single-language XML file A single-language XML file contains a single language. On translation, the source text is replaced by the translated text. For example, if we are tasked with the translation of an XML file from English into Danish, we will receive an English XML file and deliver a Danish XML file.
A dual-language XML file A dual-language XML file contains two languages. On translation, the source text is retained in the XML file and the translated text is inserted into the same file. For example, if tasked with a translation from English into Danish, we will receive an English XML file and deliver an XML file that contains both Danish and English texts.
You can be assured at all times, that the translated XML files which we return to you will have the same intact XML structure and syntax, so that they can be re-imported into your system without any problems.
Example of XML: