This article will elucidate how the optional configuration of the files to be uploaded operates.
1. Why would you need to configure the upload parameters for your files?
2. How to configure the upload parameters for your files
3. File extensions and their adjustable parameters
3.1 CSV
3.2 DOCX
3.3 HTML
3.4 JSON
3.5 PPTX
3.6 XLIFF
3.7 XLSX
3.8 XML
1. Why would you need to configure the upload parameters for your files?
While Bureau Works is equipped to process your files using the most common parameters, there might be instances where adjustments are necessary. Recognizing that users have diverse requirements, we provide flexibility in how the platform interprets and displays your source file in the editor and generates your final translated file.
If the results are not as expected, it could be due to the way you configured the file's parameters or if you haven't configured them at all.
On the other hand, it's worth noting that a significant portion of use cases function perfectly well without requiring this advanced configuration.
2. How to configure the upload parameters for your files
The configuration of upload parameters is accessible during Step 2 of the project creation.
Hover your mouse over the document to reveal customizable parameters for each file, which will open in a new tab:
3. File extensions and their adjustable parameters
In this section, we will elaborate on the adjustable parameters offered by our platform for the most frequently used file extensions.
Please note that when uploading multiple documents, you can quickly apply the same configuration to all by clicking the arrow next to the filter. If you prefer, you can also select specific files to configure individually by clicking the filter. This configuration can be saved as a filter for future use. For more details, refer to the article 'How to Customize File Parameters.'
⚠️ Please keep in mind that there may be unique configurations that are not covered in this article. That's why our support team is always available to assist you with any questions or assistance you may require.
3.1 CSV
A CSV (comma-separated values) file is a text file that has a specific format that allows data to be saved in a table-structured format. Commonly employed delimiters include the comma, tab, space, and semicolon. It's worth noting that delimiter-separated files are frequently denoted with a .CSV extension, even if the field separator is not a comma.
In Bureau Works, the default delimiter configuration is established as a semicolon, but you have the flexibility to modify it at any time.
There are numerous configurable items for .CSV files, and each item will provide an explanation when you hover your mouse over it.
3.2 DOCX
A DOCX file is a file format used by Microsoft Word, a popular word processing program. It is a type of document file that contains formatted text, images, tables, graphs, and other elements to create richly formatted documents.
This kind of file is among the most commonly used file types in Bureau Works, and for it, we offer several options. You have the choice to decide whether to clean tags aggressively, translate comments, translate document properties, exclude graphical metadata, translate headers and footers, and translate hidden text.
3.3 HTML
An HTML (HyperText Markup Language) file is a standard web page document that contains text, images, links, and other multimedia content.
Both of the options provided are associated with character escaping in the translated file. Depending on your specific use case, you may need to disable the 'encode' option to prevent characters from being encoded multiple times.
3.4 JSON
A JSON (JavaScript Object Notation) file is a text-based data format used for storing and exchanging structured data. It is commonly used for transmitting data between a web server and a web application, as it's lightweight and easy for both humans and machines to read and write.
In JSON files, you have the option to disable the default configuration and instruct Bureau Works not to infer tags from regex, refrain from escaping forward slashes and white spaces, and avoid extracting all key/string pairs. Additionally, you can insert a regex to exclude specific keys when utilizing the "Extract all key/string pairs" feature.
3.5 PPTX
A PPTX file is a file format used by Microsoft PowerPoint, widely used presentation software. It is used to create multimedia presentations consisting of slides that may contain text, images, graphics, charts, audio, video, and animations.
When uploading PPTX files in Bureau Works, you have the option to perform aggressive tag cleaning, which is particularly beneficial for converted files. You can also choose to disregard placeholders in masters and decide whether to translate comments, document properties, hidden content, master's content, and notes.
3.6 XLIFF
XLIFF stands for XML Localization Interchange File Format. It is a standardized file format used in the translation industry to exchange and store information during the localization process.
When importing XLIFF files into Bureau Works, you will have various options, including choosing whether to import the target segments, automatically confirm imported target segments, and save the confirmed target segments in the Translation Memory.
3.7 XLSX
A XLSX file is a file format used by Microsoft Excel, a popular spreadsheet application. It is part of the Microsoft Office suite and is designed to store data in a structured grid of rows and columns, known as a spreadsheet or workbook.
In XLSX files, you have the choice to translate diagram data and drawings, and there's an option to select foreground colors for exclusion. Additionally, you can decide whether to translate excluded, hidden, and marked columns.
3.8 XML
XML stands for eXtensible Markup Language. It is a versatile and flexible markup language used to structure, store, and transport data in a human-readable format.
In XML files, it is possible to escape:
- "&" to "&"
- single-quotes to "\'"
- handle ">"
- non-breaking spaces
- doube-quotes in the translated content
- Preserve CDATA sections
- Use break line as code
- Unescape CDATA sections
- Preserve white spaces
If any questions were not answered in this article, please don't hesitate to contact our support team at help@bureauworks.com.
Comments
0 comments
Please sign in to leave a comment.