Okapi ComponentsText Merging Utility |
|
- The utility set identifier for this utility is: oku_set01
- The utility identifier is: merging
The Text Merging utility allows you to merge extracted text back into its original format. The input files are documents created with the Text Extraction utility: Primilary XLIFF, but also Table format.
You must also have available the original files used to create the extracted documents (a.k.a. the "skeleton" files) and the same parameters file for the filter (if one was used). This is necessary to get match the same set of items in the extracted document and in the original file. The location of the skeleton files is computed from the root specified in Option Tab and the information in the extracted document itself.
The filter settings associated to the input file is not used. The utility expects the input files to be the extracted documents and to have information about the filter used for the original extraction. For more information about the expected extracted formats this utility uses, see the Text Extraction utility.
The common parameters are the options specified from the application calling the utility rather than in the options dialog box of the utility itself. For this utility the common parameters you need to specify are the following:
Files of the first input list | - Needed (the files to merge) |
Root for the first input list | - Not Needed |
Files of the second input list | - Not Needed |
Root for the second input list | - Not Needed |
Files of the third input list | - Not Needed |
Root for the third input list | - Not Needed |
Input language | - Needed |
Output language | - Needed |
Input default encoding | - Needed |
Output default encoding | - Needed |
Location and names for output files | - Not Needed |
Root of the skeleton files -- Enter the root of the folder where the skeleton files are located. The skeleton files are the files the processor needs to merge back the text into its original format. In the case of Okapi Framework filters, those files are simply the original files that were used to extract the text.
Root of the output merged files -- Enter the root of the folder where the output files should be generated.
In XLIFF, the full path of both the skeleton file and the output file are the
specified roots plus the relative path of the file defined in the
okp:original
attribute of each <file>
element.
Output folders are
created as needed if they do not exist yet.
You can specify a
relative path (a path with one or more "..\
" at the front)
for both roots. If the root specified is a relative path the complete root is
calculated based on the root of the input file. In XLIFF, the root of the input file is itself computed by
removing the value of the okp:original
attribute in the <file>
element from the full path of the input file.
For example:
Skeleton root = ..\Original Output root = ..\Merged Input file path = C:\My Projects\Pack5\Work\SubDir\MyFile.rc.xlf 'original' attribute = \SubDir\MyFile.rc.xlf The root of the input file will be: C:\My Projects\Pack5\Work The skeleton root will be: C:\My Projects\Pack5\Original The output root will be: C:\My Projects\Pack5\Merged