CAP output report details
Every CAP report contains three types of sheet: Summary, Paragraph Segmentation, and Result.
| COLUMN NAME | DESCRIPTION |
|---|---|
| Paragraph # | Incremental paragraph index within the evaluated document. |
| Paragraph Id | Unique identifier of the paragraph, per the original data import. |
| System 1 Engine | Name of MT system 1. |
| System 2 Engine | Name of MT system 2. |
| Domain | Domain classification of the evaluated content. |
| Additional information | Optional additional context or notes provided for the paragraph. |
| Metadata | Metadata propagated from the input CSV (optional). |
| Source paragraph | Original source paragraph text. |
| System 1 paragraph | Full paragraph translation generated by System 1. |
| System 2 paragraph | Full paragraph translation generated by System 2. |
| Segment # | Incremental segment index within the paragraph. |
| Source segment | Segmenter-defined segment created from the source paragraph, potentially spanning multiple chunks (e.g. sentences or similar linguistic units), and aligned with the System 1 and System 2 segments. If multiple chunks are present, they are separated by the \|\|\| divider. |
| Source position | Position of the source segment within the source paragraph. The format is Start1-End1,Start2-End2,… For example, when the segment starts at position 5 and ends at position 25, the field is “5-25”. |
| System 1 segment | Segmenter-defined segment created from the System 1 translated paragraph, potentially spanning multiple chunks, and aligned with the source and System 2 segments. If multiple chunks are present, they are separated by the \|\|\| divider. |
| System 1 position | Position of the System 1 segment within the paragraph. Same structure as for Source position. |
| System 2 segment | Segmenter-defined segment created from the System 2 translated paragraph, potentially spanning multiple chunks, and aligned with the source and System 1 segments. If multiple chunks are present, they are separated by the \|\|\| divider. |
| System 2 position | Position of the System 2 segment within the paragraph. Same structure as for Source position. |
| URL | Link to the particular Paragraph Segmentation page. |
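
The position and segment fields described above can be unpacked with a few lines of Python. This is a minimal sketch, not part of CAP itself: the function names `parse_positions` and `split_chunks` are hypothetical, and trimming whitespace around chunks is an assumption. The text does not say whether the end position is inclusive, so the sketch only returns the raw start/end numbers and does not slice the paragraph.

```python
def parse_positions(field: str) -> list[tuple[int, int]]:
    """Parse a position field such as '5-25' or '5-25,30-42'
    into a list of (start, end) integer pairs."""
    spans = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma or empty field
        start, end = part.split("-", 1)
        spans.append((int(start), int(end)))
    return spans


def split_chunks(segment: str, divider: str = "|||") -> list[str]:
    """Split a segment cell into its chunks on the ||| divider.
    Stripping surrounding whitespace is an assumption."""
    return [chunk.strip() for chunk in segment.split(divider)]


print(parse_positions("5-25"))
print(split_chunks("First sentence. ||| Second sentence."))
```

A segment with several chunks would be expected to carry one Start-End pair per chunk, so the two lists can be compared by length as a quick sanity check.
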
| COLUMN NAME | DESCRIPTION |
|---|---|
| Paragraph # | Incremental paragraph index within the evaluated document. |
| Paragraph Id | Unique identifier of the paragraph, per the original data import. |
| System 1 Engine | Name of MT system 1. |
| System 2 Engine | Name of MT system 2. |
| Domain | Domain classification of the evaluated content. |
| Additional information | Optional additional context or notes provided for the paragraph. |
| Metadata | Metadata propagated from the input CSV (optional). |
| Scale used | Scoring system ID. |
| Source paragraph | Original source paragraph text. |
| System 1 paragraph | Full paragraph translation generated by System 1. |
| System 2 paragraph | Full paragraph translation generated by System 2. |
| Segment # | Segment identification (generated incrementally). |
| Segment ID | Segment ID per the original data import. |
| Source segment | Source text for the segment within the paragraph. |
| System 1 segment | Translation from system 1 for the segment within the paragraph. |
| System 1 Score | Score for Translation 1. |
| System 2 segment | Translation from system 2 for the segment within the paragraph. |
| System 2 Score | Score for Translation 2. |
| Tie Breaker Result | Which system is better (values: 1, 2, or 0 if both are of the same quality). |
| Judgement Duration (seconds) | Time spent on the segment judgement. |
| Total Duration (seconds) | Time spent on the segment judgement. |
| URL | Link to the particular segment evaluation. |
| Update Count | How many additional segment edits were made. |
| Last Updated | Timestamp of the last edit. |
| Last Updated By Evaluator | Name of the evaluator who made the last edit. |
| Paragraph Id | Paragraph identification (if the information is available). |
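
As an illustration of how the Tie Breaker Result values (1, 2, or 0) might be summarised, the sketch below counts preferences across rows. It assumes each row has already been loaded as a dictionary keyed by column name; the function name `tally_tie_breaker` and the labels `system_1`, `system_2`, and `tie` are hypothetical and not part of the report.

```python
from collections import Counter


def tally_tie_breaker(rows):
    """Count how often each system was preferred.
    1 = System 1 better, 2 = System 2 better, 0 = same quality."""
    labels = {1: "system_1", 2: "system_2", 0: "tie"}
    counts = Counter()
    for row in rows:
        raw = row.get("Tie Breaker Result")
        try:
            # Spreadsheet readers may return 1, "1", or 1.0 for the same cell.
            value = int(float(raw))
        except (TypeError, ValueError, OverflowError):
            continue  # skip empty or non-numeric cells
        if value in labels:
            counts[labels[value]] += 1
    return counts


rows = [
    {"Tie Breaker Result": 1},
    {"Tie Breaker Result": "2"},
    {"Tie Breaker Result": 0},
    {"Tie Breaker Result": ""},
]
print(tally_tie_breaker(rows))
```

Rows with an empty or unexpected value are skipped so that they do not distort the counts.
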
If the Job is configured with an Annotation Set, the annotation values are added to the report. For each type of text (source, translation 1, or translation 2), the annotation details are added as new columns; each column header consists of the text type combined with the annotation field name.
For example, a Source annotation field named ‘Translation Issues’ is added to the report as the Source | Translation Issues column.
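
The header convention in the example can be used to pick annotation columns out of a sheet's header row. The following is a minimal sketch, assuming the separator is exactly a space, a pipe, and a space as in the example; `annotation_columns` is a hypothetical helper name. Only the "Source" text type is used here because the exact header labels for the two translations are not shown in the text.

```python
SEPARATOR = " | "  # assumed from the 'Source | Translation Issues' example


def annotation_columns(headers, text_type):
    """Map annotation field name -> full column header
    for every header that starts with '<text_type> | '."""
    prefix = text_type + SEPARATOR
    return {h[len(prefix):]: h for h in headers if h.startswith(prefix)}


headers = ["Paragraph #", "Source segment", "Source | Translation Issues"]
print(annotation_columns(headers, "Source"))
```

Matching on the full prefix, separator included, keeps regular columns such as Source segment from being mistaken for annotation columns.
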