History: Measuring translation progress in the CLWE Project
View page
Collapse Into Edit Sessions
Source of version: 4
«
»
''Work in progress'' It is useful for the users of a wiki translation system to be able to know how much a given translation is up-to-date with respect to other-language versions of the same page which may feature modifications that have not propagated. In full generality, more than one page may be a source of changes. Let us first consider the case where there are only two languages for a given page. There are many different ways to compute a "translation metric" between two pages. Given a source version S and a target version T, let us agree that the translation metric between them is zero when T is empty, and one (or 100%) when nothing in S needs translation into T. ! Computing a translation metric in the CLWE architecture Huberdeau's translation syncing architecture ([http://wiki-translation.com/tiki-download_wiki_attachment.php?attId=5&page=Cross%20Lingual%20Wiki%20Engine%20Project|Architecture Description (PDF)]) provides two sources of input that could feed into a translation metric. The first is the set of content elements that have been introduced elsewhere in the translation set, but have not yet been propagated to the translation under consideration. Since the content elements have labels but not explicit representations in this architecture, the most that can be derived from them is a number of changes that need to be propagated. The second source of input is the content diff that is computed to enable an update between two translations. The diff is comprised of the changes between two versions of S that require propagation into T. It is represented as text segments that were removed or added. Content diffs have the advantage of providing strings of characters whose length give an approximate measure of required effort. Obviously, an empty diff would correspond to a metric of 100%. Mathematically, let us use diff+(S,T) to represent the total number of characters that appear in additions in the diff and diff-(S,T) represent the total number of characters whose removal needs to propagate into T. One possible translation metric is M(S,T) = Len(T) / ( diff+(S,T) + diff-(S,T)/N + Len(T) ) where N is a fudge factor representing the ratio of effort between translating a chunk of text into and removing it from a translation (Does N=10 seem reasonable?). A picture is worth a thousand words. To visualize, imagine that the state in which the target page finds itself after it is updated is represented as a rectangle that has two segments glued together. {img src=images/code.png}%%% {CODE()} +++++++++++++++++=================================== ^ diff+ & diff- ^ ^^ current version of T ^^ {CODE} The first segment represents the changes that need to be propagated, and the second represents the current size of T. The ratio of Len(T) to the total length is our translation metric. We hypothesize that in most cases it will give an accurate assessment of the work required, but this metric has a few obvious shortcomings. For instance, if T is long but most of the required update work consists in completely replacing T by something short, the metric will be too high (optimistic). Also, the computed content diff occasionally includes changes that are already present in the target and should not be counted; these will unduly lower the metric. !Computing and caching Tran
History
Enable pagination
rows per page
HTML diff
Side-by-side diff
Side-by-side diff by characters
Inline diff
Inline diff by characters
Full side-by-side diff
Full side-by-side diff by characters
Full inline diff
Full inline diff by characters
Unified diff
Side-by-side view
Information
Version
Tue 04 of Mar, 2008 19:12 GMT
Seb
7
Actions
View
Source
Fri 22 of Feb, 2008 22:05 GMT
Seb
6
Actions
View
Source
Fri 22 of Feb, 2008 19:34 GMT
Seb
5
Actions
View
Source
Fri 22 of Feb, 2008 19:11 GMT
Seb
4
Actions
View
Source
Fri 22 of Feb, 2008 18:48 GMT
Seb
3
Actions
View
Source
Fri 22 of Feb, 2008 18:39 GMT
Seb
2
Actions
View
Source
Fri 22 of Feb, 2008 18:14 GMT
Seb
1
Actions
View
Source
Select action to perform with checked...
Remove
OK
SideMenu
Home Page
Demo screencast
Last changes
CLWE To do
Forum Wiki-translation
BabelWiki Workshop
Architecture document (pdf)
CLWE Paper (pdf)
Edit Side Menu
Latest Changes
No records to display
...more
Search
Find
Log In
Username:
Password:
I forgot my password
CapsLock is on.
Log in
Upcoming Events
No records to display