Repetitions

When calculating your localization budget, the biggest factors are the number of source words and the cost per word charged by your vendor. String and word repetitions should also be considered: they let vendors estimate more accurately how long a job will take and therefore offer a more competitive quote, lower than the one they would give if every string were treated as unique.
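As a rough illustration of why repetitions affect a quote, the sketch below compares a flat per-word estimate with one that charges repeated words at a reduced rate. The function name, the rate, and the reduction factor are all hypothetical; actual pricing depends entirely on your vendor.

```python
def estimate_cost(total_words, repeated_words, rate_per_word, repetition_factor):
    """Estimate a job cost when repeated words are billed at a reduced rate.

    repetition_factor is a hypothetical multiplier between 0 and 1 that the
    vendor applies to repeated words (1 means no reduction at all).
    """
    unique_words = total_words - repeated_words
    unique_cost = unique_words * rate_per_word
    repeated_cost = repeated_words * rate_per_word * repetition_factor
    return unique_cost + repeated_cost


# Hypothetical figures: 10,000 source words, 2,000 of them repetitions.
flat_quote = estimate_cost(10_000, 0, 0.10, 1.0)
adjusted_quote = estimate_cost(10_000, 2_000, 0.10, 0.25)
print(f"All strings unique: {flat_quote:.2f}")
print(f"With repetitions:   {adjusted_quote:.2f}")
```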

Transifex provides repetition statistics in detailed reports at the resource, language, and project level, generated on request. By repetitions, we mean the number of identical strings in a resource, with one of them considered the original string. Transifex provides statistics for the following types of repetitions:

  • Untranslated repetitions.
    This information refers to a target language and to the number of untranslated repetitions found in that specific language.
    This means that the same resource may have a different number of repetitions for each target language.
  • Repetitions in the source language.
    This information refers to the source language of a Transifex project and is unrelated to the translation status of a string.


Example

Source file:

<resources>
  <string name="key1">Privacy Policy</string>
  <string name="key2">Privacy Policy</string>
  <string name="key3">Privacy Policy</string>
  <string name="key4">Security Policy</string>
  <string name="key5">Security</string>
  <plurals name="minutes_count">
    <item quantity="one">%s minute</item>
    <item quantity="other">%s minutes</item>
  </plurals>
  <plurals name="minutes_count_11">
    <item quantity="one">%s minute</item>
    <item quantity="other">%s minutes</item>
  </plurals>
</resources>

In the editor, the strings will be available for translation as follows:

strings_lists_repetitions.png#asset:9481


Analysis

  • Repetitions in the source language

    Source string: Privacy Policy (2 words, 3 instances found - one is considered the original string so it's excluded)

    Repetitions: 4 words

    Pluralized source string: %s minutes (2 words, 2 instances found - one is considered the original string so it's excluded)

    Repetitions: 2 words

    Note: In a pluralized string, we count the words in the "other" plural form.

    Total repetitions (source language): 6 words

  • Repetitions in the target language (untranslated repetitions)

    Total untranslated repetitions: 6 words (no translations applied yet)

    If we translate the string with key "key1", the number of untranslated repetitions for the string "Privacy Policy" remains the same (4 words), since one instance is considered the original string. If we translate both "key1" and "key2", the number of untranslated repetitions drops to 2 words, since only one untranslated repeated string remains for this text.

    In this case, the total number of untranslated repetitions for this resource and language will be 2 untranslated words (for the string "Privacy Policy") + 2 untranslated words (for the string "%s minutes") = 4 untranslated words in total.
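The counting rules from the analysis above can be sketched in a few lines of Python. This is only a minimal model that reproduces the numbers in the example, not Transifex's actual implementation: the function names are hypothetical, words are counted with a naive whitespace split, and the rule for untranslated repetitions (one instance per text is always treated as the original) is inferred from the example.

```python
from collections import Counter

# (key, text) pairs from the example; plural entries use their "other" form.
STRINGS = [
    ("key1", "Privacy Policy"),
    ("key2", "Privacy Policy"),
    ("key3", "Privacy Policy"),
    ("key4", "Security Policy"),
    ("key5", "Security"),
    ("minutes_count", "%s minutes"),
    ("minutes_count_11", "%s minutes"),
]


def word_count(text):
    # Assumption: naive whitespace split; the real word counter may differ.
    return len(text.split())


def source_repetitions(strings):
    """Repeated words in the source: every instance beyond the first."""
    totals = Counter(text for _, text in strings)
    return sum((n - 1) * word_count(text) for text, n in totals.items())


def untranslated_repetitions(strings, translated_keys):
    """Repeated words still untranslated, given the set of translated keys."""
    totals = Counter(text for _, text in strings)
    untranslated = Counter(
        text for key, text in strings if key not in translated_keys
    )
    words = 0
    for text, total in totals.items():
        # One instance always counts as the original, so at most
        # (total - 1) instances can be untranslated repetitions.
        repeated = min(untranslated[text], total - 1)
        words += repeated * word_count(text)
    return words


print(source_repetitions(STRINGS))                          # 6
print(untranslated_repetitions(STRINGS, set()))             # 6
print(untranslated_repetitions(STRINGS, {"key1"}))          # 6
print(untranslated_repetitions(STRINGS, {"key1", "key2"}))  # 4
```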


This information is exposed in every language pop-up as follows:

pop-up_repetitions.png#asset:9482

Note

Repetitions are updated when one of the following events happens:

1. When a resource is created or updated

2. When a string is translated

3. When a new target language is added to a project

Resources Progress Report

To get repetitions for a specific project at the resource level for every language, use the report that can be generated from the resources page.

resources_progress_report_repetitions_request.png#asset:9486


Once you request the report, a zip file containing two CSV files is generated:

1. A CSV file with string and word repetitions per resource/language

resources_progress_report_repetitions_csv_report.png#asset:9487

2. A CSV file as described here

Resource Progress Report

To get repetitions for a specific resource only, at the language level, use the report that can be generated from the resource details page.

resource_progress_report_repetitions.png#asset:9483

Once you request the report, a CSV file is generated that includes repetition information for the specific resource, for both source and target languages:

  • untranslated repeated stringcount (untranslated repetitions found per target language)
  • untranslated repeated wordcount (untranslated repetitions found per target language)
  • total repeated stringcount (repetitions found in source language)
  • total repeated wordcount (repetitions found in source language)
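Once downloaded, a report like this can be processed with any CSV tooling. The sketch below is a minimal example that reads the untranslated repeated word count per language. The sample data, the "language" column, and the exact header spellings are assumptions made for illustration; check the header row of your own report and adjust the column names accordingly.

```python
import csv
import io

# Hypothetical sample; real header names and layout may differ.
SAMPLE_REPORT = """language,untranslated repeated wordcount,total repeated wordcount
el,4,6
fr,6,6
"""


def repetition_words_by_language(
    report_file,
    language_column="language",
    words_column="untranslated repeated wordcount",
):
    """Map each language code to its untranslated repeated word count."""
    reader = csv.DictReader(report_file)
    return {row[language_column]: int(row[words_column]) for row in reader}


# With a real report, pass open("report.csv", newline="") instead.
result = repetition_words_by_language(io.StringIO(SAMPLE_REPORT))
print(result)  # {'el': 4, 'fr': 6}
```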

Languages Progress Report

To get repetitions for a specific project at the language level, use the report that can be generated from the languages page.

languages_progress_report_request_csv.png#asset:9490

Once you request the report, a CSV file will be generated containing the information described here. The columns that refer to repetitions are:

  • untranslated repeated stringcount (we compare the untranslated repetitions across all resources in the project, on a target language level)
  • untranslated repeated wordcount (we compare the untranslated repetitions across all resources in the project, on a target language level)
  • total repeated stringcount (we compare the repetitions across all resources in the project, on a source language level)
  • total repeated wordcount (we compare the repetitions across all resources in the project, on a source language level)

Languages Progress Report for All Projects

If you have multiple projects in your organization and want to quickly export statistics for all of them, you can request the report from the All projects page, as described here.

all_projects_report.png#asset:9491

The columns that refer to repetitions are:

  • untranslated repeated stringcount (we compare the untranslated repetitions across all resources per project, on a target language level)
  • untranslated repeated wordcount (we compare the untranslated repetitions across all resources per project, on a target language level)
  • total repeated stringcount (we compare the repetitions across all resources per project, on a source language level)
  • total repeated wordcount (we compare the repetitions across all resources per project, on a source language level)

Repeated Strings Filter

To find all the strings and their repetitions in the Transifex Web Editor, apply the "Repeated Strings" filter as follows:

Repeated_Strings_Filter.png#asset:6653

A list with all the strings and their identical instances will be returned:

repeated_filter_in_the_editor.png#asset:9492


If you want to find only the untranslated repetitions, combine the above filter with the filter 'translated:no'.