Text Recognition Pane (Options Dialog Box > GUI Testing Tab)

Relevant for: GUI tests and components

This pane enables you to configure how OpenText Functional Testing identifies text in your application. You can use this pane to modify the default text capture mechanism, optical character recognition (OCR) mechanism mode, and the language dictionaries the OCR mechanism uses to identify text.

To access

Select Tools > Options > GUI Testing tab > Text Recognition node.

Important information

  • Clicking the Restore Factory Defaults button resets all product settings, including ALM connection information, the recent files list, and anything defined in the Options dialog box, such as startup options, UI layout, GUI, API, and BPT testing options, remote connection credentials, and more.

  • The settings in this pane apply when OpenText Functional Testing uses OCR to identify text, as described in Text recognition in run-time.

  • To use the ABBYY OCR Engine for text recognition, you must include it when installing OpenText Functional Testing. If ABBYY is not installed, Tesseract is used as the default engine instead.

Options are described below. The options differ depending on the text recognition engine you select.

When deciding which OCR engine to use, see Decide which OCR engine to use.

ABBYY OCR Text Recognition engine (default)

UI Element Description
Text recognition mode

The manner in which to recognize text in the application:

  • Single text block mode: Instructs the OCR mechanism to treat the area as a single text block. This is especially useful when capturing text on small objects or in a small text area. Select this radio button if the text on the object is uniform in font, size, color, and background.

  • Multiple text block mode: Instructs the OCR mechanism to handle separately each text area in the object that has a different background, font, or size. The OCR mechanism decides where to divide the text blocks according to an internal algorithm. Select this radio button only if the text on the object comprises different fonts, font sizes, colors, and/or backgrounds.

Available languages

Lists all of the language dictionaries that the OCR mechanism can potentially use when retrieving text from the object.

To specify the language dictionaries used by the OCR mechanism: Move a language to the Supported languages list box by selecting a language and clicking the right arrow button (>).

Supported languages

Lists the language dictionaries that the OCR mechanism uses when capturing text.

Add languages from the Available languages to the Supported languages to specify one or more languages to use for text recognition.

Preview

Previews the text recognition result.

Select or customize a set of OCR configuration settings, upload an image of your application, preview the corresponding result, and then adjust your settings if necessary.

For details, see Preview Pane (Options Dialog Box > GUI Testing Tab > Text Recognition).

Pattern Training

Click Use a trained pattern to help the ABBYY OCR engine identify unusual or unclear characters in your application.

You can create a trained pattern, specify an existing pattern to use, or modify an existing pattern.

For details, see ABBYY OCR Pattern Training.

Preprocess the image before using text recognition

Enables the text recognition to identify image elements before identifying the text in the specified object or area.

Baidu or Google OCR Text Recognition engine

UI Element Description
Language Type

(Baidu only)

The single language to use for text recognition, or the combination of Chinese and English.

Access Token or Key

The access token or key to use to access your Baidu or Google Cloud account.

Proxy Settings

If your Internet connection requires a proxy, specify the proxy server address and authentication details.

Proxy type. Defines the type of proxy to use when connecting to the Digital Lab server:

  • System Proxy

  • HTTP Proxy

Proxy address. The proxy IP address and port. If System Proxy is selected, this field is populated automatically and is read-only.

Specify authentication. Enables you to connect to the Digital Lab server using a specific account.

Proxy user name and password. The credentials used to connect to the Digital Lab server, if Specify authentication is selected.

Note:  

  • When proxy settings are not selected, no proxy is used for this connection.

  • If you are using UFT OCR for mobile applications, make sure that the proxy settings for connecting to the Digital Lab server (Options > GUI Testing > Mobile > Proxy Settings) are the same as the OCR proxy settings (Options > GUI Testing > Text Recognition > Proxy Settings).

Test Connection

Click Test Connection to verify your connection details and confirm that OpenText Functional Testing can connect to the cloud OCR service.

Preview

Upload an image of your application, preview the text recognition result, and adjust your Text Recognition options if necessary.

For details, see Preview Pane (Options Dialog Box > GUI Testing Tab > Text Recognition).

Tesseract OCR Text Recognition engine

UI Element Description
Text recognition mode

The manner in which to recognize text in the application:

  • Single text block mode: Instructs the OCR mechanism to treat the area as a single text block. This is especially useful when capturing text on small objects or in a small text area. Select this radio button if the text on the object is uniform in font, size, color, and background.

  • Multiple text block mode: Instructs the OCR mechanism to handle separately each text area in the object that has a different background, font, or size. The OCR mechanism decides where to divide the text blocks according to an internal algorithm. Select this radio button only if the text on the object comprises different fonts, font sizes, colors, and/or backgrounds.

Symbols for text recognition

Enables you to restrict text recognition to specific characters.

This option is supported for English only.

Current language pack

The current language to use in text recognition. Only one language pack can be used at a time.

To install a new language pack, first download it from the Tesseract OCR language pack download site: https://github.com/tesseract-ocr/tessdoc/blob/master/Data-Files.md#data-files-for-version-400-november-29-2016.

After downloading, add the files to the <Installdir>/dat/tessdata4.1 folder.

Note: Tesseract OCR results may differ depending on the Tesseract version.
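The language pack installation step can be scripted. The following Python sketch copies a downloaded language data file into the tessdata4.1 folder. It is an illustration only: the function name install_language_pack is hypothetical, the installation folder must be supplied by you, and the check for the .traineddata extension reflects the usual Tesseract file naming rather than anything this pane enforces.

```python
import shutil
import tempfile
from pathlib import Path


def install_language_pack(traineddata_file: Path, install_dir: Path) -> Path:
    """Copy a downloaded language data file into <Installdir>/dat/tessdata4.1.

    install_dir stands in for <Installdir>; its real location depends on
    your installation and is not assumed here.
    """
    if traineddata_file.suffix != ".traineddata":
        raise ValueError(f"Expected a .traineddata file, got: {traineddata_file.name}")
    target_dir = install_dir / "dat" / "tessdata4.1"
    if not target_dir.is_dir():
        raise FileNotFoundError(f"Folder not found: {target_dir}")
    # copy2 returns the path of the newly created file inside target_dir.
    return Path(shutil.copy2(traineddata_file, target_dir))


# Demonstration against a throwaway folder layout, so no real installation is touched.
with tempfile.TemporaryDirectory() as tmp:
    fake_install = Path(tmp) / "install"
    (fake_install / "dat" / "tessdata4.1").mkdir(parents=True)
    downloaded = Path(tmp) / "fra.traineddata"
    downloaded.write_bytes(b"placeholder language data")
    print(install_language_pack(downloaded, fake_install).name)
```

After copying the file, select the language in Current language pack. Only one language pack can be used at a time.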

Use default Tesseract configuration

Instructs OpenText Functional Testing to use the standard Tesseract configuration, as noted in the language data file.

Use configuration from file

Enables you to load configuration settings from an externally defined file.

Not all configuration options are supported for use in OpenText Functional Testing. When you run a test, the Output pane reports any parameters that are ignored, and any related errors, using messages such as the following:

  • %parameter_name% parameter is not supported

  • The Tesseract OCR engine has stopped due to an error. Check your Tesseract configuration and try again.

  • The value type for the %parameter_name% parameter is incorrect. The parameter was ignored during the test run

  • The %parameter_name% parameter is not supported by the Tesseract OCR engine
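If you save the Output pane text, you can extract the ignored parameter names from it with a short script. The following Python sketch assumes the messages appear one per line in the shapes listed above. The function name ignored_parameters and the sample text are hypothetical, and the real Output pane formatting may differ.

```python
import re

# Assumed message shapes, based on the messages listed in this topic.
IGNORED_PARAM = re.compile(r"(\S+) parameter is (?:not supported|incorrect)")


def ignored_parameters(output_text: str) -> list[str]:
    """Return parameter names reported as unsupported or mistyped, in order."""
    names = []
    for line in output_text.splitlines():
        match = IGNORED_PARAM.search(line)
        if match and match.group(1) not in names:
            names.append(match.group(1))
    return names


# Made-up sample text for illustration only.
sample = """\
textord_debug_tabfind parameter is not supported
The value type for the tessedit_pageseg_mode parameter is incorrect. The parameter was ignored during the test run
The Tesseract OCR engine has stopped due to an error. Check your Tesseract configuration and try again.
"""
print(ignored_parameters(sample))
# prints ['textord_debug_tabfind', 'tessedit_pageseg_mode']
```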

For details on how to create your own configuration file, see https://github.com/tesseract-ocr/tesseract/blob/master/doc/tesseract.1.asc.
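A Tesseract configuration file is a plain text file with one parameter per line, written as the parameter name followed by a space and its value. The following Python sketch generates such a file. The helper name write_tesseract_config and the file name are hypothetical. The two parameters shown are standard Tesseract parameters, but this topic does not confirm that OpenText Functional Testing supports them, so check the Output pane after a test run.

```python
import tempfile
from pathlib import Path


def write_tesseract_config(path: Path, params: dict[str, str]) -> str:
    """Write parameters as 'name value' lines and return the file content."""
    lines = []
    for name, value in params.items():
        # Parameter names are single tokens, so reject empty names or whitespace.
        if not name or any(ch.isspace() for ch in name):
            raise ValueError(f"Invalid parameter name: {name!r}")
        lines.append(f"{name} {value}")
    text = "\n".join(lines) + "\n"
    path.write_text(text, encoding="utf-8")
    return text


# Example: restrict recognition to digits and keep spacing between words.
with tempfile.TemporaryDirectory() as tmp:
    config_path = Path(tmp) / "digits_only.config"
    content = write_tesseract_config(
        config_path,
        {"tessedit_char_whitelist": "0123456789", "preserve_interword_spaces": "1"},
    )
    print(content, end="")
# prints:
# tessedit_char_whitelist 0123456789
# preserve_interword_spaces 1
```

Select Use configuration from file and point it to the generated file to try the settings.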

Preprocess the image before using text recognition

Enables the text recognition to identify image elements before identifying the text in the specified object or area.

When you use this option, OpenText Functional Testing converts the image to black and white and resizes it. This slows text recognition, so expect test runs that perform text recognition with this option to take longer.

Use this option when your application uses a very small font size (10 pt or smaller).
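To illustrate what converting an image to black and white and resizing it involves, the following Python sketch applies both steps to a tiny grayscale image stored as nested lists. It is not the product's implementation: the fixed threshold of 128, the nearest-neighbor enlargement, and the function names are all assumptions chosen for illustration.

```python
def to_black_and_white(gray, threshold=128):
    """Map each grayscale pixel (0-255) to pure black (0) or pure white (255)."""
    return [[255 if px >= threshold else 0 for px in row] for row in gray]


def upscale(image, factor):
    """Enlarge an image by repeating each pixel 'factor' times in both directions."""
    if factor < 1:
        raise ValueError("factor must be at least 1")
    enlarged = []
    for row in image:
        wide_row = [px for px in row for _ in range(factor)]
        # Append independent copies so that rows can be edited separately later.
        enlarged.extend(list(wide_row) for _ in range(factor))
    return enlarged


# A 2x2 grayscale image: dark, light / light, dark.
tiny = [[30, 200],
        [220, 90]]

for row in upscale(to_black_and_white(tiny), 2):
    print(row)
# prints:
# [0, 0, 255, 255]
# [0, 0, 255, 255]
# [255, 255, 0, 0]
# [255, 255, 0, 0]
```

Both steps process every pixel, and enlarging increases the number of pixels passed to the OCR engine, which is consistent with the longer run times noted above.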