EMC Documentum Content Intelligence Services 7.1

EMC Documentum Content Intelligence Services 7.1
EMC® Documentum®
Content Intelligence Services
Version 7.1
Installation Guide
EMC Corporation
Corporate Headquarters:
Hopkinton, MA 01748–9103
1–508–435–1000
www.EMC.com
Legal Notice
Copyright ©1994-2015 EMC Corporation. All rights reserved.
EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without
notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED "AS IS." EMC CORPORATION MAKES NO
REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION,
AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR
PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.
For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. Adobe and Adobe PDF
Library are trademarks or registered trademarks of Adobe Systems Inc. in the U.S. and other countries. All other trademarks used
herein are the property of their respective owners.
Documentation Feedback
Your opinion matters. We want to hear from you regarding our product documentation. If you have feedback about how we can
make our documentation better or easier to use, please send us your feedback directly at ECD.Documentation.Feedback@emc.com.
Table of Contents
Chapter 1
Introduction......................................................................................................5
Components ......................................................................................................5
Compatibility ......................................................................................................6
Related documentation .......................................................................................6
Chapter 2
Preparing the Installation .................................................................................7
Preinstallation tasks............................................................................................7
Additional preinstallation tasks for Linux hosts......................................................9
Additional preinstallation tasks for Windows hosts ..............................................10
Migrating status information for document sets...................................................10
Chapter 3
Installing Content Intelligence Services ......................................................... 11
Installing Content Intelligence Services .............................................................. 11
Installing CIS in silent mode ..............................................................................14
Chapter 4
Completing the Installation ............................................................................17
Authenticated connection between CIS and the repository ..................................17
Deploying CIS artifacts (DAR file) manually........................................................18
Enabling the repository for CIS..........................................................................18
Upgrading JDK Bundle .....................................................................................19
Chapter 5
Validating the Installation ...............................................................................23
Verifying the deployment of CIS artifacts (DAR file) ............................................23
Verifying that the repository is enabled for CIS ...................................................23
Verifying that the tables are created...................................................................24
Verifying the configuration of the entity detection server ......................................24
Verifying that all services are started..................................................................25
Chapter 6
Troubleshooting Installation Issues ...............................................................27
Modifying the ports for the entity detection server ...............................................27
Some Luxid services (4/7) are not started ..........................................................28
Chapter 7
Uninstalling Content Intelligence Services.....................................................29
Uninstalling (Windows hosts) ............................................................................29
Uninstalling (Linux hosts) ..................................................................................29
Downgrading Content Intelligence Services .......................................................30
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
3
Preface
This guide provides the instructions for installing the server-side components of Content Intelligence
Services (CIS). CIS is administered through Documentum Administrator. The EMC Documentum
Administrator Deployment Guide provides instructions for installing Documentum Administrator.
Intended audience
This guide is intended primarily for administrators who are installing Content Intelligence Services
with Documentum platform or with xCelerated Composition Platform (xCP).
Revision history
Revision Date
Description
June 2015
Added the section Upgrading JDK Bundle, page 19.
May 2014
Initial publication
4
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Chapter 1
Introduction
This chapter contains the following topics:
•
Components
•
Compatibility
•
Related documentation
EMC Documentum Content Intelligence Services (CIS) is the content analytics component for EMC
Documentum. Use content analytics to analyze the textual content of documents and know what
the documents are about without having to read them. It enables you to find documents rapidly by
enriching search facets with discovered metadata and get the gist of a document by viewing the
discovered metadata with the document.
As in the previous versions of Documentum, you can perform categorization for a WDK-based
application. This mode of categorization is referred to as classic categorization.
Content Intelligence Services is also available in xCP deployments. The entity detection and pattern
detection are only available in xCP applications. Categorization is also available in xCP applications
but the taxonomies used for the categorization are not available in the Content Intelligence node in
Documentum Administrator.
The EMC Documentum Content Intelligence Services Administration Guide provides more information
about CIS and the various types of processing.
Components
Content Intelligence Services includes these key components:
• The Content Intelligence Services client (CIS client) creates, manages, or displays the taxonomy
used for categorizing documents. Examples of CIS clients are Documentum Administrator,
Webtop, xCP 2.1 applications or any custom application using the Content Intelligence Application
Programming Interface (CI API). Use Documentum Administrator to configure CIS. The CI API
handles communication between the CIS client, the CIS server, and the Documentum repository.
• The Content Intelligence Services server (CIS server) performs the automatic categorization of
documents based on taxonomy and category definitions. It also performs the pattern detection, and
triggers the entity detection.
• The entity detection server performs the entity detection analysis using cartridges.
• A repository is also required to store the CIS data (such as taxonomy definitions, document set
definitions, and discovered metadata).
Documentum Administrator (DA) includes a Content Intelligence node that enables you to manage
CIS resources and the analysis results. You must install Documentum Administrator separately. The
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
5
EMC Documentum Administrator Deployment Guide provides instructions for installing Documentum
Administrator.
Compatibility
For any Documentum product, EMC recommends you to install CIS with the other Documentum
components of the same version. To avoid potential compatibility issues, install CIS 7.1 with Content
Server 7.1 and Documentum Administrator 7.1.
CIS 7.1 is compatible with:
• Content Server version 6.7SP2 or higher, with a Global Registry Repository version 6.7SP2 or
higher.
• Documentum Administrator version 6.7SP2 or higher.
CIS 7.1 is not compatible with any version of CenterStage or WebPublisher.
CIS does not support Branch Office Caching Services (BOCS).
Related documentation
The following documentation is available for using and customizing Content Intelligence Services:
• EMC Documentum Content Intelligence Services Administration Guide
• EMC Documentum Content Intelligence Services Release Notes
• EMC Documentum Environment and System Requirements Guide
6
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Chapter 2
Preparing the Installation
This chapter contains the following topics:
•
Preinstallation tasks
•
Additional preinstallation tasks for Linux hosts
•
Additional preinstallation tasks for Windows hosts
•
Migrating status information for document sets
Preinstallation tasks
• Uninstall any previous installation of CIS. Uninstalling Content Intelligence Services, page 29
provides details to uninstall CIS depending on the installed version.
• Make sure that the free disk space is higher than 4 GB and the temporary directory at least 1 GB
before starting the installation. If the temporary directory is not large enough, you can modify the
corresponding environment variables (TEMP and TMP) to use another (bigger) directory. You can
do it in the Environment variables dialog box on Windows hosts, or using the set command in a
command DOS session. The temporary directory can be on a different drive than the one used for
the CIS installation.
• If you install CIS on a host machine with an underscore character in its name, the installation does
not create the authentication file that stores the credentials for the repository access. To work
around this issue, use the IP address of the host machine instead of its name, or enable CIS in DA to
trigger the creation of the authentication file.
• If not already done, configure the global registry in Documentum Administrator.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
7
Preparing the Installation
Installation check list
Prepare the required information
Before starting the installation process, make sure that you know the information required during
the installation.
Table 2
Required information for CIS installation
Required information
Description
Host name and port number for the connection broker
The host name of the connection broker and its port
number.
Installation Owner Password
The network password for the user performing the
installation.
Repository name
The name of the repository that CIS uses.
CIS host
The full name (including the domain name) or IP
address of the server on the network (by default, it is
set to the machine name).
CIS port
The port of CIS server, default it 8079.
CIS JMX port
The port for CIS JMX agent, default is 8061.
Entity detection server port
The port for the entity detection server, default is
55550. The 20 following ports are reserved and must
not be used.
User name and password for CIS repository
The name and password of the user to authenticate
CIS server against the repository used by CIS.
Provide the following information about the global registry:
Repository Name
The name of the global registry.
Global registry user login and password.
The login name and password of the user for the
global registry
Make sure that no application (such as an antivirus application) is locking the following ports:
• RMI port 1099
• port 3690
• port 4445
• port range 7130-7229
• port 9000
• port range 10000-11000
• port range 32000-32xxx
• port 37579
• port 40002
• port range 55550-55649
8
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Preparing the Installation
The entity detection server uses these ports. If one of them is locked, the installation can continue but
you must free the required ports before performing any entity detection analysis. To change the default
RMI port, refer to the procedure Modifying the ports for the entity detection server, page 27.
Additional preinstallation tasks for Linux hosts
On Linux hosts, perform the following tasks before you begin the installation process:
• Confirm that these 64-bit RPM packages are installed:
– glibc-version-release.architecture (e.g. glibc-2.12-1.107.el6.x86_64.rpm)
– libXau-version-release.architecture(e.g. libXau-1.0.6-4.el6.x86_64.rpm)
– libxcb-version-release.architecture (e.g. libxcb-1.8.1-1.el6.x86_64.rpm)
– libX11-version-release.architecture (e.g. libX11-1.5.0-4.el6.x86_64.rpm)
– libXext-version-release.architecture(e.g. libXext-1.3.1-2.el6.x86_64.rpm)
– libXi-version-release.architecture (e.g. libXi-1.6.1-3.el6.x86_64.rpm)
– libXtst-version-release.architecture(e.g. libXtst-1.2.1-2.el6.x86_64.rpm)
• Install these 32-bit RPM packages for Luxid:
– glibc-version-release.architecture (e.g. glibc-2.12-1.107.el6.i686.rpm)
– nss-softokn-freebl-version-release.architecture (e.g. nss-softokn-freebl-3.12.9-11.el6.i686.rpm)
– libgcc-version-release.architecture (e.g. libgcc-4.4.7-3.el6.i686.rpm)
– libstdc++-version-release.architecture (e.g. libstdc++-4.4.7-3.el6.i686.rpm)
Here version, release, and architecture in the package label are the available version number,
release number, and architecture specifier of the packaged software that is compatible with your
Linux version.
• Set the environment variables manually for CIS.
The environment variables are common to Documentum applications, they are described in the
following table. If the installation program does not find the needed environment variables, it aborts
the installation.
Some of these environment variables may exist, and if you have other Documentum products installed
on your system, some of the values may exist as well. The EMC Documentum Foundation Classes
Installation Guide provides more information about these variables.
To configure your Linux environment:
1. Set the environment variables:
a. Edit the installation owner’s .cshrc file (C shell) or .profile file (Bourne or Korn shells).
Alternatively, edit a file that has the name .cshrc file or .profile file.
b. Add the following variable:
Table 3
Required environment variable
Environment variable
Description
DOCUMENTUM
The full path of the destination directory (and
Content Server root, if it is installed on the
application server host).
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
9
Preparing the Installation
2. Check the installation requirements:
• /usr/dt/bin and /usr/openwin/bin are on the path
• DISPLAY is set to localhost:0.0
Additional preinstallation tasks for Windows
hosts
Ensure that the file msvcr71.dll is located at: C:\Windows\SysWOW64 in the target environment
before you begin the installation process. The installation process fails if you try to install CIS in an
environment that does not include this library.
Migrating status information for document sets
If you upgrade from CIS version 6.0, version 6.0 SP1, or version 6.5, you can use the migration script
(introduced in CIS 6.5 SP2). The script enables you to migrate the information related to the status
of the document sets already processed for categorization by a previous CIS version. Migrating this
information avoids the reprocessing of all documents by the new CIS server.
This procedure is not applicable for entity detection.
To migrate status information:
1. Before installing CIS, create a backup copy of the following file:
<CIS installation directory>\deploy\cis.ear\cis.war\repodata\docstatus
\document_processing_status.serialized
In previous versions of CIS, the default installation directory for CIS was:
C:\Documentum\jboss4.3.0\server\DctmServer_CIS.
2. Install the new version of CIS.
3. Copy the document_processing_status.serialized file to the new CIS server.
4. On CIS host machine, locate the import_docstatus.bat file (on Windows hosts, or import_docstatus
on Linux hosts); it can be found at <CIS installation directory>/bin.
5. Run the migration script with the following parameter:
import_docstatus.bat
document_processing_status.serialized
The script output indicates which document sets have been successfully migrated and which
ones have been skipped.
10
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Chapter 3
Installing Content Intelligence Services
This chapter contains the following topics:
•
Installing Content Intelligence Services
•
Installing CIS in silent mode
Installing Content Intelligence Services
Before performing the following procedure, review the EMC Documentum Environment and System
Requirements Guide to ensure that you have met the hardware and software requirements.
To install Content Intelligence Services:
1. Log in to the CIS server host machine. You must have Administrator privileges on the host
machine to run the installation program. The person who installs CIS server is automatically
the installation owner.
On Windows, log in as a user with Administrator privileges.
On Linux, you can be a non-superuser to be able to install CIS.
2. From the EMC Online Support (https://support.emc.com), download the CIS
software file: Content_Intelligence_Services_<version_number>_Windows.zip or
Content_Intelligence_Services_<version_number>_Linux.tar to a temporary directory on the host
machine.
Starter taxonomies are also available from the download site. The EMC Documentum Content
Server Administration and Configuration Guide provides more details.
3. Unzip the downloaded file.
4. Run the installer file:
• On Windows hosts: cisSetup.exe
• On Linux hosts: cisSetup.bin
On Windows hosts, right-click on the installer file cisSetup.exe and select Run as administrator.
The Welcome window of the installation wizard appears with a list of products and components
that you can install on the machine. If an older version of a product or component is already
installed, uninstall it before you proceed.
5. Click Next.
The license agreement appears.
6. Read the license agreement, select the option I accept the terms of the license agreement and
click Next.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
11
Installing Content Intelligence Services
7. On Windows hosts, enter Installation Owner Password. For the installation owner password,
enter the network password for the user performing the installation. The password is required for
setting up server security and services on the server that hosts CIS. Click Next.
On Linux hosts, this step is skipped.
8. On Windows hosts, specify the destination directory for CIS, and click Next.
Documentum products use this directory to store working files, as well as program settings and log
files. If the installation program finds a registry entry that contains the required information for a
previously installed DFC runtime environment, it skips this step.
On Linux hosts, this step is skipped because you specified the DOCUMENTUM environment
variable before the installation.
9. Provide the following information to configure the CIS server:
Field
Description
Repository name
The name of the repository that CIS uses.
CIS host
The full server name, including the name of the
machine and the domain name (sub-domain is not
supported), or IP address of the host machine for
CIS server. By default, it is set to the machine name.
You can configure the same host as the production
server and as the test server. CIS clients such as
Documentum Administrator use the host name to
connect to the CIS server. You can modify it later in
Documentum Administrator.
CIS port
The port number for CIS server. Default is 8079.
CIS JMX port
The port number for CIS JMX agent. Default is
8061.
Entity detection server host port
The port used for the communication with the
entity detection server. The 20 following ports are
reserved and must not be used. Default is 55550.
After the installation, you can modify the CIS port and CIS JMX port in the cis.properties
configuration file as described in EMC Documentum Content Intelligence Services Administration
Guide.
10. Review the summary. The installation program summarizes what it plans to install and where it
plans to install it. Click Back to make changes, if any. Otherwise, click Install.
The installation checks whether the required ports are available. If some of them are already used,
a warning message appears. You can either:
• Ignore and continue: Allows you to proceed with the installation but you can modify the
ports manually afterwards.
• Cancel and free ports: Allows you to cancel the installation, manually free up the required
ports, and restart the installation.
If all ports are available, CIS and related products are installed. You cannot undo this installation
step.
11. Specify the hostname and port number for the machine that hosts the primary connection broker,
and click Next.
12
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Installing Content Intelligence Services
You can use an IP address or a DNS name.
After the installation, you can change the connection broker host and port values by editing
the dfc.properties file in<CIS installation directory>/config/ and modifying the parameters:
dfc.docbroker.host and dfc.docbroker.port.
12. In the Designate Global Registry window, complete these substeps:
a. To designate a global registry at a later time, unselect Designate the global registry repository
to use and click Next to bypass the following substeps.
You will have to enable a global registry after completing the installation process.
b. In Repository Name, type the name of the repository to be used as the global registry.
c. In the remaining two text boxes, specify the global registry user login and password. The global
registry user must be a user who is restricted to READ privileges on the /System/Modules
folder on the repository designated as the global registry.
d. If the global registry or the global registry user is not configured or inaccessible to the client
where you are installing, unselect Test Connection.
e. Click Next.
If Test Connection is checked, the installation program tries to validate the global registry and
user settings that you have specified.
If the installation program detects a global registry on the machine, it skips this step.
13. For the repository used by CIS, specify the following information:
Field
Description
Repository user name for CIS
The name of the user to authenticate the CIS server
against the repository used by CIS.
Repository user password for CIS
The password of the user to authenticate the CIS
server against the repository.
Repository user domain for CIS
The Windows domain as part of the credentials of
the user to authenticate the CIS server against the
repository.
Test Connection
Select this option to test the connection with the
repository.
The information you provide during this step enables the repository for CIS and deploys the CIS
artifacts (DAR file) in the repository. The repository must be running for this action to occur.
If the repository is not running, or if the enabling fails during the installation, you can perform or
modify this action later in Documentum Administrator.
If the repository has already been enabled for another CIS instance, the configuration is not
modified. A warning message prompts you to modify the configuration in Documentum
Administrator.
If CIS artifacts (DAR file) cannot be deployed during CIS installation, deploy the DAR file
manually as described in Deploying CIS artifacts (DAR file) manually, page 18.
14. Click Done.
15. On Linux, if you have not logged in as the root user, change to the root user and then run the
command $DOCUMENTUM/CIS/service/cis_service_register add to create the CIS service.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
13
Installing Content Intelligence Services
By default, CIS is installed in the directory:
• C:\Documentum\CIS on Windows hosts
• $DOCUMENTUM/cis on Linux hosts
This directory is referenced in CIS documentation as the path <CIS installation directory>.
The default installation folder for the entity detection server is:
• C:\Documentum\CIS\Temis\Luxid on Windows hosts
• $DOCUMENTUM/cis/Temis/Luxid on Linux hosts
CIS also installs the following third-party software:
• Oracle Outside In Content Access
• Snowball Stemmer Libraries
• Temis Luxid® Annotation Factory and TM360
Check the installation log file install.log located in the CIS installation folder to make sure that the
installation is successful.
You can modify some installation parameters such as CIS port, CIS JMX port, the repository name
for CIS, and so on in the cis.properties file as described in EMC Documentum Content Intelligence
Services Administration Guide.
Installing CIS in silent mode
You can install Content Intelligence Services using the silent (unattended) installation process. You
must use this procedure cautiously and only if you cannot use the graphical installation process
(installation wizard) because any error is hard to fix during silent installation.
Most parameters correspond to the information asked during the installation using the wizard, refer to
Installing Content Intelligence Services, page 11, and write down all parameters.
The silent installation invokes the installation program from a command line and gives it a
configuration file that enables the installation to proceed without further interaction.
To install CIS in silent mode:
1. Create a configuration file. You have two possibilities:
• Update the sample configuration file silent_install_sample.ini. The sample file
is located at the root of the archive file for the CIS software. For example,
Content_Intelligence_Services_<version_number>_Linux.tar, at the same level as the installer
file.
• Create the configuration file from scratch by running the installation in real-time, as described
in the following step.
To create a configuration file by recording a graphical installation, use a command such as the
following at the command prompt:
cisSetup.bin -r home/installer/myFile.ini
This is the command for Linux. For Windows operating systems, replace cisSetup.bin with
cisSetup.exe.
14
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Installing Content Intelligence Services
After the file is generated, you can open the file with an editor, update the file, and change the
values of the required variables.
You can replace C:\myFile.ini with any file you choose. Give the full path, not a path relative to
the current directory.
Running this command creates myFile.ini as an installer configuration file. It runs the installation
program interactively and saves your inputs.
Note: This process records the information during a real-time installation. If you use this method
to create your configuration file, it performs an actual installation during the process.
2. Run the installation program silently by using a command such as the following at the command
prompt:
cisSetup.bin -f C:\silent_install_sample.ini
Note: To run the installer in silent mode, you have to update the value of the variable
INSTALLER_UI to ’Silent’.
Check the installation log files located in the CIS installation folder to ensure that there are no
installation errors.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
15
Chapter 4
Completing the Installation
This chapter contains the following topics:
•
Authenticated connection between CIS and the repository
•
Deploying CIS artifacts (DAR file) manually
•
Enabling the repository for CIS
•
Upgrading JDK Bundle
Authenticated connection between CIS and the
repository
The CIS server processes documents in the repository that you have enabled for CIS. CIS works only
with one repository. You can enable only one repository with one server in the production mode and
one server in the test mode. The test mode is used only for classic categorization processing.
The connection between CIS and the repository is secured with an authentication mechanism when
you install CIS, or when you enable the repository for CIS, provide the user name and password to
authenticate the CIS server against the repository.
The authentication against the repository is required for CIS to retrieves documents, assign documents
to categories, and store the discovered metadata values.
When the CIS server starts, it checks the user credentials against the repository before it opens a
session. If the CIS server does not find any credentials, or if the credentials are invalid (for example,
after a repository change), it starts in a restricted mode. This mode allows only receiving new or
updated credentials. You cannot launch any categorization run but you can change the credentials in
Documentum Administrator. When the CIS server receives the valid credentials, it tries to connect
to the repository. If successful, it switches to full mode. The section Defining Content Intelligence
Services configuration of the EMC Documentum Content Intelligence Services Administration Guide
provides information on setting the user credentials.
If you modify the authentication information in Documentum Administrator after enabling the
repository, it creates another authentication file. There is one authentication file per repository on
the CIS server.
The authentication file name is user_<repository_name>.properties and it is available at:
<CIS installation directory>/repodata/authentication
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
17
Completing the Installation
Deploying CIS artifacts (DAR file) manually
The CIS DAR file (cis_artifacts.dar) is automatically deployed during CIS installation. If the automatic
deployment fails, install CIS DAR manually.
If you decide to use CIS with a different repository, install CIS DAR manually.
To install CIS artifacts manually:
1. From the EMC Online Support (https://support.emc.com), download the CIS
software file: Content_Intelligence_Services_<version_number>_Windows.zip or
Content_Intelligence_Services_<version_number>_Linux.tar to a temporary directory on the host
machine.
2. Unzip the archive file.
3. Go to the dar folder. This folder contains only one file: cis_artifacts.dar.
4. Install the DAR file using the DAR Installer or headless Composer. The EMC Documentum
Composer User Guide provides more details.
5. Check that the DAR is installed successfully as described in Verifying the deployment of CIS
artifacts (DAR file) , page 23.
Enabling the repository for CIS
You must enable the repository for CIS use in the following cases:
• The repository has not been enabled for CIS during CIS installation. For example, if the repository
was not started or was not reachable.
• You want to use CIS with a different repository which has never been enabled for CIS.
To enable CIS in Documentum Administrator:
1. While CIS server is running, log in to Documentum Administrator.
2. Navigate to Administration > Content Intelligence for the repository you want to process
documents from.
3. Click the Enable repository for category assignments link.
The Enable Repository for Content Intelligence page appears.
When you create taxonomies and categories, Documentum Administrator creates corresponding
folders, one folder for each taxonomy and category with the same hierarchical relationships.
When the Link to Folders option is active, CIS links categorized documents into the folders
corresponding to their assigned categories.
The default location for these folders is in a cabinet named Categories.
The default path for the Content Intelligence administrative information is /System/Application/CI.
You cannot modify these two locations.
4. Specify the following information:
18
Field
Description
Production Server
The host name of the CIS server in the production
mode. [1]
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Completing the Installation
Field
Description
Test Server
For classic categorization only, the host name of
CIS server in test mode. [1] [2]
Link assigned documents into category folders
and Update document attributes with category
assignments
For classic categorization only.
User Name for CIS Server and Password
The EMC Documentum Content Intelligence
Services Administration Guide and EMC
Documentum Content Server Administration and
Configuration Guide provide more details about
these features.
Name and password of the user to authenticate the
CIS server against the repository.
The authentication against the repository is
required when retrieving documents and assigning
documents to categories.
[1] The host name is made of the IP address or the DNS name followed by the port number. The
port number is optional. Specify it if the version of Documentum Administrator is 6.0 SP1, 6.5,
or 6.5 SP1, or if you have modified the port during installation.
192.168.1.250:8079
The default port number is 8079.
You can define the host names using the IPv6 address. When using an IPv6 address, with or
without a specific port number, enclose the host name within square brackets. For example:
[2001:0db8:0:0:0:0:1428:57ab]
[2001:0db8:0:0:0:0:1428:57ab]:5678
[2] CIS enables you to categorize documents in the production mode or the test mode. Although
you can use the same CIS server for both production and testing, separate servers are recommended
for better performance and availability. The specified CIS server must be running when you enable
the repository. The test mode is only available for categorization and not for entity detection
or pattern detection.
5. Click OK.
If the repository is already enabled in Documentum Administrator, updating this configuration
creates another authentication file.
Verifying that the repository is enabled for CIS, page 23 describes the procedure to check that the
repository is enabled correctly.
Upgrading JDK Bundle
To update the latest 1.7 version of the bundled JDK:
1. Install the latest JDK version 1.7 to $DOCUMENTUM/java64, for example
$DOCUMENTUM/java64/1.7.0_XX.
2. Stop CIS.
3. Copy the following JAR files from $DOCUMENTUM/java64/1.7.0_51/jre/lib/ext to
$DOCUMENTUM/java64/1.7.0_XX/jre/lib/ext.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
19
Completing the Installation
• certj.jar
• cryptojce.jar
• cryptojcommon.jar
• jcmFIPS.jar
• sslj.jar
4. Rename $DOCUMENTUM/java64/1.7.0_51 to $DOCUMENTUM/java64/1.7.0_51.bak.
5. Create a backup of all the files listed in the List of CIS installation files table.
Table 7
List of CIS installation files
Operating System
File Type
Windows
C:\Documentum\CIS\bin\build_metadata_extractor.bat
C:\Documentum\CIS\bin\clear_annotations.bat
C:\Documentum\CIS\bin\convert_docset_configuration.bat
C:\Documentum\CIS\bin\create_registered_tables.bat
C:\Documentum\CIS\bin\extract_metadata.bat
C:\Documentum\CIS\bin\import_docstatus.bat
C:\Documentum\CIS\bin\tef2repository.bat
C:\Documentum\CIS\composer\ComposerHeadless\plugins\com.emc.ide.external.dfc_1.0.0\dmbasic\java.ini
C:\Documentum\CIS\composer\ComposerHeadless\plugins\com.emc.ide.external.dfc_1.0.0\documentum.config\java.properties
C:\Documentum\CIS\service\conf\wrapper.conf
C:\Documentum\CIS\service\startCIS.bat
C:\Documentum\CIS\service\statusCIS.bat
C:\Documentum\CIS\service\stopCIS.bat
Linux
$DOCUMENTUM/CIS/service/statusCIS
$DOCUMENTUM/CIS/service/stopCIS
$DOCUMENTUM/CIS/service/startCIS
$DOCUMENTUM/CIS/composer/ComposerHeadless/plugins/com.emc.ide.external.dfc_1.0.0/documentum.config/java.properties
$DOCUMENTUM/CIS/composer/ComposerHeadless/plugins/com.emc.ide.external.dfc_1.0.0/dmbasic/linux/java.ini:java_library_path
$DOCUMENTUM/CIS/bin/build_metadata_extractor
20
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Completing the Installation
Operating System
File Type
$DOCUMENTUM/CIS/composer/ComposerHeadless/plugins/com.emc.ide.external.dfc_1.0.0/dmbasic/linux/java.ini:java_classpath
$DOCUMENTUM/CIS/bin/convert_docset_configuration
$DOCUMENTUM/CIS/bin/create_registered_tables
$DOCUMENTUM/CIS/bin/import_docstatus
$DOCUMENTUM/CIS/bin/clear_annotations
$DOCUMENTUM/CIS/bin/tef2repository
$DOCUMENTUM/CIS/bin/extract_metadata
6. Update all the files related to JDK 1.7.0_51 that are listed in the List of CIS installation files table.
For example, find 1.7.0_51 in file C:\Documentum\CIS\service\startCIS.bat and replace it with
1.7.0_XX.
7. On Windows, change the Path variable:
a. Right-click Computer option in the Start menu, and select Properties.
b. Click Advanced system settings.
c. In the System Properties window, click Environment Variables.
d. In the Environment Variables window, select the Path variable in the System
variables section.
e. Delete the path for the old JDK version and add the path for the new JDK version, which is
1.7.0_XX.
8. Start CIS.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
21
Chapter 5
Validating the Installation
This chapter contains the following topics:
•
Verifying the deployment of CIS artifacts (DAR file)
•
Verifying that the repository is enabled for CIS
•
Verifying that the tables are created
•
Verifying the configuration of the entity detection server
•
Verifying that all services are started
Verifying the deployment of CIS artifacts (DAR
file)
After you deploy the CIS DAR file (cis_artifacts.dar), check that the modules are created in
Documentum Administrator.
1. Log in to Documentum Administrator.
2. Navigate to Cabinets > System > Modules > Aspect and check that the module
cis_annotation_aspect is present.
3. Verify that the tables dm_annotation and dm_object_annotations have been created, as described
in Verifying that the tables are created, page 24.
Verifying that the repository is enabled for CIS
When you enable a repository for CIS, a number of sections and folders are created. You can check
their existence to make sure that the repository is enabled successfully.
To check the existence of CIS sections and folders in the repository:
1. Log in to Documentum Administrator.
2. Navigate to the Content Intelligence node and verify that the following sections are present:
• Taxonomies
• Category Class
• Document Set
• My Categories
3. Navigate to Cabinets > System > Applications > CI and verify that the following folders are
present:
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
23
Validating the Installation
•
•
•
•
•
•
•
•
•
AttributeProcessing
Classes
Configuration
DocsetConfiguration
DocumentSets
MetadataExtrationRules
Runs
TaxonomySnapshots
XMLTaxonomies
Verifying that the tables are created
The following tables are created in the repository for CIS:
• When you enable the repository, it creates the table dm_docstatus.
• When you deploy the CIS DAR file (cis_artifacts.dar), it creates the tables dm_annotation and
dm_object_annotations.
To check the existence of the tables:
1. Log in to Documentum Administrator.
2. Select Tools > DQL Editor.
3. Run the query to check the existence of the dm_docstatus table:
Select * from dm_docstatus
The result structure must be:
st_object_id
st_docset_id
st_mode
st_last_modified
st_date
4. Run the query to check the existence of the dm_annotation table:
Select * from dm_annotation
The result structure must be:
ann_id
ann_type
ann_value
5. Run the query to check the existence of the dm_object_annotations table:
Select * from dm_object_annotations
The result structure must be:
ann_object_id
ann_index
ann_chronicle_id
ann_confidence
ann_id
Verifying the configuration of the entity
detection server
CIS server needs to communicate with the entity detection server to start the detection process, and
retrieve the entities.
24
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Validating the Installation
To verify the configuration of the entity detection server:
1. On CIS host, open the configuration file <CIS installation directory>/config/cis.properties.
2. Check that the property cis.entity.luxid.annotation_server.host indicates the IP address of the
entity detection server.
Verifying that all services are started
You can verify that all services for the entity detection server have started.
On Windows hosts, the CIS services are installed in the automatic startup mode. You can make sure that
all services are started correctly, and, if not, start them manually or reboot to start them automatically.
To verify the status of the services (Windows hosts):
1. Select My Computer > Manage > Services and Applications > Services.
2. Make sure the service Documentum Content Intelligence Services is started. If not, start it.
3. For the entity detection analysis, make sure that the following services are started:
• Documentum CIS Luxid Admin Server
• Documentum CIS Luxid Xelda MI Server
• Documentum CIS Luxid IDE Server
• Documentum CIS Luxid Annotation Server
• Documentum CIS Luxid Annotation Node
• Documentum CIS Luxid Tomcat Server
• Documentum CIS Luxid Starter (optional)
If you want to start them manually, start the Documentum CIS Luxid Starter service first. This
service starts the other services in the correct order.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
25
Chapter 6
Troubleshooting Installation Issues
This chapter contains the following topics:
•
Modifying the ports for the entity detection server
•
Some Luxid services (4/7) are not started
Modifying the ports for the entity detection
server
The entity detection server requires several ports. If some of them are used when CIS is installed, you
can proceed with the installation and free them afterwards. The following procedure describes how to
modify the default RMI port: 1099 after the installation process.
To modify the default RMI port (1099):
1. Stop the entity detection server. Go to Services and Applications > Services, right-click
Documentum CIS Luxid Starter and select Stop. If the service is not started, select Start and
then stop the service.
On Linux, run the LuxidStarterCmd.sh stop command to stop the service.
2. Navigate to <CIS installation directory>/cis/Temis/Luxid/.
3. Modify the properties files as follows:
File
/adminserver/admin.properties
/IDE/IDEServer.properties
Properties
com.temis.server.rmi.port =
<RMI port>
com.temis.server.rmi.port =
<RMI port>
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
27
Troubleshooting Installation Issues
File
Properties
/node/AnnotationNode.properties
com.temis.server.rmi.port =
<RMI port>
com.temis.admin.server.host =
localhost:<RMI port>
com.temis.lan.ideHost = localhost:
<RMI port>
com.temis.lan.idkHost = localhost:
<RMI port>
com.temis.lan.idcHost = localhost:
<RMI port>
com.temis.lan.tmsHost = localhost:
<RMI port>
/server/AnnotationServer.properties
com.temis.server.rmi.port =
<RMI port>
com.temis.admin.server.host =
localhost:<RMI port>
com.temis.las.ideHost =
localhost:<RMI port>
<RMI port> is the new RMI port number that replaces 1099.
4. Restart the entity detection server. Go to Services and Applications > Services. Right-click
Documentum CIS Luxid Starter and select Start.
On Linux, run the LuxidStarterCmd.sh start command to start the service.
Some Luxid services (4/7) are not started
Problem
Only 4 out of 7 Luxid services are started.
The following error is logged in <installation path for the entity extraction server>\IDE\log\ide.log:
ERROR [Timer-5] server.remote.RemoteBinder () - Some remote objects are no more bound to the
ERROR [Timer-5] server.remote.RemoteBinder () - Unable to export a registry on the localhost
2009-10-02 08:43:18,118
registry on 127.0.0.1:1099.
2009-10-02 08:43:18,134
that accepts requests on port 1099.
Cause
Luxid runs a Java RMI registry on port 1099. If another program stops the port, the registry is stopped,
and Luxid fails.
Resolution
Modify the RMI port as described in Modifying the ports for the entity detection server, page 27.
28
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Chapter 7
Uninstalling Content Intelligence
Services
This chapter contains the following topics:
•
Uninstalling (Windows hosts)
•
Uninstalling (Linux hosts)
•
Downgrading Content Intelligence Services
This chapter contains the procedures for uninstalling Content Intelligence Services.
Uninstalling (Windows hosts)
Before uninstalling CIS, stop the CIS server and the entity detection server as described in the EMC
Documentum Content Intelligence Services Administration Guide.
To uninstall:
1. Select Start > Settings > Control Panel > Add/Remove Programs.
The Add/Remove window appears.
2. In the Change or Remove Programs tab, select Documentum Content Intelligence Services in
the list of software.
3. Click Change/Remove.
The entity detection server component ’Luxid’ is visible as a separate product in the Add/Remove
window. Uninstalling CIS automatically uninstalls this component. Do not try to uninstall it separately
from CIS.
Uninstalling (Linux hosts)
Before uninstalling CIS, ensure that you run the $DOCUMENTUM/CIS/service/cis_service_register
delete command to delete the CIS service.
To uninstall:
1. Navigate to the $DOCUMENTUM/_uninst/cis directory.
2. Run CIS uninstaller: uninstall.bin.
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
29
Troubleshooting Installation Issues
Downgrading Content Intelligence Services
If you plan to install a lower version of CIS after installing and uninstalling CIS version 7.0 or lower,
you must also uninstall other components that were installed with CIS. If you do not uninstall these
components, they are not updated. The following procedure describes which components to uninstall
and the required order.
To uninstall embedded components (Windows hosts):
1. Select Start > Settings > Control Panel > Add/Remove Programs.
The Add/Remove window appears.
2. Uninstall the embedded components as follows:
a. Select Documentum Service Wrapper and click Change/Remove.
b. Select Documentum DFC Runtime Environment and click Change/Remove.
For CIS version 7.1, all components are uninstalled automatically when you uninstall CIS.
30
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
Index
A
S
artifacts
deploy manually, 18
verify deployment, 23
authentication, 17
script
import_docstatus.bat, 10
T
C
components, 5
connection
with the repository, 17
Content Server compatibility, 6
D
DAR file
deploy manually, 18
verify deployment, 23
disk space, 7
dm_annotation, See tables
dm_docstatus, See tables
dm_object_annotations, See tables
docstatus, 10
Documentum Administrator compatibility, 6
tables
verify, 24
U
uninstallation
embedded components, 30
entity detection components, 29
W
Windows requirements, 10
Windows services
verify status, 25
E
entity detection server
modify RMI port, 27
ports, 8
verify configuration, 24
environment variables, 9
G
global registry, 8
J
JMX port, 8, 12
M
migration, 10
P
ports
CIS server, 8, 12
entity detection server, 8, 12
R
repository
connection, 17
enable for CIS, 18, 23
EMC Documentum Content Intelligence Services Version 7.1 Installation Guide
31
Was this manual useful for you? yes no
Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Download PDF

advertising