RelativityOne - Recipes PDF
RelativityOne - Recipes PDF
4/4/2018
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
For the most recent version of this document, visit our documentation website.
Relativity | Recipes - 1
Table of Contents
1 Adding translated text to documents 20
1.1 Recipe overview 20
1.2 Requirements 20
1.3 Directions 20
4.2 Directions 28
5 Coding the first item in a family group with the skip function 36
5.1 Recipe overview 36
5.2 Directions 36
6 Creating a read-only layout to lock produced documents 38
6.1 Recipe overview 38
6.2 Requirements 38
Relativity | Recipes - 2
6.3 Directions 38
6.4 Considerations 42
7 Cluster Visualization QC Workflow 42
7.1 Requirements 42
7.2 Directions 42
8 Creating an application for managing attorney lists 51
11.2 Requirements 56
11.3 Directions 57
11.3.1 Data Load QC 57
11.3.2 Database Stats 58
11.3.3 Document Characteristics 58
11.3.4 Review Prioritization 59
11.3.5 Review Progress/QC 60
Relativity | Recipes - 3
11.3.6 Privilege Review/QC 61
11.3.7 Reviewer Stats 62
11.3.8 Search Term Review 63
11.3.9 Pre-Production QC 64
11.3.10 Post-Production Review 65
11.3.11 Email Threading 66
Language Identification 66
11.3.12 Tips 67
11.3.13 References 67
12 Creating commonly used Pivots 68
12.1 Recipe overview 68
12.2 Directions 68
13 Creating secured saved search folders for multiple groups 68
13.1 Recipe overview 68
13.2 Directions 69
14 Creating static searches using multiple-choice fields 69
14.1 Recipe overview 69
14.2 Requirements 70
14.3 Directions 70
15 Creating training resources tabs 70
Relativity | Recipes - 4
16.3 Directions 73
16.3.1 Information 73
16.3.2 Fields 73
16.3.3 Conditions 73
16.3.4 Sort 73
17 Displaying production fields in a layout 74
19.4 References 80
20 Estimating diskspace utilization for databases and Analytics indexes 80
20.1 Requirements 80
20.2 Directions 80
20.2.1 Workspace sizes 81
20.2.2 Analytics index sizes 81
21 Exporting PDFs using document identifiers as file names 81
Relativity | Recipes - 5
21.1 Recipe overview 81
21.2 Requirements 81
21.3 Directions 82
22 Exporting to HTML for expert witness review 82
22.1 Recipe overview 82
22.2 Requirements 82
22.3 Directions 82
22.3.1 Results 83
22.3.2 Offline coding form 84
23 Filtering to find empty fields 84
23.1 Recipe overview 84
23.2 Directions 84
24 Finding duplicates or near duplicates in a new set that match up to a previous set 85
24.1 Requirements 86
24.2 Directions 86
24.3 References 86
25 Finding emails exchanged within the same domain 86
25.1 Recipe overview 87
25.2 Requirements 87
25.3 Directions 87
Relativity | Recipes - 6
26.3.4 Calculating Recall 96
26.3.5 Additional considerations 97
27 Identifying emails between two or more specific individuals 98
27.1 Requirements 98
27.2 Directions 98
28 Identifying standalone emails 103
Relativity | Recipes - 7
33.2 Requirements 121
33.3 Directions 121
33.3.1 Option 1 121
33.3.2 Option 2 122
34 Mapping Processing Fields 122
34.1 Requirements 122
Relativity | Recipes - 8
37.3.4 Views Setup 132
37.3.5 Layout Setup 132
37.3.6 Search indexes 133
37.3.7 Batches 134
38 Native Imaging - Dithering Algorithm Options 134
38.1 Recipe overview 134
Relativity | Recipes - 9
42.1.2 Components 156
42.1.3 Considerations 157
42.2 Deploying and configuring the solution 157
42.3 Preparing the workspace 157
42.4 Running the solution 157
42.4.1 All Custodians script 158
Relativity | Recipes - 10
45.4.2 Email headers and repeated content 169
45.4.3 Images 169
45.4.4 Numbers 170
45.4.5 Families and the "Four Corners" test 172
45.5 Handling good language / bad example exceptions 173
45.6 Sample Based Learning 174
Relativity | Recipes - 11
50.2 Directions 187
50.2.1 Create saved searches 187
50.2.2 Create or configure an Analytics profile 188
50.2.3 Create a Structured Analytics set 189
50.2.4 Email threading results - reports 191
50.2.5 Setting up an Email Threading Saved Search 192
Relativity | Recipes - 12
53.3 Directions 210
53.3.1 Create a Quality Control layout 210
53.3.2 Batch reviewed documents for QC review 212
53.3.3 Identify total documents put through QC and resulting overturns 213
53.4 Focus QC on Substantive Issues or Privilege 215
53.4.1 Conduct targeted substantive QC 215
Relativity | Recipes - 13
57.1 Recipe overview 224
57.2 Directions 224
58 Searching for handwritten documents 225
58.1 Overview 225
58.2 Requirements 225
58.3 Directions 225
Relativity | Recipes - 14
63 Setting up in an upgraded 7.5 environment 236
63.1 Recipe overview 236
63.2 Requirements 236
63.3 Directions 236
64 Setting up CJK document workspaces in Relativity 237
64.1 Recipe overview 237
Relativity | Recipes - 15
69 Transfer tags from other document review applications 247
69.1 Recipe overview 247
69.2 Directions 247
70 Threading review setup 247
70.1 Requirements 247
70.2 Directions 247
Relativity | Recipes - 16
73.4.1 Considerations for creating Analytics indexes containing non-English languages 257
73.4.2 Considerations for using Structured Analytics with non-English languages 259
73.4.3 Checklist for using Analytics with non-English language and multiple language
workspaces 261
74 Using categorization to find privileged documents 262
74.1 Requirements 262
74.2 Directions 262
Relativity | Recipes - 17
81.1 Requirements 273
81.2 Overview 273
81.3 Directions 273
81.4 Workflow considerations 274
82 Using redactions from other applications 275
82.1 Recipe overview 275
Relativity | Recipes - 18
88.1 Requirements 289
88.2 Directions 289
88.2.1 Publishing or loading additional documents 290
89 Verifying imaged natives 290
89.1 Recipe overview 290
89.2 Requirements 291
Relativity | Recipes - 19
1 Adding translated text to documents
If your documents require translation, this recipe describes one way of including both the original
text and the translated text in your Relativity workspace after you receive the translated documents
from the translation service.
1.2 Requirements
n Relativity security access with the ability to edit/add fields
n Relativity Desktop Client
1.3 Directions
1. Create a field to hold the translated text using the following settings:
n Field type - Long Text
n Available in Viewer - Yes
n Included in Text index - Yes (not required, but helpful when searching with the keyword index)
2. After the documents have been translated and are ready for import, use the Relativity Desktop Client
to overlay the information into the newly created field. The translated text must exist in the load file
and not as separate text files.
3. Index the translated text. The newly added text may not be immediately searchable. The following
three options exist for indexing the text:
n Build a new dtSearch index with only the new translated text.
n Add this field to an existing dtSearch index, and then rebuild.
n Set the Include in Text Index attribute to Yes on the translated text field to add the content to
the keyword index.
4. Configure your workspace to display both the translated and original text:
a. Launch the standalone viewer.
b. Change the Extracted text drop-down to the translated text field you created in step 1.
c. Click the Sync button at the top of the standalone viewer to continuously update this window as
you navigate through documents.
5. If there is an agreement to share the translated text, export the content as you would any other field.
Relativity | Recipes - 20
2 Adding documents to an existing Assisted Review
project
2.1 Recipe Overview
Relativity allows you to add new documents to an Assisted Review project that is already in
progress. This recipe discusses the suggested steps to add documents to a live project.
2.2 Requirements
n Relativity Assisted Review
n Analytics Core
n Analytics
2.3 Directions
To apply a categorized value to new documents, you must complete a round. It is easier to get
immediate results if you have a round in process when beginning these steps. When you finish the
round, the new documents get categorized from the examples already in the system. Generally it
will be a QC round in progress, but any round type except Control Set will work. If you are not in
the middle of a round, begin one and code at least one document so a round can be finished.
1. Load the new documents into the workspace.
2. Ensure the new documents are included in your saved searches for the Analytics index.
3. Navigate to the Indexing & Analytics tab, select Search Indexes and then the Analytics
index.
4. From the Analytics Index console, rebuild the Analytics index to include the new documents.
A full rebuild of the Analytics index may or may not be needed. Consider the following:
n When adding new concepts or types of documents, a full rebuild is recommended.
n If the new documents are more of the existing types of documents and the system does
not need to learn anything new, an incremental population will probably suffice.
5. In the Assisted Review console, click the View Project Settings link. Ensure the saved search
used for Documents to be categorized includes the new documents.
6. Once manual review is completed for the current round and the index is finished rebuilding,
click Finish Round. The new documents will categorize alongside the old ones.
7. Return to the Project Home and verify that the total in the Docs in project column reflects
the additional documents.
8. Continue the Assisted Review project as normal.
Note: Any previously established control set in the project will no longer be in sync with the
document set. To correct this, make a new control set round to replace the previous one after
the existing round is completed.
Relativity | Recipes - 21
3 Advanced workflows with Relativity Legal Hold
When working in Relativity Legal Hold, there may be a need to customize it to fit your unique
workflows. In particular, we discuss tracking custodian information (like employee location),
tracking custodian response information in real time, and creating custom dashboards to visualize
and provide snapshots of your Hold data.
3.2 Requirements
n Relativity 9.4 or higher
n Relativity Legal Hold v4.1.437.2+
3.3 Directions
Below are some workflow suggestions one can employ to customize Relativity Legal hold to cater to
their unique needs and / or for advanced reporting functionality.
Relativity | Recipes - 22
On a similar note, custom fields are used on the Custodian object to track information like
Custodians Office location or company assigned devices and their information. Below is an
illustration of what those fields would look like on a layout.
Relativity | Recipes - 23
When creating the custom fields, layouts, or views, keep in mind the following considerations:
n You should not associate custom fields, layouts, or views with the Relativity Legal Hold application.
n You can enable the Open to Association setting to Yes.
n You can enable the Allow Group By and Allow Pivot settings to Yes.
Relativity | Recipes - 24
3.3.2 Custodian related information
The Custodian Status Tab can be unhidden and the view on the object made available. It’s useful to
report information about the Custodian’s interactions, such as:
n Number of reminders sent
n Numbers of escalations
n Date / Time the reminder was sent
n Date / Time the last escalation was sent
n Date / Time of the last scheduled escalation
n When questionnaire was completed and submitted
n When the Hold (Communication) Notice was sent
n When the Communication was viewed
n If the Hold notice was acknowledged
To create the Custodian Status Tab, fill out the settings as follows:
n Name - Custodian Status
n Link Type - Object
n Parent - Legal Hold
n Object Type - Custodian Status
To create a new View on the Custodian Status Object, fill out the setting as follows:
Relativity | Recipes - 25
n Name - Custom View – Custodian Status
n Object Type - Custodian Status
n Fields - Select the fields of your choice from the following recommendations:
o Project
o Escalation Sent
o Reminders Sent
o Acknowledgment Status
o Questionnaire Status
To create a new View on the Custodian Role Object, fill out the settings as follows:
n Name - Custom Role – Custodian Status
n Object Type - Custodian Role
n Fields - Select the fields of your choice from the following recommendations:
o Role
o Project
o Release Date
o Custodian:: Email
3.3.3 Reporting
Dashboards provide a way to make information available in Relativity realtime, providing self-
service with very little training. Layering the Dashboards with custom views make it a powerful
reporting aid. Below are a few examples of custom Dashboards that address reporting needs.
Relativity | Recipes - 26
3.3.3.2 Custodian Object - Custodian Information Dashboard:
Name Group By Pivot On Type
Employee Status (Custom field) by Department <Grand Bar Chart (OR) Pie
Department Total> Chart
Employment Start Date Employment Start <Grand Line Chart
Date Total>
Custodian Office Office <Grand Bar Chart (OR) Pie
Total> Chart
3.4 References
Relativity Legal Hold
Relativity | Recipes - 27
4.1 Requirements
n An active Analytics index
n An existing cluster
4.2 Directions
This recipe involves a scenario where the case team has performed a first-level review for
responsiveness. Cluster visualization can help QC the responsiveness coding. The approach
described here could also be used to QC privilege coding or issue coding.
1. Ensure that the reviewed and coded documents are included in an existing cluster set.
2. From the Documents tab, select the cluster set in the cluster browser and click Visualize Cluster.
Relativity | Recipes - 28
3. In the search panel, click Add Condition and select the responsiveness coding field. Create a condition
that will return documents coded as responsive. Click Apply, then click Run Search.
Relativity | Recipes - 29
4. The system displays a heat map overlay on the cluster visualization that indicates the percentage of
documents coded as responsive in each cluster. The darker the shading, the higher the concentration
of responsive documents in the cluster.
5. Throughout this scenario, you can focus in on the documents in a cluster by right clicking the cluster
and clicking Select, then Apply. You can select multiple clusters by doing this for each cluster you want
to focus on.
Relativity | Recipes - 30
6. The main cluster visualization view (the circle pack view) provides the following visual clues to aid in
your QC review:
a. Darker clusters – The darker shading indicates that a cluster contains a high percentage of doc-
uments coded as responsive, meaning that the documents in the cluster have been coded con-
sistently. Since documents within a cluster are conceptually similar, we would generally expect
them to be treated similarly, and this is what the darker shading tells us.
Relativity | Recipes - 31
b. Lighter clusters – The lighter shading indicates that only a small percentage of documents in the
cluster have been coded as responsive. The documents in these lighter shaded clusters would
warrant further investigation to determine why only a small portion of conceptually similar doc-
uments were coded as responsive.
Relativity | Recipes - 32
7. Continue your QC review by right clicking a darker shaded cluster (one with a high percentage of
responsive documents) and selecting View Nearby Clusters from the right click menu.
Relativity | Recipes - 33
8. The Nearby Clusters visualization places the selected cluster in the middle of the screen and shows you
other clusters that are conceptually similar to it. The closer a cluster is to the center cluster, the more
conceptually similar it is.
Relativity | Recipes - 34
Along with the circle pack, the clusters can be visualized on the dial by right clicking and selecting View
Dial.
9. The Nearby Clusters visualization reveals the following additional insights to help guide your QC review:
a. Darker clusters – We would expect clusters that are nearby, or conceptually similar to, the center
cluster to also have a darker shading, indicating that they too contain a high percentage of doc-
uments coded as responsive.
b. Lighter clusters - Clusters with a lighter shading would probably warrant additional investigation.
The system is telling us that these clusters are conceptually similar to the dark cluster in the
middle (which contains a high percentage of responsive documents). However, only a small por-
tion of the documents in these lighter clusters have been coded as responsive; we might want to
investigate why this is. A cluster that is completely white contains no responsive documents, yet
is conceptually similar our center cluster, and should also be investigated.
Relativity | Recipes - 35
Note that with the Responsiveness filter set and active, only documents coded as responsive
are returned. If you would like to access all the documents in a selected cluster, simply clear
the box on the filter card, click Run Search, and all documents within a cluster will be
returned.
5.2 Directions
1. Create a new saved search and name it Responsiveness is Responsive and Family.
2. Choose the following search information and conditions:
n Set Includes to Include Family.
n Set Scope to Entire Workspace.
Relativity | Recipes - 36
n Set Field to Responsiveness, Operator to any of these, and Value to Responsive.
Relativity | Recipes - 37
5. Use the Skip function with this view. After the first item in a family group is coded, even without
propagation set, the rest of the family group is removed from the review set without being coded. This
way you can manually code each document without the need for a first level review.
6.2 Requirements
n Relativity security access
o Fields: Add
o Object rules: Add
o Ability to build layouts
6.3 Directions
Perform the following steps to create a read-only layout:
1. From the Layouts tab, create a new layout with the following settings to display the documents that
have been coded:
a. Object type: Document
b. Name: Previously Produced
c. Order: 10
d. Enable Copy From Previous: No
Alternatively, you can copy the original coding form and change only the fields that should not
be altered after production to read-only.
Relativity | Recipes - 38
2. From the Fields tab, create a new field with the following settings to indicate which documents need to
be locked down:
a. Object type: Document
b. Name: Produced and locked
c. Field type: Single choice
d. Available in field tree: Yes
Relativity | Recipes - 39
3. From the Choices tab, add a single choice field with the following settings:
a. Field: Produced and Locked (Document)
b. Name: Yes
Relativity | Recipes - 40
4. From the Administration tab, select Object type and select the Document object.
5. In the Rules section, click New and create a new rule with the following settings:
a. Rule type: Default layout
b. Name: Production lock documents
c. Field: Produced and locked
d. Value: Yes
e. Action: Previously produced
f. Leave User can select another layout unchecked.
Relativity | Recipes - 41
6. Once you've completed steps 1-5, a system admin must switch the produced documents choice in the
Produced and Locked field to Yes.
7. Examine the records. In the layout drop-down, there should only be the read-only version.
6.4 Considerations
n Make sure your reviewers don't have rights to edit layouts, as they can then switch read-only to No.
n This doesn't work on Mass Edit operations, and you must limit that right for users.
7.1 Requirements
n An active Analytics index
n An existing cluster
7.2 Directions
This recipe involves a scenario where the case team has performed a first-level review for
responsiveness. Cluster visualization can help QC the responsiveness coding. The approach
described here could also be used to QC privilege coding or issue coding.
Relativity | Recipes - 42
1. Ensure that the reviewed and coded documents are included in an existing cluster set.
2. From the Documents tab, select the cluster set in the cluster browser and click Visualize Cluster.
Relativity | Recipes - 43
3. In the search panel, click Add Condition and select the responsiveness coding field. Create a condition
that will return documents coded as responsive. Click Apply, then click Run Search.
Relativity | Recipes - 44
4. The system displays a heat map overlay on the cluster visualization that indicates the percentage of
documents coded as responsive in each cluster. The darker the shading, the higher the concentration
of responsive documents in the cluster.
5. Throughout this scenario, you can focus in on the documents in a cluster by right clicking the cluster
and clicking Select, then Apply. You can select multiple clusters by doing this for each cluster you want
to focus on.
Relativity | Recipes - 45
6. The main cluster visualization view (the circle pack view) provides the following visual clues to aid in
your QC review:
a. Darker clusters – The darker shading indicates that a cluster contains a high percentage of doc-
uments coded as responsive, meaning that the documents in the cluster have been coded con-
sistently. Since documents within a cluster are conceptually similar, we would generally expect
them to be treated similarly, and this is what the darker shading tells us.
Relativity | Recipes - 46
b. Lighter clusters – The lighter shading indicates that only a small percentage of documents in the
cluster have been coded as responsive. The documents in these lighter shaded clusters would
warrant further investigation to determine why only a small portion of conceptually similar doc-
uments were coded as responsive.
Relativity | Recipes - 47
7. Continue your QC review by right clicking a darker shaded cluster (one with a high percentage of
responsive documents) and selecting View Nearby Clusters from the right click menu.
Relativity | Recipes - 48
8. The Nearby Clusters visualization places the selected cluster in the middle of the screen and shows you
other clusters that are conceptually similar to it. The closer a cluster is to the center cluster, the more
conceptually similar it is.
Relativity | Recipes - 49
Along with the circle pack, the clusters can be visualized on the dial by right clicking and selecting View
Dial.
9. The Nearby Clusters visualization reveals the following additional insights to help guide your QC review:
a. Darker clusters – We would expect clusters that are nearby, or conceptually similar to, the center
cluster to also have a darker shading, indicating that they too contain a high percentage of doc-
uments coded as responsive.
b. Lighter clusters - Clusters with a lighter shading would probably warrant additional investigation.
The system is telling us that these clusters are conceptually similar to the dark cluster in the
middle (which contains a high percentage of responsive documents). However, only a small por-
tion of the documents in these lighter clusters have been coded as responsive; we might want to
investigate why this is. A cluster that is completely white contains no responsive documents, yet
is conceptually similar our center cluster, and should also be investigated.
Relativity | Recipes - 50
Note that with the Responsiveness filter set and active, only documents coded as responsive
are returned. If you would like to access all the documents in a selected cluster, simply clear
the box on the filter card, click Run Search, and all documents within a cluster will be
returned.
8.2 Requirements
n Relativity Desktop Client (RDC)
n Microsoft Excel attorney list (this can be saved as a CSV file)
8.3 Directions
1. Create a new object in your workspace called Attorney. You may wish to place this new tab under an
Attorney List parent tab.
2. A Name field will automatically be created on the Attorney object. Change the Name field to Last
Name, First Name.
3. Click New Field to create the following single-choice fields on the Attorney object:
a. Alphabet
Relativity | Recipes - 51
4. Edit the Attorney tab view and layout to include the new fields.
5. In Excel, create or modify the attorney list you wish to import.
a. Name the columns in the spreadsheet the same as the fields created above.
b. The column that corresponds with the Alphabet field should be populated with the first letter of
each attorney’s last name. This will facilitate sorting and quick access to the list for your review-
ers.
Relativity | Recipes - 52
9.2 Requirements
n Relativity Analytics 6.10-7.5
n Existing decimal-type field
9.3 Directions
1. Run “Set Extracted Text Size” Script - The Set Extracted Text Size script is included with every instance
of Relativity . If it’s not already part of your workspace, select the script from the library and run it.
Select the field to hold the data, and click run. This will apply a number to your decimal field to hold the
text size in KB.
2. Saved Search Setup - Relativity Analytics indexes use saved searches to determine the document set
for the index. There are two default searches in the index setup. These searches will remove documents
without text, as well as documents with more than 2 MB of extracted text, for the training set. To optim-
ize the index, you will usually need to remove additional documents. The following types of documents
do not typically return desired results, and increase the size and build time of the index:
n Documents with OCR errors, such as “OCR not processed”
n Documents with poor-quality OCR, such as OCR on handwritten documents
n Compressed files, such as ZIP or RAR files
n System files
n Image files
n CAD drawings
n Maps
n Calendar items
n Excel files with minimal text (numbers and symbols are not conceptually searchable)
We only remove documents with more than 2 MB of text from the training set search. These
documents can have large amounts of text and many concepts, which make them difficult for the
system to index. For documents that have errors such as “OCR not processed,” we need to create our
own searches.
To optimize the index, we will create two saved searches—the training set and searchable set—instead
of using the defaults. The searchable set includes all documents that need to be searched, minus the
items listed above. The training set is a copy of the searchable set, but will limit the size of extracted text
to items less than 2 MB.
a. If you find other document groups that do not have helpful text, criteria can be added to these
searches later.
b. The only field to be returned in the saved searches will be the Extracted Text field.
c. Concept searching works by finding repeated word content across documents. This means that
metadata that repeats across documents, such as author and recipient, will bring emails
together conceptually because of that metadata, and not because of the actual content in the
body of the documents. For that reason, you should be mindful that email headers are con-
tained within the Extracted Text field—which brings us to the next section.
3. Filtering - Filters remove extraneous data that should not be conceptually indexed or searched. For
example, you can use filters to ensure that repeating header or footer information is not included in the
index to avoid the interference mentioned above. Note that filters don’t permanently remove data.
Rather, they inhibit data from being included in the indexes. There are five filter types:
Relativity | Recipes - 53
n Email – Removes email header information throughout email-formatted text
n OCR – Uses groupings of three characters to determine if the text to be indexed contains valid
words; this cannot be edited
n Go Words – Removes bad OCR by comparing all words against a lengthy dictionary; this dic-
tionary file can be edited
n Repeated Content – Removes blocks of text, such as email footer information
n Java Regular Expressions – Identifies blocks of formatted text that can be removed from an index
10.1 Requirements
n Relativity Analytics
n Existing decimal-type field
10.2 Directions
1. The key search is based on text size. Your processing tool might provide the text size. If not, run the Set
Extracted Text Size script. This updates a decimal field with the text size in KB.
2. Create a saved search of the data and (name it Analytics Data Search or something similar).
n Bring back all text less than 30 MB which translates to “Extracted Text Size is less than 30,000 and
Extracted Text Size is greater than 0. (This will return documents that have had this script run
and nothing too large that will encumber the system.)
n Ensure that the Extracted Text field is the only one returned.
Relativity | Recipes - 54
3. Run Structured Analytics > Repeated Content Identification on your Analytics Data Search.
n Set the Minimum number of occurrences to 0.5% of the total population. Leave the rest of the
settings to the default.
n Scan the results of repeated content identification to ensure that the repeated text blocks cor-
respond to unwanted (non-authored) content.
4. Create a new Analytics profile:
n Set the Email header filter to Yes.
n Set OCR and Go Words to No.
n If you intend to run email threading, you can match up fields for Structured Analytics email
threading.
n Save the Analytics profile and link any repeated content filters which comprise non-authored con-
tent.
5. Create your Analytics index
n Use the Analytics Data Search created above for both training and searchable (unless it is for a
very large workspace).
n Select the Analytics profile created above.
n Set Optimize Training Set and Auto Remove Signatures to Yes.
6. Build your index.
1. The key search is based on text size. Your processing tool might provide the text size. If not, run the Set
Extracted Text Size script. This updates a decimal field with the text size in KB.
2. Create a saved search of the data and (name it Analytics Data Search or something similar).
n Bring back all text less than 30 MB which translates to “Extracted Text Size is less than 30,000 and
Extracted Text Size is greater than 0. (This will return documents that have had this script run
and nothing too large that will encumber the system.)
n Ensure that the Extracted Text field is the only one returned.
3. Run Structured Analytics > Repeated Content Identification on your Analytics Data Search.
n Set the Minimum number of occurrences to 0.5% of the total population. Leave the rest of the
settings to the default.
n Scan the results of repeated content identification to ensure that the repeated text blocks cor-
respond to unwanted (non-authored) content.
4. Create your Analytics index
n Use the Analytics Data Search created above for both training and searchable (unless it is for a
very large workspace).
n In the Advanced Settings, ensure that you set Optimize Training Set, Remove English Sig-
natures and Footers, and Email Header Filter to Yes.
n Link the appropriate Repeated Content filter results from Step 3 above.
Note: If you have a large amount of Repeated Content results, typically the results which are most
occuring and with the most words will suffice.
Relativity | Recipes - 55
n Email Header Filter - removes email headers throughout email-formatted text.
n Repeated Content - removes blocks of text, such as email footer information. Instances of repeated
content may be detected and converted into filters automatically by a structured analytics set.
n Optimize Training Set - this option automatically excludes poor quality index training documents
based on an analysis of their text. Commonly excluded content includes extremely large documents
and files predominated by tables of numbers, long garbage strings of characters, and a preponderance
of symbols (rather than words).
n Auto-removal of email signatures and footers - this option automatically excludes email signatures
and footers when it finds them (although it is intentionally conservative to avoid over-exclusion. This
feature works for English signatures and footers only.
n Remove English Signatures and Footers - this option excludes email signatures and footers when it
finds them (although it is intentionally conservative to avoid over-exclusion). This feature works for Eng-
lish signatures and footers only.
11.2 Requirements
Relativity 9.3 or higher
Relativity | Recipes - 56
11.3 Directions
Use the following dashboards as examples to generate ideas for dashboards you may want to create
in your cases:
n Data Load QC below
n Database Stats on the next page
n Document Characteristics on the next page
n Review Prioritization on page 59
n Review Progress/QC on page 60
n Privilege Review/QC on page 61
n Reviewer Stats on page 62
n Search Term Review on page 63
n Pre-Production QC on page 64
n Post-Production Review on page 65
n Email Threading on page 66
n Language Identification on page 66
n Tips on page 67
Relativity | Recipes - 57
11.3.2 Database Stats
To create the Database Stats dashboard, group by and pivot on the following fields:
n Group by Native Type
n Group by Extension
n Group by Sort Date
n Group by Key Term STR
Note: You can collapse your widgets (Count of File Type highlighted below). If you save your dashboard
with a widget collapsed, it will remain collapsed.
Relativity | Recipes - 58
11.3.4 Review Prioritization
To create the Review Prioritization dashboard, group by and pivot on the following fields:
n Cluster Visualization Widget
n Group by Key Term STR
n Group by Custodian
Relativity | Recipes - 59
11.3.5 Review Progress/QC
To create the Review Progress/QC dashboard, group by and pivot on the following fields:
n Group by Responsive Designation
n Group by Privilege
n Group by Issues
n Group by Key Term STR
n Group by Custodian
n Group by File Extension
Relativity | Recipes - 60
11.3.6 Privilege Review/QC
To create the Privilege Review/QC dashboard, group by and pivot on the following fields:
n Group by Privilege
n Group by Privilege Designation
n Group by Privilege Terms STR
n Group by Custodian
Relativity | Recipes - 61
11.3.7 Reviewer Stats
To create the Reviewer Stats dashboard, group by and pivot on the following fields:
n Group by Reviewed By
n Group by Reviewed By and Pivot on Reviewed On
n Group by File Extension
Note: This was created using the Track Document Field Edits by Reviewer application, found in the
Relativity Community. It was set up to track the first edit on the Responsive Designation field. If you aren't
setting up this application in your template, you'll need to create this dashboard in your workspace, once
you’ve installed the application and created the fields.
Relativity | Recipes - 62
11.3.8 Search Term Review
To create the Search Term Review dashboard, group by and pivot on the following fields:
n Group by Responsive Designation
n Group by Key Term STR
n Group by Issue Coding STR
n Group by Issues
n Group by Privilege
n Group by Privilege Terms STR
Relativity | Recipes - 63
11.3.9 Pre-Production QC
To create the Pre-Production QC dashboard, group by and pivot on the following fields:
n Group by Custodian
n Group by Responsive Designation
n Group by Privilege
n Group by Key Term STR
n Group by Has Images
n Group by Issues
Relativity | Recipes - 64
11.3.10 Post-Production Review
To create the Post-Production Review dashboard, group by and pivot on the following fields:
n Group by Custodian and pivot on Production::Production Name
n Group by Production Type
n Group by Production::Production Name
Note: Currently the production fields are not enabled for group by/pivot on and the application is locked.
In future releases we plan to have the fields enabled for grouping and pivot in the production application.
Relativity | Recipes - 65
11.3.11 Email Threading
To create the Email Threading dashboard, group by and pivot on the following fields:
n Group by Inclusive Email
n Group by Inclusive Reason
n Group by Custodian
n Group by Date Sent (year)
Note: We recommend using your Email Threading view with this dashboard.
Language Identification
To create the Language Identification dashboard, group by and pivot on the following fields:
n Group by Primary Language
n Group by Docs_Languages::Language
n Group by Custodian
Note: We recommend using your Language Identification view with this dashboard.
Relativity | Recipes - 66
11.3.12 Tips
n Create a default Document List dashboard (lowest order number) with only the item/document list, no
additional widgets.
n You can only have one Cluster Visualization widget on a dashboard.
n You can create dashboards on other objects.
n Dashboards can be created in your template, so they carry over to your new workspaces. However,
some dashboards, such as those with search term report widgets or cluster visualization, will need to
be created in the individual workspace once you’ve created your clusters and/or STRs.
11.3.13 References
n Dashboards
n Dashboard and Pivot quick reference
n Advice Hub solution:Track document field edits by reviewer
n Dashboards
n Dashboard and Pivot quick reference
n Advice Hub solution:Track document field edits by reviewer
n Dashboards
n Dashboard and Pivot quick reference
n Advice Hub solution:Track document field edits by reviewer
Relativity | Recipes - 67
12 Creating commonly used Pivots
Relativity Pivot is a powerful feature that can help you visualize your data set in a digestible way.
Through interactive charts and graphs, Pivot allows you to gain insight into trends and patterns in
your case. The following suggestions describe how to make the most of Pivot.
12.2 Directions
Set the following fields as suggested to create a number of useful Pivot charts.
Group by Pivot on Description
Custodian Sent date Review sent dates of emails for each custodian to verify a date range is not
missing from the collection.
Custodian File type Verify all documents were collected, including emails and loose files.
Custodian STR-Keywords Examine the count of keywords that occur by custodian.
Sent date Issues Examine the occurrence of issues over a time period.
Batch::Assigned Responsiveness View reviewer progress of responsive items across batches.
to
Cluster STR-Keywords Find out which clusters or document groups contain the most interesting
documents based on keywords.
Author Recipient Dive into internal emails, potentially privileged documents, or find per-
domains domains sonal emails received at work.
Author Issues or STR- Determine the most important domains where issues or keywords
domains Keywords appear.
Relativity | Recipes - 68
13.2 Directions
Set workspace security as follows:
Group Security Setting Comment
1 & 2 Search: View This ensures users in these groups can access the saved search browser, but can't
save searches outside of their assigned folders.
1 & 2 Search Folder:
View
1 & 2 Saved Search
Browser Access
1. Create two folders in the saved search browser, one for each group.
2. Secure the first search folder to remove Group 2, and then set permissions for Group 1 as follows:
Group Security Comment
Setting
1 Search: Edit This allows the user to save searches only in this folder. You can substitute Delete
and Add for the Edit permission if you want to allow the users to delete their own saved
searches or sub-folders.
1 Search
Folder: Edit
and Add
3. Repeat step 2 for the search folder for Group 2.
Note: If you need to make any modifications to either the folders or saved searches within the secured
folders, you must address these changes individually at the folder level and not in workspace permissions.
Relativity | Recipes - 69
14.2 Requirements
n Mass updates
n Relativity security access with the following permissions:
o Fields: edit/add
o Choices: edit/add
o Document: edit
14.3 Directions
1. Create a new multiple choice field with the following settings to hold the items for each search.
n Object Type - Document
n Name - Static Searches
n Field Type- Multiple Choice
n Available In Field Tree- Yes
2. Add a choice to this field, and then name it after the appropriate search.
3. Build, and then run the desired search.
4. Mass update the Static Searches field with the choice you created in step 2.
5. Review the Field Tree and the Static Searches field to view your new search choices and results.
Relativity | Recipes - 70
d. Set Parent to Training Resources.
e. Set Link to https://www.relativity.com/ediscovery-training/.
4. Click New Tab and create the Documentation tab:
a. Name the tab Documentation.
b. Set Order to 20.
c. Set Link Type to External.
d. Set Parent to Training Resources.
e. Set Link to the appropriate URL based on your Relativity version:
n RelativityOne: https://help.relativity.com/RelativityOne
n 9.5: https://help.relativity.com/9.5/
n 9.4: https://help.relativity.com/9.4/
n 9.3: https://help.relativity.com/9.3/
n 9.2: https://help.relativity.com/9.2/
n 9.1 – 9.0: https://help.relativity.com/9.0/
n 8.2: https://help.relativity.com/8.2/
n For earlier versions, locate the appropriate link on the documentation archives page:
https://help.relativity.com/9.3/Content/Resources/Shared/Documentation_archives.htm
5. Click New Tab and create the Recipes tab:
a. Name the tab Recipes.
b. Set Order to 30.
c. Set Link Type to External.
d. Set Parent to Training Resources.
e. Set Link to the appropriate URL based on your Relativity version:
n R1: https://help.relativity.com/RelativityOne/Content/Site_Resources/RecipesHome.htm
n 9.5: https://help.relativity.com/9.5/Content/Site_Resources/RecipesHome.htm
n 9.4: https://help.relativity.com/9.4/Content/Site_Resources/RecipesHome.htm
n 9.3: https://help.relativity.com/9.3/Content/Site_Resources/RecipesHome.htm
n 9.2: https://help.relativity.com/9.2/Content/Site_Resources/Home.htm
n 9.1 – 9.0: https://help.relativity.com/9.0/#../Subsystems/Default/Content/Site_Resources/Re-
cipesHome.htm
n 8.2: https://help.relativity.com/8.2/#../Subsystems/Default/Content/Site_Resources/Re-
cipesHome.htm
n For earlier versions, locate the appropriate link on the documentation archives page:
https://help.relativity.com/9.5/Content/Resources/Shared/Documentation_archives.htm
Relativity | Recipes - 71
15.2.1 Directions
1. Visit the Relativity Community and search for RA_Training*.
2. Download the training application.
3. From the Applications tab in either the workspace or admin mode, install the application to your
desired workspace(s).
4. Once the application installs successfully, a new tab, Training Resources, appears in the work-
space(s) you selected. You will also see the sub-tabs for Tutorials, Videos, and Documentation.
Here, you can navigate to these tabs to access training resources without leaving Relativity.
16.2 Requirements
n Workspace access with permission to the following tabs:
o Administration | Views
Relativity | Recipes - 72
16.3 Directions
Create a view with the following settings:
16.3.1 Information
View information:
n Object Type: Document
n Name: Documents
n Owner: Public
n Order: 10
Other:
n Group Definition: Family Group (Family related items field)
n Relativity Applications: none
n Query Hint: leave blank
n Visible In Dropdown: Yes
16.3.2 Fields
Suggested fields - Include Edit, File Icon, Control Number, Group Identifier, File Name (or Name
or Unified Title), Record Type, and any additional fields you want to place on the view
16.3.3 Conditions
Set conditions - None
16.3.4 Sort
It is necessary to sort based on the related item field. It might be Group Identifier or Family Group.
Then how the families are displayed is optional.
n Control Number
n Family Date - Use the Propagate Sent Date to Family Documents script to set values for this field.
Save the View.
Final View:
Relativity | Recipes - 73
Note: The screen shot is not intended to show as indented (with dots), but to illustrate the view you see
when you use the indented list method to group the families together. Notice in the view that each family is
now divided by bolded lines.
17.2 Requirements
n Relativity 9.3 and higher
n Access to Views and Layouts, to include add and edit permissions
Relativity | Recipes - 74
17.3 Directions
To display your production related fields in a document object layout:
1. Go to the Views tab or your view drop down in the Folder Browser pane to create your new view.
2. Create the view on the production Information object.
3. Select the production fields you would like to display in the layout, such as Begin Bates, End Bates,
Begin Attachment, and End Attachment. In addition, you can select a condition, such as Begin Bates is
set and you can apply a sort order on the view, however a condition and sort order are not required.
4. Now that you have your view created, navigate to the layout in which you want to display the pro-
duction information object list.
5. Click the pencil icon ( ) to edit the layout, or go to the Layouts tab and select Build Layout on the
appropriate layout.
6. Select Add Object List from the Add Category drop down.
7. In the Object drop down, select the Production Information (Production) object. The View will default
to the new view you created on the Production Information object. If it does not, you can select it in
Relativity | Recipes - 75
the View drop down.
Relativity | Recipes - 76
18 Email threading and near dupe - workflow
alternatives
Relativity workspaces often have a need for email threading and document near dupe detection.
System admins need to decide whether both functions should be addressed together or
independently.
18.2 Requirements
n Workspace & system admin rights
18.3 Directions
The following sections contain three workflow alternatives to help system admins who decide to run
a Structured Analytics Set (SAS) involving Email threading and/or Near Dupe detection on a data set.
18.3.1 Running the Email Threading & the Near Dupe in a single SAS
Use this workflow to identify email duplicate spares independent of the duplicate loose documents.
In other words, use this where the case demands that you handle the workflow for emails and loose
files separately. If both options are selected on a single set, only run near duplicate identification
against non-emails (attachments and loose files).
Once you run the SAS, you can create a document view or a batch of documents that contain the
documents flagged inclusive (Inclusive Email = Yes) and marked as non duplicate email (Email
Duplicate Space = No) for email documents.
You can then identify the loose documents as the documents flagged as Text Duplicate Principle
(Text Near Duplicate Principal = Yes).
Relativity | Recipes - 77
18.3.3 Running only Near Dupe in an SAS.
Use this workflow to identify duplicates whether they are emails or loose files. For example:
n Two parties in discovery produce documents. There is an agreement that each is to review documents
that were outside the set provided to the opposite side. In this case, the workflow requires you treat all
documents as loose documents and identify all near duplicate groups of single documents (just a Near
Duplicate Principal and no other documents in the group). You can then isolate all such documents via
a saved search, placed into a singular view or into review batches.
n A case team must identify duplicates across two or more groups of produced documents. Running the
Near Dupe process would be ideal. This allows the team to review only the necessary documents—the
Near Duplicates—while the other documents in the group are the duplicates.
19.2 Requirements
n Relativity 9.4 and above
n Structured Analytics
19.3 Directions
There are two ways to isolate the threads that have missing emails.
To use workflow suggestion number 1, complete the steps below:
1. Add the Email Threading ID field to the Email Threading view.
2. Filter for the minus sign to view the total threads with missing items.
3. Save those documents as a search.
4. Tag them as partial threads.
To use workflow suggestion number 2, create a widget to view the totals for partial versus complete
threads using the steps below:
Relativity | Recipes - 78
1. Create a single choice field called Email Threads and allow pivot on and group by.
2. Create two choices on the new field:
1. Partial Thread
2. Complete Thread
3. Create a search for documents where the Email Thread group is set and return these fields:
1. Email Threads
2. Email Threading ID
4. Run the search.
5. Filter on Email Threading ID for the minus sign.
1. Tag documents returned as Partial Thread.
2. Then filter for Email Threads field is not set.
Relativity | Recipes - 79
2. Visualize the totals:
19.4 References
n Structured Analytics
n Structured Analytics
n Structured Analytics
20.1 Requirements
n Workspace with system admin-level access
20.2 Directions
The rule of thumb measurements and methods below help system admins estimate the size of their
Relativity database and associated Analytics indexes.
Relativity | Recipes - 80
It’s noteworthy that these metrics have a high standard deviation and tend to scale up or down
linearly. Every data set is different, so please note that this is only an average. Files will vary in size
considerably.
21.2 Requirements
n Workspace access with the following permissions:
o Search edit/add
o Corresponding search browser
Relativity | Recipes - 81
n Relativity Desktop Client
21.3 Directions
1. Create a saved search with the documents to export.
2. Confirm all documents in the saved search have images created.
3. Use the Relativity Desktop Client to export the saved search.
a. Select Tools | Export | Saved Search.
b. Select the location for the export.
c. Under the Image section, select the following settings:
n Data File Format - Select either Opticon or IPRO
n File Type - PDF
d. Under the Metadata section, select the following settings:
n Data File Format - Select the format that suits your needs.
n Data File Encoding - Select the encoding that suits your needs.
The export consists of two load files and PDF files named by the Document Identifier.
22.2 Requirements
n Workspace access with the following permissions:
o Saved search
o Print to TIFF
o Export to File mass action
22.3 Directions
1. Create a saved search containing the documents you would like to review.
2. Image the documents.
Relativity | Recipes - 82
Note: To create PDF files—which may be easier to review—make sure you have TIFF images
available.
Note: Fields are automatically brought over from the saved search.
22.3.1 Results
The result is an HTML document that includes all of your selected fields and hyperlinks to the image
files.
Relativity | Recipes - 83
22.3.2 Offline coding form
If your expert witnesses need to make any comments while reviewing these records, create a CSV
file export from the saved search.
1. Edit your search to show the control number.
2. Using mass operations, export this list of control numbers to a CSV file.
3. Track their comments in this file.
4. Overlay this information into a new document field.
n Exporting with the RDC
n Saved search
23.2 Directions
1. Under the Administration tab, select Fields.
2. Click Edit for the desired text field.
3. Change the Filter Type to CustomOnly.
Relativity | Recipes - 84
5. Select (Advanced…) from the field filter’s drop-down to activate the CustomOnly filter.
6. Select is not set from the value drop-down and leave the text query field blank. Select is not like from
the value drop-down and leave the adjacent field blank.
Relativity | Recipes - 85
24.1 Requirements
n Structured Analytics
n Saved searches
24.2 Directions
1. Define the old and new datasets by marking a field or using the prefix of the ID.
2. Create a Structured Analytics set of Near Duplicate items across old and new data.
3. Create a saved search of the tagged items in the old document set and include the condition that Tex-
tual Near Duplicate Group is set. Add the include Near Duplicate Items from the Relational Items drop
down to see both the old information and new data that contains near duplicates of the original data.
4. Create a second search with the first condition as (Saved Search) = First Search. The second condition
is to return all the new documents.
5. This search returns documents that are in the new set which match up as duplicates or near duplicates
of the requested items in the first set. You can include the Near Duplicates to this search to see both
the original and new set items.
24.3 References
n Structured Analtyics
n Searching
n Structured Analtyics
n Searching
n Structured Analtyics
n Searching
n Structured Analtyics
n Searching
n Structured Analtyics
n Searching
n Structured Analtyics - RelativityOne
n Searching - RelativityOne
Relativity | Recipes - 86
25.1 Recipe overview
This recipe describes how to use a transform set to parse email domain information from your data
and use Pivot to analyze the volume of communications between or among companies.
25.2 Requirements
n Relativity security access:
o Field tab with add/edit permission
o Object type tab with add/edit permission
o Transform set tab with add/edit permission
o Transform add/edit permission
o Use Pivot/Chart permission
25.3 Directions
1. From the Administration tab, click the Object Type sub-tab.
2. Click New Object Type and create a new object type with the following settings:
a. Name: Domain
b. Leave all other fields as default
3. Create a new field with the following settings to hold the domain information:
n Object type: Document
n Name: Recipient Domains
n Field type: Multiple Object
n Associative Object Type: Domain
n Allow Group By: Yes
n Allow Pivot: Yes
n Leave all other fields as default
Note: The object field type must be Multiple Object for the transform set to run.
Email To::is set
o
Relativity | Recipes - 87
n Name: Domains
n Handler: Domain Parsing
n Data Source: Domains Saved Search (the saved search you set up in the previous step).
7. Click New next to the transform and enter the following:
n Transform set: Domains
n Name: Author Domains
n Source field: Field that stores the author email address (e.g., Email From or Email Author)
n Destination field: Author Domains field created above (where Object Type = Document)
8. Click New next to the transform again and enter the following:
n Transform set: Domains
n Name: Recipient Domains
n Source field: Field that stores the Recipient email address (e.g., Email To or Recipients)
n Destination field: Recipients Domains field created above (where Object Type = Document)
You can repeat this process with the CC and BCC field domains in the Recipient Domains field. You just
have to create a transform to do so. For this example, we concentrate on just the two fields we already
set up.
9. Click Start Transform: Full on the Transform Set Console. This runs through the workspace and
extracts domains from the email address field. When this process is finished, the status changes to
Completed.
Now you can use Pivot to determine the overlap between the two fields containing the author and
recipient domains.
10. From the Documents tab, click the Pivot icon.
a. Turn on Grid and List.
b. Group by Author domains and Pivot on Recipient domains.
Relativity | Recipes - 88
If you scroll to the author domain enron.com and then go across to the recipient domain enron.com,
you can see in our example that 68 items are from and to people within the Enron domain. You can
also review those 68 documents from the list view beneath the grid.
Relativity | Recipes - 89
26 How to calculate Precision and Recall without a
control set
26.1 Recipe overview
Relativity will automatically calculate Precision and Recall for Assisted Review projects that have a
control set. However, the control set is an optional round type, and not recommended for all
projects, specifically (though not limited to) situations where documents continue to be added or
removed throughout the process.
There are situations, however, where you might wish to calculate these values without a control
set, or alternatively other instances where you feel that your control set is not a reliable model of
your document universe.
Although this calculation is not performed for you automatically under these conditions, the
process is very simple, requires just a few fields on a view, some filtering, and very basic math.
26.2 Requirements
n Relativity Assisted Review
n Analytics
n View / Edit permission
Recall is the measurement of completeness. Ask yourself: Of everything Responsive that the system
was supposed to find, how much did it actually find?
The equation:
The following table illustrates the meaning of terms like “True Positive”:
Relativity | Recipes - 90
Consider the illustration below:
n The entire rectangle represents all of the documents of a control set, or for the purposes of
this exercise, a QC round which is being used as a control set.
n The circle contains all documents which the system has categorized as Responsive.
n Blue documents, regardless of their location, are truly Responsive, as decided by the human
reviewer.
n The blue documents inside the circle are True Positives.
n The blue documents outside the circle are False Negatives.
n Orange documents, regardless of the location, are truly Non-Responsive, as decided by the
human reviewer.
n The orange documents inside the circle are False Positives.
n The orange documents outside the circle are True Negatives.
n Under different circumstances, some of the documents outside the circle might also be
uncategorized. This workflow focuses solely on categorized documents, however.
Relativity | Recipes - 91
26.3.1 Initial setup
The purpose of a control set is to serve as a miniature model of your document universe where the
documents in the set are not used as seeds which influence categorization. They can however, be
categorized by seed documents from other rounds. This lack of direct influence allows the system
admin to gauge the accuracy and success of the project through the control set categorization
results.
However, it is possible to use a QC Round as an ad hoc control set. In order to do so, the following
conditions are required:
n The QC Round must be a random sample. This is true of all QC rounds, and this recipe
workflow is no exception. Remember, judgmental sampling is typically not appropriate for QC
rounds.
n Only sample from [Documents to be categorized] - Categorized, where [Documents to be
categorized] is the saved search in your project settings. This saved search is comprised of all
your categorized documents for both categories (such as Responsive and Not Responsive).
n Coding for the round should be complete, but the round itself should not be finished, and
post-round categorization should not be triggered. Once you finish the current round, the
documents in it become examples, and are no longer viable for this ad-hoc control set.
1. Navigate to the Overturn Summary Report and scroll down to the section called Overturns
by Designation Category.
2. Identify the Responsive overturned docs percentage for the current round.
Relativity | Recipes - 92
3. Subtract this value from 100% to calculate your Precision.
So, for the above example, 100% - 26.54% (Responsive Overturn) = 73.46% (Precision).
Method 2:
This method involves filters on the view which was set up earlier.
To calculate Precision using this method:
1. Navigate to your Assisted Review document view; ensure before you start that all filters have
been cleared.
2. Filter on the RAR Sample Set field for the current round. Again, coding should be complete
but the round should not have been Finished/Categorized.
3. Ask yourself the question: Of everything the system thought was Responsive, how much of it
truly was Responsive according to the human reviewers?
4. Filter for Responsive on the Categories – XXX RAR Designation Cat. Set field. This is
everything the system thought was Responsive. Make a note of the number of documents
returned. This will be the denominator for your calculation.
5. Filter for Responsive on the Designation field. Make a note of the number of documents
returned. This will be the numerator for your calculation.
6. Divide the numerator by the denominator. This is your Precision.
Example, Precision denominator (True Positive + False Positive): 456 documents:
Relativity | Recipes - 93
Relativity | Recipes - 94
Example, Precision numerator (True Positive): 335 documents:
Relativity | Recipes - 95
Example, Precision calculation: 335 ÷ 456 = 73.46%.
1. Navigate to your Assisted Review document view; ensure before you start that all filters have
been cleared.
2. Filter on the RAR Sample Set field for the current round. Again, coding should be complete
but the round should not have been Finished/Categorized.
3. Ask yourself the question: Of everything Responsive that the system was supposed to find,
how much did it actually find?
4. Filter for Responsive on the Designation field. This is everything the reviewers determined to
be Responsive this round. Make a note of the number of documents returned. This will be the
denominator for your calculation.
5. Filter for Responsive on the Categories – XXX RAR Designation Cat. Set field. Make a note of
the number of documents returned. This will be the numerator for your calculation.
6. Divide the numerator by the denominator. This is your Recall.
Example, Recall denominator (True Positive + False Negative): 630 Documents:
Relativity | Recipes - 96
Example, Recall calculation: 335 ÷ 630 = 53.17%.
References
n Assisted Review
n Analytics
n Views
n Filters
n Assisted Review
n Analytics
n Views
n Filters
n Assisted Review
Relativity | Recipes - 97
n Analytics
n Views
n Filters
n Assisted Review
n Analytics
n Views
n Filters
n Views
n Filters
n Assisted Review
n Analytics
n Views
n Filters
n Assisted Review
n Analytics
n Views
n Filters
n Assisted Review
n Analytics
n Views - RelativityOne
n Filters - RelativityOne
n Assisted Review - RelativityOne
n Analytics - RelativityOne
27.1 Requirements
n Relativity access
27.2 Directions
This recipe involves a scenario where the case team has a list of specific individuals and only wants
to review emails in which two or more of these individuals were involved in the email
correspondence.
Relativity | Recipes - 98
1. Create a custom index containing only the correspondence fields (To, From, CC, BCC):
a. Create a saved search that returns only the correspondence fields (To, From, CC and BCC).
b. Create a custom dtSearch index using this new saved search as your searchable set.
Relativity | Recipes - 99
d. Click Save.
e. Click Modify Terms, and then paste in the email addresses for the individuals you are trying to
identify.
You can now set up a workflow for reviewing these emails in which two or more individuals were
identified.
28.1 Requirements
n Existing email threading set
29.1 Overview
This recipe shows you how to set up your workspaces to support the import of productions with
natives when natives are available. This approach conserves disk space by storing only one copy of
the images and the natives.
29.2 Requirements
n Workspace access
o Documents: Add, Edit, Delete
o Images: Add, Delete
n Relativity Desktop Client
29.3 Directions
1. When setting up and importing into the workspace, be sure that the files are available in a shared repos-
itory accessible from your Relativity environment, and import the data using the links to the repos-
itory.
Note: You must perform the imports (Steps 6 and 7) on a machine that has access to the repository to
avoid import errors. The import of the natives (step 7) executes successfully only when natives have been
imported into the workspace (step 1) using the files that already reside in a valid location that Relativity can
access.
30.1 Requirements
n Workspace access
n Relativity Legal Hold
Note: Relativity maintains an audit log of all the actions associated with the legal hold; however, we
strongly recommend that you document all your activities leading up to the legal hold.
In addition, keep the following in mind as you establish a repeatable process to execute a legal hold:
n Factor in employee mobility and turnover.
n Consider checking for non-traditional data sources (for example, data in Cloud Services, mobile
devices).
n Define a process for immediate collection (to account for potential equipment failures).
30.3 References
Relativity Legal Hold
31.1 Overview
Files that are designated unprocessable don't actually have an error associated with them. This is
because these files weren't able to be processed in the first place, so they weren't able to register a
processing error. This recipe walks you through the process of identifying and reviewing
unprocessable files from processing.
31.1.1 Requirements
n Relativity processing
n Processing repository access
31.2 Directions
To identify and review unprocessable files, perform the following steps:
1. Create a view in your workspace called Unprocessable Documents on the document object with the fol-
lowing properties:
1. View Information:
1. Name - Unprocessable Documents
2. Order - 9999
3. Visible in Drop-down - Yes
1. Information:
1. Object Type - Document
2. Name - Unprocessable Documents
3. Owner - Public
4. Order - 9999
2. In the view, include the processed metadata fields that provide additional descriptive information
about the files, such as the Document Extension, File Type, File Name (which includes the original doc-
ument extension), and File Size (especially useful for empty files) fields.
Note: The Unprocessable field is set to Yes on any file for which Relativity doesn’t have an Invariant plugin that
is capable of extracting text or imaging/OCR’ing that document type. For example, it’s not set for a corrupt file
for which Relativity can’t extract text, such as a corrupt Word document that logs an error during data
extraction. Unprocessable documents don't have errors associated with them because they never reach a
point at which they can register a processing error.
4. Finally, set your Sort to sort by Document Extension and File Type. This allows you to complete your
review while addressing similar file types together.
You can also reference your list view to identify records individually using the view.
32.1 Requirements
n Relativity Desktop Client (RDC)
32.2 Instructions
1. Build a new production set. This is an empty 'staging' production to load the received production into.
2. Open the RDC.
3. Click Tools > Import > Production Load File… to:
a. Load the image load file (opt file) into the ‘staging’ production.
b. Use Append Only to ensure only new documents are created.
4. Click Tools > Import > Import Document Load File… to:
a. Load the .dat metadata file.
b. Map the appropriate metadata files and use the Overlay Only option on the Overwrite behavior
to the production documents that you want to load.
32.3 Notes
n Loading documents into a 'staging' production marks the production set as produced.
n When loading a production set into Relativity, the systems loads the newly created documents into the
root folder. It may be prudent to move these documents to a new folder to ensure proper foldering
and security of the documents.
n Using Append/Overlay during the metadata import creates new documents that did not have images,
but these documents aren't associated with the production. To associate these files with a production
set, create and load an image load file pointing to placeholder images and then overlay the metadata.
Using the Overlay Only option shows an error on any document not already loaded, and you can use
the error file as a list to identify documents that may not have had images.
33.3 Directions
There are two options for creating a new set of production images that retain the Bates numbering
from a previous production. Both options generate the same results.
33.3.1 Option 1
Load the documents to re-produced back into Relativity as new documents.
1. Create a saved search of the documents to produce again.
2. Export the documents’ images from the saved search.
Ensure you select the production version of the images in the Production Precedence section of the
Relativity Desktop Client.
3. Import the documents back into your original workspace as new documents. Use the Bates number as
the control number.
33.3.2 Option 2
Produce the documents in a “staging” workspace.
1. Export the documents to re-produce from the original workspace.
Note: Ensure you select the correct production version in the Production Precedence section of the Relativity
Desktop Client.
2. Create a new “staging” workspace. Load the first version of the produced images into the staging work-
space.
3. Create a production set in the staging workspace, and then produce the documents. The images con-
tain the Bates numbers from the original production, and from the new production.
4. Export the re-produced documents from the staging workspace.
Note: Ensure you select the new production version in the Production Precedence section of the Relativity
Desktop Client.
34.1 Requirements
You need to install the Relativity Processing application in the workspace
before you install the Mapping Workflow application:
n Fields
n Scripts
34.2 Directions
1. On the Relativity Applications tab, click New Relativity Application.
2. Select Import from File and click Browse.
4. Click Import.
Note: Relativity displays an error message after installation. This error is expected. Select Unlock
Processing Application and click Retry Import.
When the installation completes, Relativity displays a message indicating that import was
successful.
5. Navigate to the Script Library.
6. Execute the Map Processing Fields script provided with the application. After the script com-
pletes, you can view the processing fields mapped to their respective Relativity fields.
35.1 Overview
This recipe shows you how to upload multiple documents into a file field stored on a custom object.
35.2 Requirements
n Dynamic object with a file field
n Workspace access
o Relativity Desktop Client
35.3 Directions
After you create your custom object and add a new file field, you can use the RDC to import into this
object. This example uses an Expert Witness object.
Note: When you look at the list of available fields in your workspace, your file field does not appear. However,
the Native File Behavior is enabled.
36.2 Requirements
n Workspace access
n Relativity Desktop Client
36.3 Directions
36.3.1 Migrating images
n Clients sometimes find it easier to use the image volumes and load files on their server rather than
exporting images from Summation. You can account for all the images by comparing the total images in
Summation with the total number of images contained in the volumes on the server.
n Use a conversion tool like iConvert to convert .dii files to .opt files.
Note: Use a text editor like TextPad to combine multiple .dii files into one. Doing so saves time during the
conversion and import processes.
n Workspace access
o Edit, Delete permission for all items under the Admin tab
37.3 Directions
In this illustration, we’ll reference a single workspace that contains three different matters: A, B,
and C. Each matter is actively being reviewed by a combination of reviewers, some common to all
three matters and some who are unique to certain matters.
System admins should use the following best practices as they configure this type of workspace.
As illustrated, you'll have a minimum of seven unique user groups for the three matters in this
workspace. The minimum number of unique user groups a workspace requires (ignoring tiered
permissions within a user group, which will be described in the following example) is determined by the
following formula:
(2^m) – 1, where m is the number of matters being set up in the workspace
System admins must analyze which users belong in which groups and consider the permission levels
required for each matter and each user. For example, if we add a user to UG-4 and UG-3, and the
permission levels for the matters are identical, that particular user may be for UG-7.
However, if the permission levels at UG-5 and UG-3 are different—perhaps the user has permissions to
code in Matter A and Matter B, but not the documents in Matter C—then tiered user group access is
You can link multiple associative objects to the layout being built if you set up separate objects for
each matter.
37.3.7 Batches
The first step is to identify the documents included in the different matters within the workspace.
Batching can be done as part of the linear review workflow, or a Relativity Analytics-based
categorization workflow. In a categorization workflow, begin by sampling documents the same way
you would perform issue coding using categorization.
As described in the Fields Setup section of the Building Custom Objects workbook, batching can
leverage the Matter ID field for further review or to tag the documents during initial review.
All other system admin setup functions — such as production sets, markup sets, fields, persistent
highlight sets, and choices on the document object — are managed the same way they would be in a
single matter workspace. However, we recommend the following best practices:
n Use intuitive naming conventions. For example:
o SEC productions on Matter A could be MA – Production SEC 20110229
o EPA markup sets on Matter B could be MB – EPA Priv Set
n System admins need to be particularly aware of permissions provided to each user group. We
recommend setting up tiered access for each user group combination based on who needs access to
the views, layouts, markup sets, production sets, etc., as described above.
n Since batching is based on saved searches, further review, batching, and dtSearch indexes can be based
on search results responsive to the matters in question.
38.2 Requirements
n Workspace and system admin access
Clustered 8x8 25 KB
Clustered 16x16 23 KB
Dispersed 4x4 24 KB
Dispersed 16x16 24 KB
Threshold 0 2 KB
Threshold 127 22 KB
Threshold 215 22 KB
39.1 Requirements
n Workspace access
o OCR Profile, OCR Set, Production Sets
o Create access on Saved Searches
39.2 Directions
First, OCR a production. You should OCR your production after you run the production, but before
you perform the final export. Perform the following steps to OCR documents with redactions:
1. Create an OCR profile with your preferred settings. You must have an OCR profile before you can save
an OCR Set.
2. Create a long text field to hold your OCR results. You can't OCR directly into the Extracted Text field.
3. Create an OCR Set.
4. Point the OCR Set to completed Production. If you only want to OCR redacted documents for the pro-
duction, select Yes next to Only OCR Production Documents Containing Redactions.
5. Click Save on the OCR Set. The OCR Set Console appears.
6. Click OCR Documents from the OCR Set Console to kick off the OCR job.
When the job completes, the text becomes ready to export.
Now you can export a production using text precedence. To only produce non-redacted text when
exporting a production, you must use the Text Precedence feature. When you export a production
through the RDC, don't add the Extracted Text field to the exported field list. This results in
producing redacted text. In order to preserve the integrity of your redactions, don't add the
Extracted Text field to the selected columns tab.
Let’s envision a scenario in which you ran a production and did not specify a relational field. You
currently do not have any Attachment values in the Begin and End Attachment fields. This would
require you to overlay new Begin Attachment and End Attachment values into the Production
Information object. Here is an example of what this load file should look like:
40.1 Requirements
n Relativity 9.3
41.3 Directions
41.3.1 Correcting coding inconsistencies between true or conceptual duplic-
ates
n Each seed-overturn pair has a rank (or score) which indicates the degree of conceptual similarity they
share. The maximum possible score is 100, which means the two documents are conceptual duplicates.
Conceptual duplicates are documents which may or may not have identical text, but do contain the
same conceptual content according to the Analytics index. While it is possible that conceptual duplic-
ates may also be exact textual duplicates (i.e., documents with the same MD5 hash value), this should
not be assumed from a score of 100.
n We recommend that you use the Overturn Documents report to locate these documents by filtering
on the round and sorting by descending rank. A good best practice is to re-evaluate each seed-overturn
pair having a rank of 95 and higher to see which document was coded correctly, as well as whether each
is a suitable example.
Note: Some versions of this application may not be eligible for support by Relativity Client Services. For
more information, see the Version support policy on the Relativity documentation site.
Note: You must have valid Relativity Community credentials in order to download any Community file
linked to from the documentation site. You'll need to enter those credentials on the Community login
screen if you're not already logged in. If you're already logged in to the Community at the time you click a
link, the file is automatically downloaded in the bottom left corner of your screen. If you get an error
message stating "URL No Longer Exists" after clicking a Community link, it may be due to a single sign-on
error related to the SAML Assertion Validator, and you should contact your IT department.
42.1.2 Components
This custom solution consists of the following components:
n Relativity application
n Relativity scripts that run at the workspace level within a script group
status, include family, and then use a mass edit to update the child documents with the status
value of the parent.
o After copying the parent duplicate status value to children, create a search that returns all non-
duplicates. Family does not need to be included in this search, since all family members will now
have a duplicate status value.
n Make the non-duplicate documents identified in the above steps available to reviewers.
The following table lists and describes the columns in the report.
Column Description
Document Identifier The value of the document
identifier on the Document
object.
Custodian The value of the Custodian
associated with the document.
The following table lists and describes the columns in the report:
Column Description
Document Identifier The value of the document identifier on the Document
object.
Folder Path The value of the source field file location associated with
the document.
Relational Identifier The value of the relational field chosen to define groups of
related documents.
All Source Locations The semi-colon delimited list of all source locations
associated with the document.
43 Performing QC of Productions
Creating and exporting a production successfully can be complicated. It's important to perform a QC
of your productions to ensure accuracy.
43.2 Requirements
n Workspace Access
o Saved Searches
o Production sets
n Relativity Desktop Client
n This recipe is applicable to all versions of Relativity.
43.3 Directions
When you're preparing to create a production, it's beneficial to verify some standard checklist items
to prevent incorrect items mistakenly included in a production. Many of these items are intended to
check for conflicts between items in a group and to show where unwanted items are brought in
while segmenting desired items and including a group.
This recipe assist you in preparing a plan on how to move forward when locating unwanted data.
This recipe also assist you in identifying who reviews this data and when a review of the data should
occur.
See Using saved searches to complete conflict checks for conflict searching when considering the
following in the production set:
See Using saved searches to complete conflict checks for conflict searching when considering the
following in the production set:
44 RAR checklist
Before you begin a RAR project, think about whether the project is a good fit for assisted review.
Assisted review centers on the concept of training the system, so it learns how to interpret
uncategorized documents. The system learns best from documents that are good examples. To be
good examples, documents should have rich text with lots of concepts, not just numbers.
Also consider what constitutes a responsive document. If, for instance, responsiveness hinges on a
name or a date, that's not enough for RAR, because there are no concepts to learn, only absolutes.
Successfully completing a RAR project requires you to spend a little time at the beginning of the
process to determine if RAR is the best way to proceed.
44.1 Checklist
1. Ensure the document set you plan to use is a good population for Assisted Review. Ensure the pop-
ulation contains the following:
n Minimum 50k records with text
n Concept rich files (not primarily numbers)
n Issue or privilege coding is in a separate field or not part of RAR workflow
2. Make sure you set your timeline and goals. The stakeholders should discuss goals and timelines prior
to beginning a RAR project. This helps establish clear deliverables. Decide and determine the following:
n Determine level of Precision, Recall, and F1
n Decide on a manual review plan (i.e., all docs categorized as Responsive; privilege screen only)
n Put a production plan in place
3. Ensure your administrative prep is complete. Use the following to ensure you complete all admin-
istrative tasks before reviewers look at documents. This decreases the chance of delays during the pro-
ject.
n Install the Assisted Review application
n Create a layout and fields for the project
n Build a layout
n Create a RAR designation field
n Create choices
n Create the use as example field
n Create a text excerpt field
n Build a Reviewers’ view
n Complete the index build
n Eliminate documents without enough text
n Run the extracted text size script
n Create saved searches (training set and searchable set)
a. Optional – Use the following if the project requires categorizing for issues:
nKey issues field (single choice)
n Key issues excerpt field (long text)
n Cluster all documents in RAR universe
n Pull UNCLUSTERED documents from project
b. Optional - Use the following to further cull documents:
nEmail threading
n Key word culling
n STR
4. Ensure you prepare your reviewers. Reviewer preparation is key to success. A RAR project isn't like
other document coding. Use all the tools available to train all reviewers in RAR protocols.
n Show the RAR for End Users webinar to reviewers
n Distribute and discuss Assisted Review Reviewer Protocol
Note: This recipe is applicable to both Sample Based and Active Learning projects. To see information only
applicable to Sample Based Learning, see the Sample Based Learning section below.
45.2 Requirements
Analytics Server
In this example, the system will only learn from the text that is not framed by a red box. Even if the
subject line and short sentence fragment that remain are responsive, there is not enough text to
warrant this document’s inclusion as an example.
45.4.3 Images
Consider the following example from the Enron data set. This document, which appears to contain
useful text, is actually a JPEG image of a paper document:
Because the system only works with a document’s extracted text, all of the responsive text located
in the image will be unavailable for machine learning. Consequently, this document, while highly
responsive, turns out to be a poor example document.
45.4.4 Numbers
As mentioned above, numbers are not considered in the machine learning process. It follows that
spreadsheets consisting largely of numbers, while potentially responsive, do not make good example
documents.
Consider the two examples below. Both are spreadsheets, but only the second would make a good
example document.
Relativity pastes the selection into the Excerpt Text field on the Assisted Review coding layout. You
can perform this action multiple times per document. Each time you select text and click Add to
Excerpt Text, the text is automatically appended to the Excerpt Text field.
46.2 Requirements
n Workspace access
n Object creation permission
46.3 Directions
Perform the following steps:
1. Create a new object type called Work Orders.
2. Create fields in the Work Orders type for coding values, such as Page Number, Bates Number, or
Description.
3. Add the newly created fields to a document coding layout.
4. Set the Work Orders object as an associative object on the layout. The Work Order tab can hold all the
individual Bates Numbers of the work orders.
5. You can also create a document view to identify the documents with work orders.
47.2 Requirements
n Workspace access
o Search index – Edit/Add and corresponding tab
Note: You must have a leading space. You can't have a trailing space.
4. Run a search using the following regular expression for social security numbers:
"##((?!000)(?!666)([0-8]\d{2}))\-((?!00)\d{2})\-((?!0000)\d{4})"
"##((?!000)(?!666)([0-6]\d{2}|7[0-2][0-9]|73[0-3]|7[5-6][0-9]|77[0-1]))(\s|\-)((?!00)\d{2})(\s|\-)
((?!0000)\d{4})"
Note: The quotes and # characters are necessary for the dtSearch index to recognize the regular
expression. Quotes can't be Smart Quotes; they must be ASCII quotes.
Note: dtSearch uses TR1 regular expressions as implemented in Visual C++, which Microsoft's Developer
Network documents.
48.1 Requirements
n Workspace access
n Search index – Edit/Add and corresponding tab
48.2 Directions
1. Create a dtSearch index.
2. Change the alphabet file to include the % sign as a searchable character:
a. Enter the following under [Letters] // Original letter, lower case, upper case, unaccented:
i. [Space] [%] [Space] [%] [Space] [%] [Space] [%]
Note: You must have a leading space. You can't have a trailing space.
c. Repeat these steps for any other symbols that you need to make searchable. Delete the appro-
priate symbols from the [Spaces] section.
3. Perform a full build of the dtSearch index.
You can now search for terms containing the % sign using a regular expression. For example, if you
need to search for documents that contain the term 75%, you would enter the following in your
search box (ensuring you select the proper dtSearch index):
The quotes and # characters are necessary for the dtSearch index to recognize the regular
expression. Quotes can't be Smart Quotes; they must be ASCII quotes.
Note: For most symbols, once you have made them searchable in the alphabet file, you can type them
directly into the dtSearch box without using regular expressions. Thus to search for 30!, enter 30! into the
dtSearch box after you have made ! a searchable character. You only need to employ \W for certain
symbols such as the % sign.
48.3 References
n Searching with Regular Expressions
n Searching with Regular Expressions
n Searching with Regular Expressions
n Searching with Regular Expressions
n Searching with Regular Expressions
n Searching with Regular Expressions
n Regular Expressions for Beginners webinar
49.2 Directions
Consider the following guidelines when using Structured Analytics in Relativity:
n There are advantages to running each of the Structured Analytics Set operations individually. For
example, if you need to make changes to a particular operation, it is best to make the changes and
rerun the set for that one operation, rather than re-running a set with multiple operations selected.
n The ideal groups of Structured Analytics operations to perform as separate sets are:
o Email threading and Near Duplicate Identification
o Language Identification
o Repeated Content Identification
n We do not recommend clicking Clean Up Set following your first Full Analysis if you expect to run an
Incremental Analysis in the future. Keeping the results of the first full run on the server will expedite the
incremental run.
n When setting up email threading, the Identify emails in document set search items should also be in
the Select document set to analyze search, but the documents to analyze should also include attach-
ments.
n When running Near Duplicate Identification, system admins have two options:
o Run with email threading – the Near Duplicate Identification process identifies all textual duplic-
ates within your document set which are not emails, while the email threading operation iden-
tifies all the email duplicates.
o Run it without email threading (either by itself or with another Structured Analytics operation)
– the Near Duplicate Identification process includes all email and non-email textual duplicates.
n When performing Repeated Content Identification, you should change the default settings if you’re
working with large data sets. See Sampling for repeated content for more information.
Once you run the Structured Analytics Set, the following workflows can assist with an efficient
review process.
n System admins can set up a review to include only the inclusive email threads. Listed below are some
settings for a Saved Search that can be used to batch documents:
o Search name and settings (you can choose whether to include family members)
o Search fields
o Sort order
n Similar to identifying email threads, you can set up a review for all near duplicate items by setting up a
saved search to identify such documents.
o Search conditions
o Search fields
n As a system admin, you can continue to manually identify repeated content items and include them as
a part of the Repeated Content Filters tab. Alternatively, you can use the Repeated Content Iden-
tification feature and link to an Analytics profile.
n To facilitate review in multiple languages, system admins can use the Language Identification feature to
batch out documents based on the primary language identified, or use the Pivot feature to identify and
filter on the languages the system identified.
50.1 Requirements
n Workspace access and system admin rights
n Relativity Structured Analytics application
50.2 Directions
50.2.1 Create saved searches
First, create the following saved searches when setting up a Structured Analytics Email Threading
set.
n Saved Search #1 = Parent emails in document set: Create a saved search that returns only
the parent emails and family. The search conditions you set will depend on the metadata
fields available in your workspace.
Note: For RelativityOne, documents with more than 30 MB of extracted text will automatically be excluded
from the structured analytics set and you no longer need to manually exclude them using an additional
search.
Note: Rather than modify the Default profile, you should always create a new profile or edit an
existing profile you have created.
3. Email Threading Settings - map the email threading fields on the Analytics Profile Layout to
the appropriate workspace fields containing the following specified metadata.
Note: When setting up an Email Threading Structured Analytics set, if you choose to
not use email header fields, then you will not need to map any of the above email
header fields. In this scenario, email threading relies on extracted text, and the Parent
Document ID and Attachment Name fields.
n Attachment name field – file name of the document attachment. This only needs
to be populated for child documents and not the parent email. This field is used
to display an attachment’s file name and icon in the Email Threading Display visu-
alization field. Relativity Processing creates the Attachment name field. You may
use a File name field instead.
Note: In order to properly thread and visualize attachments, the Parent document ID
and Attachment name fields must be mapped.
5. Advanced Options – these settings affect the building of the conceptual Analytics Index, but
will not have any effect on your Structured Analytics Email Threading set. Thus, you may
leave the default settings as is in this section.
n Click Run Full Analysis to begin running your Email Threading analysis.
l Display Field - Email Threading Display (Choose Control Number if you wish to see
n Email Threading View - Set Conditions - Set the conditions of the view such that it returns the
saved search created in the previous step. In the screen shot below, the saved search was
named Email Threading Results:
o (Saved Search) = Document is in = Saved Search created in previous step
n Email Threading View - Set Sort Order - Configure each field below to sort inascending or des-
cending order, depending on the case team's preference:
Note: If you wish to see the latest email in a thread first, set Indentation to sort in descending
order.
n Advanced Settings:
Select the following Advanced Settings options when creating the Email Threading
view:
Object Type - Document
o
Note: If you wish to see the latest email in a thread first, set Indentation to sort in descending order.
The Email Threading Display field provides the following visual information about email threading:
1. Indentation Bubbles
The indentation bubbles indicate each email message's indentation level within the thread.
For example, the first email in the chain would be "1," an email responding to the first email
would be "2," and an email responding to the second email would have a "3." Email
indentation bubbles go up to 99. For messages with an indentation level over 99, the bubbles
display "99+."
6. Click Search.
To save this search, click the Save as search icon . See Saving searches on the Documents tab for
additional information.
1. To launch the email thread visualization pane, click the email thread visualization ( ) icon in the
lower right corner of the displayed document in the viewer. The pane expands and shows the email
thread group for the selected document in the viewer.
2. From the visualization, the currently opened document is outlined in gray and displays as larger than
the unselected documents in the thread.
4. If you hover over an email icon, a tooltip window displays who the email is from and the sent date. It
also lists any attachments or duplicate spares for the selected email. You can click on the attachment
name or the name of the duplicate spare to open it in the viewer.
6. Select the field you want to highlight in the email thread visualization pane from the Coding Highlight
field drop-down (e.g. Confidential Designation). The emails in the thread are now highlighted with dif-
ferent colors corresponding to the available choices for the field.
50.4 References
Email Threading
Email Threading
Email Threading
Email Threading
Email Threading
Email Threading
Email Threading
51.1 Requirements
n Relativity access
n An active RAR project
51.2 Directions
This workflow assumes that you are between rounds in a RAR project (i.e., you've finished the
current round and are ready to begin the subsequent round). Perform the following steps to remove
documents from a RAR project after the start of the project.
1. Identify the documents that are not good example documents for the RAR project per the Reviewer
Protocol specifications.
a. Create a saved search against your RAR project universe with the following conditions:
n (RAR Use As Example = Yes) AND
n ((Extracted Text Size <=0.75KB) OR
n ((Extracted Text Size >=2048KB) AND (RAR Excerpt Text = Not Set)) OR
n (File Type = compressed files, RAR, ZIP, system files, image files, CAD drawings, maps, and
calendar items, number-rich documents))
2. Once you've identified bad example documents, flag the documents by setting the RAR Use as Example
field to No.
3. Proceed with the regular RAR workflow. Create new training or QC rounds per the needs of your pro-
ject/reports (you can create a round with a single document to force categorization).
Once you complete the round in step 3, all the documents that were flagged RAR Use as example =
No (in step 2) will no longer be used by the categorization engine to make suggestions on the
documents in the RAR project universe.
51.3 References
n Analytics
n Relativity Assisted Review
n Analytics
n Relativity Assisted Review
n Analytics
n Relativity Assisted Review
n Analytics
n Relativity Assisted Review
52.2 Requirements
n Relativity Processing
n Processing repository access
52.3 Directions
1. Select the Processing > Processing Sets tab.
2. Select the processing set with the errors that you want to resolve.
3. Click View ErrorsView Job Errors on the console for the current processing set. You can now view the
errors in the processing set on the Processing > Document ErrorsErrors tab.
Note: When you repair the file, you modify it, therefore changing the meta data. Consider making a copy of
the file before making any changes to preserve the original meta data.
1. Retry the discovery phase after you resolve the file errors:
a. Use the mass operations window in the lower-left corner to check the files you fixed.
b. Select Checked and Retry.
c. Click Go.
53.2 Requirements
n Workspace access
o Use Sampling, Pivot, Tally/Sum/Average
o Delete and Add Fields, Choices, Layouts, Views
o Delete and Add all Search functions
53.3 Directions
General QC workflow
1. Create the QC Layout
2. Batch documents for the QC team to review
3. Identify total documents put through QC and resulting overturns
To include the statistics in your reporting, save results by exporting to a file from the Pivot view:
6. Create review batches using the sample set search as the batch data source.
7. Assign to the QC team for review using the procedure described above.
Note: To learn more about using sampling and how best to apply it to your review, see Sampling and
Understanding the Statistical Sampling Formula.
54.1 Requirements
n Relativity 8.1 or higher
n Relativity Structured Analytics application
n Document set larger than 100,000 documents (no need to sample if you're already smaller than that)
54.2 Directions
54.2.1 Creating the sample
1. First, create your random sample of the target documents.
2. Navigate to the Documents tab and restrict yourself to the documents you want to focus on. It might
be everything in the workspace, the searchable documents from your index, or a set limited by file
type, email inclusives, or some other subset.
n Type: Fixed
n Size: 100,000
5. Click Sample. You should now see only 100,000 documents listed on the page.
Note: In Relativity 8.1-9.2, you create lists with an icon rather than a mass action. Other than that minor
workflow change, this recipe works for releases prior to 9.3. See the Lists topic in the Admin guide for more
information.
Note: You may over time decide that you prefer setting this value higher or lower, but the above
recommendation is consistent with our experience with most customers. Contact
[email protected] if you would like to discuss in greater detail.
55.2 Directions
To search for custom object information on the Document object, perform the following steps:
1. Create a multiple object that links the Document object to the custom Custodian object. In this
example, Custodian Obj Linker is the name of the multiple object .
2. Set Open to Associations to Yes on the Custodian object fields you want to make searchable on the
When creating a new Saved Search, or toggling search conditions on, you can search across your
Custodian object Start Date and End Date fields. Look for the field name that starts with the name of
your multiple object. In this example, it is Custodian Obj Linker.
You can also filter on the Custodian object Start Date and End Date fields by adding them to a view
on your Document object.
56.2 Directions
56.2.1 Field filters
To search for a date using a field filter, perform the following steps:
1. Ensure the date field you want to search for is in the current view. Add it to the view if necessary.
2. Turn on filters, and then enter a valid search string in the text box filter.
The following table lists examples of valid date and number searches, as well as the expected result
set.
Valid search strings Return items where...
>= 7/24/2008 [FIELD VALUE] >= '7/24/2008'
<= 7/24/2008 [FIELD VALUE] <'7/25/2008'
= 7/24/2008 ([FIELD VALUE] > '7/23/2008') AND ([FIELD VALUE] <'7/25/2008)
>= 07/27/2008 1:23 PM [FIELD VALUE] >= '07/27/2008 1:23 PM'
<= 07/27/2008 1:23 PM [FIELD VALUE] <= '07/27/2008 1:23 PM'
= 07/27/2008 1:23 pm [FIELD VALUE] = '07/27/2008 1:23 PM'
7/24/2008 BETWEEN 8/24/2008 ([FIELD VALUE] >= '7/24/2008') AND ([FIELD VALUE] <= '8/24/2008')
7/24/2008 1:23 PM BETWEEN 8/24/2008 ([FIELD VALUE] >= '7/24/2008 1:23 PM') AND ([FIELD VALUE] <=
3:45 PM '8/24/2008 3:45 PM')
07/27/2008 ([FIELD VALUE] >= '07/27/2008') AND ([FIELD VALUE] < '7/28/2008')
>= 100 [FIELD VALUE] >= '100'
<= 100 [FIELD VALUE] <= '100'
= 100 [FIELD VALUE] = '100'
The following table includes examples of invalid data and number search strings.
Invalid search strings Invalid reason
> 7/24/2008 You must use the equal sign with the greater than operator (as in >=).
56.3 Notes
n By default, date fields use the filter type of text box. However, you can change this to custom in the
field information.
n If you need more complexity than the text box filter type can provide, you can set up a saved search.
57.2 Directions
1. Create a new dtSearch index, and then remove all stop words.
2. Create a search terms report with all of the stop words using that index.
3. Create a saved search with the following conditions:
n the STR field is not set AND
n the Extracted Text field is set
The results yield all documents with extracted text, but don't contain any stop words. Documents
that fall into this category are the documents with the wrong encoding.
58.1 Overview
Relativity doesn't automatically identify handwritten documents. This recipe demonstrates how you
can use custom search indexes and search terms reports to execute searches that can identify
potential handwritten documents or documents that contain poor quality OCR.
58.2 Requirements
n Workspace access
n Edit and delete permissions on search indexes
n Edit and delete permissions on searches
58.3 Directions
Note: This recipe assumes that the documents contain available text or were previously OCR-ed to capture
all possible text.
Note: Make sure the searchable set for this search is the same as the searchable set used above.
The results of the search narrow down the universe of documents to the documents that potentially
contain handwritten text or poor quality OCR.
59.1 Requirements
n Structured Analytics - Email Threading
n Saved Search
n Workspace Access
o Threading fields
o Saved Search
59.2 Directions
1. Set up a saved search to return multiple emails with multiple threads.
a. Use the Email Threading ID field. This field indicates how many threads are in a conversation. The
search in the below example asks for items that contain second level emails.
b. Include the Thread group to pull in the rest of the conversation.
2. Create another saved search to return all records where Email Threading is set, but excludes the search
in step 1.
60.2 Requirements
n dtSearch index
60.3 Directions
Using your dtSearch index, construct a search for your term using the NOT and w/n operators to
exclude the term only when it's part of a repeated email disclaimer. For example, to find documents
that contain the term “Confidential” use the following syntax:
Using this syntax returns documents that contain the term “Confidential” and excludes documents
where “Confidential” only appears in the email footer. If a document contains “Confidential” in the
body of the email and in the footer, it returns as a hit.
Your proximity is equal to the number of words that appear between the first word in the disclaimer
and your search term. In this example, “Confidential” is one word from the start of the email footer
string.
61.1 Overview
This recipe shows you how to set up custom search indexes and then use simple search operators to
identify a list of documents.
61.2 Requirements
n Workspace access
o Edit and delete permissions on search indexes
o Edit and delete permissions on searches
61.3 Directions
This recipe assumes that the documents you're attempting to identify contain a unique identifier in
a fixed-length text or long text field.
The examples in this recipe show you how to search for the control number of the document.
However, you can use the same workflow process for other unique identifiers like Bates numbers.
Ensure that you separate the unique identifier values by a line break. This may require you to
reformat the string of control numbers using Notepad++ or Textpad.
61.3.2 Option B
Use this option when you have more than a few hundred documents to identify.
1. Create a saved search called ControlNumber - for Index with the subset of documents you want to
search against.
2. Ensure that the only field the saved search returns is the Control Number.
3. Create a new dtSearch index called dtSearch-ControlNumber.
4. Perform a full build on this new index using the saved search from step 1.
5. Create a search with the following settings to identify documents with the control number once the
index is complete:
n Name- ControlNumber List
n Search Index
n Search With- dtSearch-ControlNumber
n Search String- ABC001 OR ABC002 OR ABC003
Ensure that you separate the unique identifiers with an “OR” operator. This may require you to
reformat the string of control numbers using Notepad++ or Textpad).
The results of the search help identify the documents by their control numbers.
Option A Option B
Ideal when searching for fewer than a few hundred Ideal when searching for more than a few hundred
documents. documents.
Keyword indexes any length of text, but limits the search dtSearch doesn't index (by design) control
for control numbers to 4000 characters in length (overall numbers longer than 32 characters.
length of all characters you search).
Ensure you use the IS operator when searching. You can If your control number contains periods (.) or
also use the CONTAINS operator if, for example, you are hyphens (-), you must index those characters prior
trying to bring in all family members that begin with the to building the index, or enclose the control
same prefix. number in quotes.
62 Searching workflows
Relativity provides great flexibility to help you identify documents through searches so that you can
ready them for further review and analysis. This recipe highlights setting up multiple search
indexes, applying various search techniques, and using filters properly when executing searches.
62.1 Requirements
n Workspace access
o Edit, Delete access on Search Indexes
o Edit, Delete access on Searches
62.2 Directions
Relativity offers two types of search indexes, Basic Keyword Search and Specialized dtSearch.
The Basic Keyword Search Index populates automatically, is available out of the box, and supports
basic Boolean operations. These operations include AND, OR, NOT, and the wildcard (*) operator.
The Specialized dtSearch Index is custom built and must be set up with an index to query against. In
addition to basic Boolean operations, it also supports the following functionality:
n Wildcard operator for a single digit (=)
n Stemming operations (~)
n Fuzzy search operation (%)
n Range Searching on Numeric fields (~~)
n Customize data set to be indexed via custom saved search
n Proximity Searching
The following includes best practices for the use of advanced operators and workflow options:
The following image shows all correct and incorrect combinations of proximity searches that result
in successful hits when using proximity operators:
In this case, we continue to calculate the distance of the string ("confidential and/or privileged
information:) from the word ("message") as shown above. However, when using connector words,
system admins should create an index that removes stop words from the noise word list.
Relativity reserves the following noise words and characters, which continue to behave as operators,
as well as being noise words: and, or, not, to, contains, xfirstword, xlastword, ", ( ), *, ?, %, @, ~, #,
&, :, =.
Once that index is available for query you can either place the search string in quotes or apply
stemming to the connector words to override their function as a connector.
62.2.4 Filters
In addition to the Search Indexes, you can also use filters to search on metadata fields and narrow
down the review set of documents.
The following list includes metadata fields, their corresponding field types, and the filter type
available in Relativity:
n File Type - Single-Choice, List Filter
n Custodian - Single-Choice, Pop-Up Filter
n Date Sent - Date, Textbox Filter
n Email To - Long Text, Textbox Filter
n Email Subject - Fixed Length Text, Custom Filter
n Designation - Single Choice, Multi-Choice List
The following table shows different types of filters you can set up with available fields in Relativity.
63.2 Requirements
n Relativity 7.5
oUpgraded Relativity environment from an earlier version
n Workspace access with the following roles:
o System Administrator
o Script Admin
63.3 Directions
1. Make sure the Distributive Transaction Coordinator (DTC) is turned on. See the Upgrade Guide for
details.
64.2 Requirements
n Workspace access with the following permissions:
o Field: edit/delete
o Searches: edit/delete
n Relativity Desktop Client
2. Set up fields such as Custodian, email fields, File Name, and Extracted Text as Unicode-compliant to cap-
ture CJK characters.
3. Your processing vendor should have provided you with data files that are Unicode/CJK character-com-
pliant. If this is the case, import the data in a Unicode-compliant format. Be sure to include any extrac-
ted text.
n Once you configure your SQL full-text language to the correct language (Japanese in this example), you
can perform keyword searching and filtering in that language.
64.3.1.2 dtSearch
Index
Once you set the fields to be Unicode compliant, leaving the defaults in the dtSearch index setup
(dtSearch index alphabet list: CJKRanges = 2e80-ac00 ac00-d7af f900-faff fe30-fe4f) allows you to
index characters in CJK languages where each character is treated as a separate word, enabling
word breaking with CJK.
n The term 54 所 does not match the subject. The subject has the additional characters, hence no
match.
Note: Refer to the Using Analytics with CJK characters recipe for more detailed considerations.
65.2 Requirements
n Relational field identifying email families.
n Relativity security access
o Field- edit/Add
o Document- edit
o Permissions- access to Scripts tab
o Groups- system Admin and Script Admin (if you didn't add the script to the workspace.)
66.1 Requirements
n An active Analytics index
n Assisted Review Application
66.2 Directions
This recipe assumes that the following:
n The Assisted Review application is installed.
n An optimized Analytics index has been built and includes the project documents.
n The project is a good fit per the RAR Checklist.
n Text for the documents is available or the documents have been OCR-ed to capture all possible text.
n Documents/a subset of documents have been reviewed.
Once you've established these prerequisites, perform the following steps to integrate an Assisted
Review workflow midway into review:
Note: The saved search must exclude documents marked No for use as example, as RAR uses all coded
documents in a pre-coded seed saved search as training examples.
6. Proceed with the regular RAR workflow. Finish the pre-coded seed round and proceed to subsequent
rounds (training or QC rounds per the needs of your project / reports).
66.3 References
n Analytics
n Relativity Assisted Review
n Analytics
n Relativity Assisted Review
n Analytics
n Relativity Assisted Review
n Analytics
n Relativity Assisted Review
n Analytics
n Relativity Assisted Review
n Analytics - RelativityOne
n Relativity Assisted Review - RelativityOne
67.2 Requirements
n Correctly populated email metadata fields
n The Delimiter Count by Saved Search scrip from NSerio, located on their website
67.3 Directions
To tally all email recipients, use the following steps:
1. Create a field to store the consolidated values of Email To, Email CC, and Email BCC.
1. Field Type: Long Text
2. Field Name: Email All Recipients
2. Create a field to store the number of delimeters.
1. Field Type: Whole Number
2. Field Name: Email Recipient Count
3. Create a saved search of all the documents for which you'd like to tally the number of recipients.
1. (Optional) You can pull these results in a saved search by marking the Email To, Email CC, or
Email BCC as set.
1. Return these fields:
n Email All Recipients
n Email Recipient Count
n Email To
n Email CC
n Email BCC
4. Using the filter pane, filter on the Email To field for Is Set.
5. Use the Replace mass action on all documents returned by the filter.
1. Field to Update: Email All Recipients
2. Action: Replace Entire Field
3. Update With: Field
4. Field: Email To
6. Use the Replace mass action on all documents returned by the same filter as step 4.
1. Field to Update: Email All Recipients
2. Action: Append to End
3. Update With: Text
4. Text: ; (semicolon, no spaces)
7. Clear your previous filter.
8. Using the filter pane, filter on the Email CC field for Is Set.
9. Use the Replace mass action on all documents returned by the filter.
1. Field to Update: Email All Recipients
2. Action: Append to End
3. Update With: Field
4. Field: Email CC
67.4 References
n Fields
n Mass replace
n Saved search
n Scripts
n Fields
n Mass replace
n Saved search
n Scripts
n Fields
n Mass replace
n Saved search
n Scripts
n Fields
n Mass Replace
68.2 Requirements
n Applicable to all Relativity versions
n Relativity security access
o Permissions: Access to Tabs option under the Administration tab
o Tab: Edit/Add
o View: Edit
68.3 Directions
1. Your categorization set must have a Maximum Categories per Document value greater than 1.
2. Under Tabs, create a new tab to display the Analytics categorization results.
a. Name the new tab and select an Order.
b. Set Link Type to Object.
c. In the Parent field, you may wish to place the new tab under the Analytics tab.
d. Set Object Type to Analytics Categorization Result.
e. Leave Is Default as No.
3. Navigate to the newly created tab.
4. Create a new view with the object type of Analytics Categorization Result.
5. Ensure the following fields are added to the view:
a. Document – [Name of Categorization Set]
b. Category
c. Category Example
These fields show you the document that was categorized, the name of the category, and the
example document that was used by the Analytics index to make its categorization decision.
69.2 Directions
1. In the application database, write the tag(s) information to a field. Use a semicolon to separate the indi-
vidual tag or folder naming.
2. Export the data from the application, using default delimiters. Ensure that at least the Document Iden-
tifier and Tag fields are in the export.
3. Create a field in your Relativity workspace to hold the tag information.
a. Set the Field type to Multiple choice.
b. Ensure the field is available in the field tree.
4. Import the tag data into the field created in step 3. Ensure that the multi-value delimiter is set to semi-
colon in the Relativity Desktop Client.
5. Navigate to the field tree and observe your tags.
70.1 Requirements
n Workspace access
n Relativity Structured Analytics application
70.2 Directions
The following sections contain sample views that you can create along with workflow use cases for
each. The use cases provide details about the results of the Email Threading Structured Analytics
sets. These views can be very useful for your reviewers. See the table that appears after the
following views for an explanation of each of the fields.
This view provides context for review by displaying family members like attachments to an email
when going doc to doc.
In conjunction with this view, the Related Items pane for a Thread Group view can help reviewers
find context for the email that is being reviewed.
70.2.3 Sample view of the Related Items pane for an Email Thread Group
This default view is available in the Related Items pane. It allows reviewers to identify the context of
an email that is being reviewed. You can use this view with both sample views identified previously.
Note: From a review standpoint, it's important for the reviewer/case team to be aware that the document
with the most amount of text is identified as the Principal document.
F Indentation This Custom (system) field created following Email Thread analysis defines the
depth of the email in the email chain. The example here represents that the email
was third in the chain.
G N/A An example email along with associated attachment.
H N/A An example Inclusive email (based on the message contents).
I N/A An example Inclusive, but duplicate email (based on the message contents).
J N/A An example Inclusive email (based on the message contents & presence of an
attachment).
K Textual Near Dupe Custom (system) field populated following Near Dupe analysis that maintains the
Group relationship of textually near duplicate documents.
L Textual Near Dupe Custom (system) field created following Near Dupe analysis that identifies
Similarity / Textual whether the Principal document (the one with the most text) and the Similarity
Near Duplicate Prin- score provides the score of all the associated Near Duplicate documents in that
cipal Near Dupe Group.
M Email Threading Icon available in the Related Items pane in the document viewer that, when
Relational Field clicked, identifies all the documents related to that Email Thread group.
Icon
N Near Dupe Rela- Icon available in the Related Items pane in the document viewer that, when
tional Field Icon clicked, identifies all the documents related to that Near Dupe group.
O Relativity Compare This text comparison icon, when clicked, allows you to view the differences in text
Icon between two documents (useful for comparing near dupe documents.)
n Z = Z value, which is based on the confidence level (e.g., 1.96 for 95% confidence level)
n p = reflects the chance of a defect (i.e., overturn), expressed as a decimal; Assisted Review con-
servatively sets this at .5
n c = confidence interval — or margin of error — expressed as a decimal (e.g., .04 = ± 4%)
72.2 Requirements
n Relativity Analytics index
n Workspace access
o Document – Edit
72.3.2 Scenario 2
You need to find key documents in an opposing production. Relativity’s categorization functionality
identifies and groups similar documents together based on a set of example documents manually
identified by the user. When you receive documents in an opposing production, and hot documents
have been identified in your documents, you can pass these document values to the new production
items by making them examples for the Analytics engine.
Perform the following steps:
1. Create a categorization set and use the Issue Designation multiple choice field as the Categories and
Examples source.
2. Use the Synchronize feature for your new categorization set.
3. Click Categorize All to categorize the opposing counsel's documents with your categorization set and
to identify opposing counsel's documents which are similar to your hot documents.
Setting the available Categories and Examples source option to use your multiple choice designation
field enables the Synchronize feature for categorization. The Synchronize feature automatically
creates categories for all choices associated with the specified field and designates example records
for all documents with this field coded. With the example document records identified in your data
set, categorization identifies and organizes similar documents in the opposing counsel's data set.
Note: Note: This document draws from various Content Analyst resources with their permission.
73.2 Requirements
n Relativity 9.0 or higher
n Workspace access and system admin rights
Note: This document draws from various Content Analyst (CAAT®) resources with their permission.
There are two approaches to word-breaking (tokenization) for all Java languages (other than
Chinese):
1. Rule-based Iteration: In this approach, spaces and punctuation define word boundaries.
n Take the default English stop words and use a translator tool to translate to the desired language.
The Email Header filter would not find this text. Instead, use a Regular Expression filter as follows:
(?i)(Von.*?Gesendet.*?An.*?Cc:.*?Betreff.*?\r\n)
Note: The regular expression flag (?i) forces a case insensitive match. This is necessary because matching for
what to filter out from Structured Analytics analysis is case-sensitive.
Note: The Optimize training set feature supports all Java-compatible languages and should be enabled.
Note: Language identification in Relativity 9+ supports 173 languages. Language identification considers all
Unicode characters and understands which characters are associated with each of the supported
languages. For example, Japanese has many different character sets - Kanji, Katagana, Hirigana, all of which
are supported. See Supported languages matrix in the System Guides for the full list of supported
languages.
Note: This supports all Java-compatible languages and is highly recommended to use with non-English
cases.
When the Analytics engine parses emails, it looks for English-language cues to determine when it is
or isn't in an email header. In particular, it's looking for words like "To, From, CC, Subject" in the
case of the traditional headers. And it's looking for format such as "on <date>, <author> wrote:" for
single-line replies. There are other variations, but the email header logic is English-centric.
Email threading will be affected as follows by non-English email headers:
n Groups of emails which should be in a single thread will split into multiple threads. This is due to not
matching up the foreign-language-surrounded segment with its English-language version (either when
the email itself is in the collection, or when the email was replied to by both an English and a non-Eng-
lish email client).
n There will be fewer segments than desired in the Email Thread Group of a document which contains for-
eign email headers.
n If emails contain mixed languages in header fields, for example some field names are in English and
some are in a different language, your Indentation field is lower than expected because Analytics
doesn't identify the emails to be emails.
Note: The Analytics engine does not offer capability to effectively deal with email headers which are in a
non-English language.
73.4.3 Checklist for using Analytics with non-English language and multiple
language workspaces
The following table can be used as a general ordered checklist for using Analytics successfully in
workspaces with a non-English primary language or with multiple languages:
Process description Non-English work- Multi-language work-
spaces spaces
Run Textual Near Duplicate identification
a a
Run language identification
a a
Determine if there will be separate indexes (i.e., at least 1K
documents of a given language)
a
Find appropriate stop word list
a a
Create one new Analytics profile with appropriate stop words
a
Create two new Analytics profiles with appropriate stop
words (if separate indexes)
a
Create saved searches
a a
Run Repeated Content identification against the Searchable
Set search
a
74.1 Requirements
n An active Analytics index
n Categorization data source (saved search)
n Edit/Add access to Analytics Categorization Set
74.2 Directions
1. Create a new Analytics Categorization Set (found under the Analytics tab) with the following settings:
a. Set the Documents to Be Categorized and Analytics Index.
b. Set Privilege as the Categories and Examples Source.
c. Save your Categorization Set.
2. Click Synchronize in the Categorization Set console.
3. Click Categorize All Documents.
4. Once categorization is complete, a field is created that contains the categorization results. It will be
named Categories – <your Categorization Set name>. To begin screening for privileged documents, sort
on the CSR – <your Categorization Set name>::Category Rank field, and prioritize the higher-ranking
documents.
Note: Categorization has the potential to return many false-positive results. You can focus on more viable
results by raising the Minimum Coherence Score.
75.1 Requirements
n Active Analytics index
n Analytics categorization set
n Edit/Add access to Analytics categorization set
75.2 Directions
1. Identify hot documents to serve as example documents, using any of these methods:
a. Use key search terms to find hot documents.
b. Use key documents from your review to serve as examples against opposing production.
c. Use the Find Similar Documents functionality to begin a review of main document items.
d. Cluster and analyze documents from each cluster.
2. Mark your hot documents as examples, using any of these methods:
a. Use a layout, and then apply values to those records.
b. Right-click, and then apply the document as an example.
c. Import a list of document identifiers into the Analytics Example Object.
3. Categorize the examples against the database, and check for more hot documents.
76.1 Requirements
n An active Analytics index
n An existing RAR project
76.2 Directions
1. In the Cluster browser, select the cluster set that includes the documents in your RAR project, and then
click Visualize Cluster.
2. In the filter panel, create a filter condition that returns the previously identified responsive or hot doc-
uments, and then click Run Search.
3. The system applies a heat map overlay to indicate which clusters contain the coded documents.
a. Set the cluster visualization depth slider to 2 to see the sub-clusters that contain matching doc-
uments.
6. Now that all the documents in the selected clusters are displayed in the document list, save these doc-
uments as a search using the Save as Search functionality. Save the search in the search folder for your
RAR project so it can be accessed when creating a new round. In the following example, the search is
saved in the folder for the Responsive Review RAR project.
78.2 Requirements
n Workspace access
o dtSearch creation rights
o Saved search creation permissions
n A list of stop words for the desired language
79.2 Directions
1. Set up a privilege conflict search that returns documents that are coded not privileged or have not
been reviewed for privilege within an email thread that contains privileged documents. See the recipe
Using saved searches to complete conflict checks for additional information.
2. Load a document returned by the above search into the document viewer.
3. In the viewer, click on the Show/Hide Email Thread Visualization icon ( ) to open the Email Thread
Visualization panel.
4. In the Display Options section, enable Highlight Field, and select your privilege coding field. Note that
the field list includes single choice and yes/no fields.
5. Click LEGEND to display the field highlighting legend.
6. In the example screen shot below, the color coded field highlighting allows you to quickly see where an
email has been coded as Not Privileged (yellow highlighting), and where the Privileged field is not set
(gray shading).
7. Click on any square node to view the email in the viewer.
8. The interactive visual indicators allow you to quickly focus in on and correct potential inconsistencies,
and provide context as to where the emails are within the conversation thread.
Note: Reviewers can also have the Email Thread Visualization console open with Highlight Field enabled
while conducting their initial review. Doing this provides reviewers with real-time visual feedback regarding
coding decisions, allowing them to work more consistently and efficiently.
80.1 Requirements
n Relativity 9.5 and higher
n An Email Threading set
80.2 Directions
1. Ensure documents are tagged with a load identifier so that newly loaded documents can be readily
identified.
2. Load new documents into the workspace and tag those documents with a load identifier value.
3. Build an email threading set that includes existing documents plus newly loaded documents.
4. Create saved search 1 to return documents that have been coded responsive and include email thread
group.
5. Create saved search 2 to return documents in search 1 that have a load identifier indicating they are
newly loaded documents.
6. Search 2 returns the newly loaded documents that should be prioritized, since these documents are in
email threads containing documents that were previously coded as responsive.
7. Run search 2 and open a document returned by the search in the document viewer.
8. In the viewer, click on the Show/Hide Email Thread Visualization icon ( ) to open the Email Thread
Visualization panel.
9. In the Display Options section, enable Highlight Field, and select your data load indicator field. Note
that the field list includes single choice and yes/no fields.
10. Click LEGEND to display the field highlighting legend.
12. Click DISPLAY OPTIONS and select the responsiveness coding field.
13. Click LEGEND.
14. As can be seen by the field highlighting in the screen shot below, all emails in the thread have been
coded responsive except for the newly loaded documents, which are not set (shaded gray).
15. Email Thread Visualization provides context as to where the newly loaded emails are within the con-
versation thread, and allows reviewers to focus in on these documents by clicking on the gray shaded
nodes to review and code the documents.
81.1 Requirements
n Analytics server setup
81.2 Overview
After running a Near Duplicate analysis, system admins should view the Textual Near Duplicate
Summary on the set’s Structured Analytics console, which breaks down the number of textual near
duplicate groups that have been identified, along with averages of percentage of similarity and of
the number of documents per similar document group.
Textual Near Duplicate Identification sorts the documents by size, from largest to smallest. This is
the order in which they are processed. The most visible optimization and organizing notion is the
principal document. The principal document is the largest document in a similar group and is the
document that all others are compared to when determining whether they are near duplicates. If
the current document is a close enough match to the principal document, as defined by the
Minimum Similarity Percentage, it is placed in that group. If no current groups are matches, the
current document becomes a new principal document. When the process is complete, only principal
documents that have one or more near duplicates are shown in groups.
When running the process, a Minimum Similarity Percentage is assigned. This parameter indicates
how similar a document must be to a principal document to be placed into that principal's group.
81.3 Directions
System admins should create a Textual Near Duplicates view for the review team.
1. In the Near Duplicate Identification view, add the following output fields:
nTextual Near Duplicate Principal - identifies the principal document with a "Yes"
value. The principal is the largest document in the duplicate group. It acts as an anchor
document to which all other documents in the near duplicate group are compared.
n Textual Near Duplicate Similarity - the percent value of similarity between the near
duplicate document and its principal document.
n Textual Near Duplicate Group - the identifier for a given group of textual near duplic-
ate documents.
2. Add a condition to only show documents where the Textual Near Duplicate Group field is set.
3. Set the following sort orders on the Near Duplicate Identification view to list the textual near
duplicate principals with the highest percentage of textual near duplicate similarity at the
top:
n Textual Near Duplicate Group - Ascending
n
Textual Near Duplicate Principal - Descending
n Textual Near Duplicate Similarity - Descending
82.2 Directions
1. Export data from the original database.
2. Create a production, burning in redactions.
3. Export the production with single-page TIFFs and an OPT file.
4. Export either native files, multi-page TIFFs, or PDFs from the original application with the path in a DAT
file.
5. Import the data into the new database.
Note: Load native files/multi-page image files as native files with metadata.
6. Import the production from your original application as single-page images into Relativity . Use redac-
ted versions of images for production unless changes need to be made. A user can then delete the
image and create it again from the clean native version.
83.1 Requirements
n Structured Analytics for Near Duplicates including new and reviewed documents
n Workspace access
o Saved searches
o Structured Analytics
84.2 Directions
To create saved searches for your conflict check:
1. Create a saved search to find what we do want: responsive items and their family items.
a. Set Name to Responsive and Family.
b. Include related group and Family.
c. Set Scope to Entire Workspace.
d. Set Conditions to Responsive, Operator to any of these, and Value to Responsive under Condi-
tions.
e. Add any Fields you find useful, but this is not the search that will display at the end.
2. Create a saved search to find items without coding or anything that is NOT set to Responsive.
a. Set Name to Conflicts.
b. Set Scope to Entire Workspace.
c. Create a Logic Group and add:
a. Set Condition to Responsive, and Operator to is not set.
b. Add an additional Condition, and select OR.
c. Set second condition to Responsive any of these, and Value to Not Responsive under
Conditions.
d. Add Condition of (Saved Search) document is in Responsive and Family.
This search will find items that are not coded or are not responsive that are family members for
Responsive items. This can be used in a variety of ways to find differences between populations.
Change the related items group from Family to Near Duplicates to find overlap between groups of data
that are near duplicates.
Using the dtSearch alphabet file, system admins can take originally ignored words and make them
searchable. However, dtSearch contains some characters and punctuation characters that you can't
make searchable.
85.1 Requirements
n Workspace access
o Edit, delete access on search indexes
o Edit, delete access on searches
85.2 Directions
Relativity references the default list of stop words each time you create a new index. System admins
can't edit stop words in keyword searches. The default stop word list consists of punctuation marks,
single letters and numbers, and the following words:
Note: The following are restricted from dtSearch in Relativity. These are reserved noise words and
characters which will continue to behave as operators as well as being noise words:
Relativity restricts the following from dtSearch. The following noise words and characters continue
to behave as operators: and, or, not, to, contains, xfirstword, xlastword, ", ( )< *, ?, %, @, ~, #, &, :,
=.
85.2.3.2 Spaces
Note: You must have valid Relativity Community credentials in order to download any Community file
linked to from the documentation site. You'll need to enter those credentials on the Community login
screen if you're not already logged in. If you're already logged in to the Community at the time you click a
link, the file is automatically downloaded in the bottom left corner of your screen. If you get an error
message stating "URL No Longer Exists" after clicking a Community link, it may be due to a single sign-on
error related to the SAML Assertion Validator, and you should contact your IT department.
dtSearch defines a space character as a character that causes a word break. Relativity doesn't index
these characters, so you can't search them. By default, dtSearch treats the following characters as
spaces:
\09\0a\0c\0d !@"#$&'()*+,./:;<=>?[\5c]^`{|}~
Values listed as \## are Unicode characters. Their definitions are:
n \09 - horizontal tab
n \0a - line feed
n \0c - form feed
n \0d - carriage return
n \5c - backslash (\)
85.2.3.3 Ignored
dtSearch defines an ignored character as a character that's ignored when processing text. You can't
search these characters. By default, dtSearch ignores the following characters:
\08 %
\08 is the backspace character in Unicode.
85.2.3.4 Hyphen
dtSearch defines hyphens as characters that receive special processing in dtSearch. By default,
dtSearch only classifies the - character as a hyphen.
2. Delete & from the [Spaces] string, so dtSearch doesn't interpret it as a space (word break) in the index.
86.2 Directions
Using production sets you can stamp documents with various pieces of information. Typically, there
is one field selected from the drop-down for each stamp location (e.g., the right footer or left
header). However, if you want to produce with more than one piece of metadata information, you
can to use tokens to customize your stamps.
You can use tokens for any data field and you can use them in combination with static text and/or
other tokens. The syntax to use a token is {!Field:ArtifactID!}. You can find the ArtifactID for a field
by adding the ArtifactID field to your Fields view.
An example of combining static text and a token would be: Custodian: {!Field:1036657!}. In this
example, Custodian is the static text and acts as a label for the token, the custodian field.
An example of combining multiple tokens would be: {!Field:1036657!} {!Field:1037086!}
{!Field:1037082!}. In this example, you are branding a string of three fields together on an
image.You can use hard returns or spaces between the tokens. You can also add separators, such as
carriage returns, hyphens, or commas, between the fields.
87.2 Requirements
n Folder paths less than 200 characters
87.3 Directions
1. Create a new field named Folders and open to the Field Tree.
2. Use a load file with slashes between folder names.
3. Import into the multiple choice field. This loads into the field with semi-colons separating items.
The appearance of folders and folder choices on the field tree look very similar with the exception
of the icon.
88.2 Directions
1. Publish or load documents into a workspace.
2. Set up a saved search that returns parent level emails and parent level loose documents. The duplicate
status field will be set on the parent level documents, and then family members will be included with
the parents, as described below.
n If using Relativity Processing, parent documents can be identified by including a search condition
where the Level field equals 1.
3. Create a single choice Duplicate Status field with 3 choices: Unique, Master, and Duplicate.
4. Run the Update Duplicate Status script:
n Use the saved search described above.
n Use the duplicate status field created above.
n For the Relational Identifier, select a file duplicate hash field. If using Relativity Processing to load
data, the Processing duplicate hash field can be used.
n Leave Duplicate Sort Order blank, unless you have set up a Custodian field to maintain a custom
priority sort order. When this field is blank, the system sorts on document artifact ID, so the first
document loaded in the workspace becomes the Master document when duplicates are iden-
tified.
n After the script runs, the duplicate status will be populated for top level, parent documents.
n At this point, you have a couple of options for including family members:
n Option 1 - Create a search that returns all documents where the duplicate status is set to
Master or Unique, and include family.
to return each status, include family, and then use a mass edit to update the child
documents with the status value of the parent.
o After copying the parent duplicate status value to children, create a search that
returns all non-duplicates. Family does not need to be included in this search, since
all family members will now have a duplicate status value.
n Make the non-duplicate documents identified in the above steps available to reviewers.
89.2 Requirements
n Relativity 7.3 or higher
n Relativity Web Client (ensure you have the latest version)
n Workspace Access
o Add fields
o Add layouts
o Script Admin
o View image
o Viewer
89.3 Directions
1. Create the fields to be used with the Relativity Sample Script, which are added to the Image QC (IQC) lay-
out. Use IQC_ or another designation in front of the field names to assist with filtering for IQC-related
items. In addition, you may want to secure these fields so that reviewers do not see them. Some sug-
gested fields are:
n IQC_Sample
o Field Type - Yes/No (this will be used for the Sample Script)
n IQC_Color
o Field Type - Single Choice
o Choices
l Add
l Remove
n IQC_Excel Limit Pages
o Field Type - Yes/No
n IQC_Excel Page Size
o Field Type - Single Choice
o Choices
l Letter (8.5 x 11)
l Letter (11 x 8.5)
l Tabloid (11 x 17)
l Ledger (17 x 11)
n IQC_Excel Print Area
o Field Type - Single Choice
o Choices
l Original
l Ignore
2. After you create your IQC fields, build a layout that has all of these fields available. Use the IQC_ naming
convention to maintain organization. Add your fields according to your needs.
3. Create a view that contains all your IQC_fields and the document type, if you have it. You will filter on
these fields to re-image documents, if necessary.
4. Run the Sample Script to assemble a random sampling of your imaged documents to be inspected.
This step is not necessary if you choose to inspect all your documents.
n There are two options:
o Fixed number
o Percentage of sample universe
n When running the script, use the newly created IQC_Sample field as your Sample Indicator field.
n After running the script, create a Saved Search for your Sample Set to QC that consists only of
the condition where the value for IQC_Sample is YES. Name this search “IQC_Sample_Universe.”
Please note that if you did not use the Sample Script, you will not have to create this search.
5. To verify images, launch the thumbnail viewer and the standalone viewer. Thumbnails are located at
the lower-right corner of the viewer.
a. If you do not see the option for thumbnails, update your viewer to the latest version. You can
download a new version of the viewer from the Workspace Details tab.
b. Launch the standalone viewer and use the Viewer Mode, located in the upper-right corner. Using
both of these will allow you to compare the image to the native. If using multiple monitors,
move this to your second monitor.