This Page

has been moved to new address

Early Case Assessment (ECA): From Key Word Search to Automating Document Relevance

Sorry for inconvenience...

Redirection provided by Blogger to WordPress Migration Service
----------------------------------------------------- Blogger Template Style Name: Snapshot: Madder Designer: Dave Shea URL: mezzoblue.com / brightcreative.com Date: 27 Feb 2004 ------------------------------------------------------ */ /* -- basic html elements -- */ body {padding: 0; margin: 0; font: 75% Helvetica, Arial, sans-serif; color: #474B4E; background: #fff; text-align: center;} a {color: #DD6599; font-weight: bold; text-decoration: none;} a:visited {color: #D6A0B6;} a:hover {text-decoration: underline; color: #FD0570;} h1 {margin: 0; color: #7B8186; font-size: 1.5em; text-transform: lowercase;} h1 a {color: #7B8186;} h2, #comments h4 {font-size: 1em; margin: 2em 0 0 0; color: #7B8186; background: transparent url(http://www.blogblog.com/snapshot/bg-header1.gif) bottom right no-repeat; padding-bottom: 2px;} @media all { h3 { font-size: 1em; margin: 2em 0 0 0; background: transparent url(http://www.blogblog.com/snapshot/bg-header1.gif) bottom right no-repeat; padding-bottom: 2px; } } @media handheld { h3 { background:none; } } h4, h5 {font-size: 0.9em; text-transform: lowercase; letter-spacing: 2px;} h5 {color: #7B8186;} h6 {font-size: 0.8em; text-transform: uppercase; letter-spacing: 2px;} p {margin: 0 0 1em 0;} img, form {border: 0; margin: 0;} /* -- layout -- */ @media all { #content { width: 700px; margin: 0 auto; text-align: left; background: #fff url(http://www.blogblog.com/snapshot/bg-body.gif) 0 0 repeat-y;} } #header { background: #D8DADC url(http://www.blogblog.com/snapshot/bg-headerdiv.gif) 0 0 repeat-y; } #header div { background: transparent url(http://www.blogblog.com/snapshot/header-01.gif) bottom left no-repeat; } #main { line-height: 1.4; float: left; padding: 10px 12px; border-top: solid 1px #fff; width: 428px; /* Tantek hack - http://www.tantek.com/CSS/Examples/boxmodelhack.html */ voice-family: "\"}\""; voice-family: inherit; width: 404px; } } @media handheld { #content { width: 90%; } #header { background: #D8DADC; } #header div { background: none; } #main { float: none; width: 100%; } } /* IE5 hack */ #main {} @media all { #sidebar { margin-left: 428px; border-top: solid 1px #fff; padding: 4px 0 0 7px; background: #fff url(http://www.blogblog.com/snapshot/bg-sidebar.gif) 1px 0 no-repeat; } #footer { clear: both; background: #E9EAEB url(http://www.blogblog.com/snapshot/bg-footer.gif) bottom left no-repeat; border-top: solid 1px #fff; } } @media handheld { #sidebar { margin: 0 0 0 0; background: #fff; } #footer { background: #E9EAEB; } } /* -- header style -- */ #header h1 {padding: 12px 0 92px 4px; width: 557px; line-height: 1;} /* -- content area style -- */ #main {line-height: 1.4;} h3.post-title {font-size: 1.2em; margin-bottom: 0;} h3.post-title a {color: #C4663B;} .post {clear: both; margin-bottom: 4em;} .post-footer em {color: #B4BABE; font-style: normal; float: left;} .post-footer .comment-link {float: right;} #main img {border: solid 1px #E3E4E4; padding: 2px; background: #fff;} .deleted-comment {font-style:italic;color:gray;} /* -- sidebar style -- */ @media all { #sidebar #description { border: solid 1px #F3B89D; padding: 10px 17px; color: #C4663B; background: #FFD1BC url(http://www.blogblog.com/snapshot/bg-profile.gif); font-size: 1.2em; font-weight: bold; line-height: 0.9; margin: 0 0 0 -6px; } } @media handheld { #sidebar #description { background: #FFD1BC; } } #sidebar h2 {font-size: 1.3em; margin: 1.3em 0 0.5em 0;} #sidebar dl {margin: 0 0 10px 0;} #sidebar ul {list-style: none; margin: 0; padding: 0;} #sidebar li {padding-bottom: 5px; line-height: 0.9;} #profile-container {color: #7B8186;} #profile-container img {border: solid 1px #7C78B5; padding: 4px 4px 8px 4px; margin: 0 10px 1em 0; float: left;} .archive-list {margin-bottom: 2em;} #powered-by {margin: 10px auto 20px auto;} /* -- sidebar style -- */ #footer p {margin: 0; padding: 12px 8px; font-size: 0.9em;} #footer hr {display: none;} /* Feeds ----------------------------------------------- */ #blogfeeds { } #postfeeds { }

Tuesday, June 23, 2009

Early Case Assessment (ECA): From Key Word Search to Automating Document Relevance

Just before the Christmas holidays in 2007 I was teaching a CLE on the “Changes to the Federal Rules of Civil Procedure (FRCP)”. There had to have been 150 litigators, litigation service providers and consultants in the room listening to me preach about this unchartered new world called eDiscovery. My initial morning session was an overview of the current status of paper based litigation processing, how the accelerating increase in the volume of Electronically Stored Information (ESI) was steaming down the eDiscovery tracks like an out of control locomotive, how the changes to the FRCP would effect us all and what they all needed to know that day to get ready. One of topics that I touched on was the need to become more familiar with key word search methodology and the new technologies that would enable the legal industry to better manage the process of finding relevant data and assessing cases early on in the eDiscovery lifecycle. The reason that I am recalling this CLE seminar was because I will always remember the comments from an older litigator that packed up his belongings during the first break, walked up to the front and announced that, “all of this fancy new technology was never going to replace good old fashion legal hard work, understanding the relevant facts and key information for case wasn’t as complicated as I was making it sound and therefore he wasn’t going to stick around for the rest of my session.”

Well, the remainder of 149 attendees stayed and I hope that they all walked away with a better understanding of the importance of understanding key word search methodology and the new technologies that would enable the legal industry to better manage the process of finding relevant data and assessing cases early on in the eDiscovery lifecycle. Over the subsequent 18 months, key word became a big issue in the eDiscovery lifecycle and spawned a whole new technology arena called Early Case Assessment (ECA) that focused on reducing the amount of “relevant” data that had to ultimately be reviewed by a lawyer. And, having just completed another CLE this past week on ECA, I think that “most” eDiscovery professionals understand the keyword search methodology and the new technologies that enable them to better manage the process of finding relevant data and assessing cases early on in the eDiscovery lifecycle.

However, just went we thought we had it had it all figured out, a study from the Text Retrieval Conference (TREC) indicated that the keyword method tends to miss most of the relevant documents, while yielding mainly irrelevant documents. As a result, only a fraction of the relevant documents make it to the detailed review stage, while most of the documents that are submitted to review are in fact not relevant.

Moreover, the study also found that the keyword approach is typically binary, meaning that documents are either included or not. There is no graduated scale of relevance. This rigid approach does not allow for relative ranking of documents, making it extremely difficult to manage and prioritize document review.

However, as with any market that is going through a paradigm shift looking for its center and trying to normalize on some standards, the litigation technology vendors have been all over the keyword search issue and are starting to release their new solutions into production. One of the very first players in the industry to address the issue of document relevance is Equivio, a leading provider of near de-duping and email thread management technology. They have just launched Equivio>Relevance™, an expert-guided system that enhances the eDiscovery process through automated document prioritization.

Getting back to the comments of my lawyer friend that walked out of my CLE back in 2007, fancy new technology may never replace good old legal hard work. However, in today’s new world of Terabytes of potential evidence in even some of the small matters, we need all the technical help that we can get. And, it appears that Equivio is stepping up and offering us all at least a fighting chance to find the documents that we need successfully mange our cases.

The Full Text of Equivio’s Press Release is as follows:

Kensington, MD, June 22, 2009 – Equivio, a provider of software for managing data redundancy, announced today that it has launched Equivio> Relevance™, an expert-guided system that enhances the eDiscovery process through automated document prioritization.
Traditionally, attorneys use keywords to pre-filter documents prior to detailed review. According to the TREC (Text Retrieval Conference) studies, the keywords method tends to miss most of the relevant documents, while yielding mainly irrelevant documents. As a result, only a fraction of the relevant documents make it to the detailed review stage, while most of the documents that are submitted to review are in fact not relevant.

Moreover, the keywords approach is typically binary, meaning that documents are either included or not. There is no graduated scale of relevance. This rigid approach does not allow for relative ranking of documents, making it extremely difficult to manage and prioritize document review.

Equivio>Relevance™ is designed to address these limitations, introducing a higher level of flexibility, control and accuracy into the eDiscovery process. Based on initial input from a lead attorney, Equivio>Relevance uses statistical and self-learning techniques to calculate graduated relevance scores for each document in the data collection. Equivio>Relevance also uses a statistical model to calculate the precision and recall achieved by the software. This statistical model is used to provide a new level of measurability and control in the eDiscovery arena, while also helping to ensure the defensibility and transparency of the process.

Equivio>Relevance drives value throughout the eDiscovery flow through:

  • Early case assessment: Equivio>Relevance facilitates rapid assessment of the key issues and concepts in a case.
  • Culling: Equivio>Relevance achieves high levels of recall and precision, helping overcome the challenges of over and under-inclusion that characterize traditional keyword methods.
  • Review prioritization: By organizing the review set according to relevance rankings, Equivio enables prioritization of document review. This allows attorneys to immediately focus on the most relevant documents.
  • Review quality assurance: By identifying discrepancies in the responsiveness designations of Equivio>Relevance vis-à-vis the human review team, the application helps find responsive documents missed in the detail review. Similarly, the discrepancies can be used to locate documents incorrectly marked by the human review team as responsive.
Equivio>Relevance also generates a list of keywords that characterize relevant documents in the collection. These automatically-generated keywords can be used to supplement and enhance the manual list of keywords built by the legal team.

"Equivio is committed to developing and delivering innovative technologies that will help litigators improve the quality and consistency of their eDiscovery processes," said Amir Milo, CEO of Equivio. "Equivio>Relevance™ enables attorneys to review fewer and more relevant documents, lowering review costs and reducing the risk of missing key data."

About Equivio

Equivio enables the management of data redundancy in content-centric business processes. Equivio's technology zooms in on unique data, allowing you to read less, think more, win big™. With products for grouping near-duplicates, capturing email threads and determining document relevance, Equivio powers a broad range of business applications, including eDiscovery, records management, email archiving, data retention and intelligence. To learn more about winning with Equivio, visit http://www.equivio.com/.

Labels: , , , , ,

0 Comments:

Post a Comment

Subscribe to Post Comments [Atom]

<< Home