e-Discovery Team ®

Blog by Ralph Losey on the team approach to electronic discovery combining the talents of Law, IT, and Science. The views expressed are my own, and not necessarily those of my law firm, clients or University. Copyright Ralph Losey 2006-2012.
  • Home
  • About
  • New Rule 16
  • 26
  • 34
  • 37
  • Duties
  • Hash
  • Interviews
  • Meta
  • School
 

Predictive Coding Based Legal Methods for Search and Review

March 25, 2012

Predictive coding type algorithms are designed to leverage the expertise of human input, preferably attorneys who are subject matter experts of the case at hand. A classification of one document by an attorney results in a recommended classification of hundreds, if not thousands of other documents that the computer identifies as similar. The computer examines the entire data set, the corpus, and predicts the probability of each document therein fitting within the same classification. The expert then tests and correct the predictions in an iterative process. The cost savings in this approach are obvious, particularly considering the high expense of attorney review time. Reviewers can break through the current linear review speed barrier of approximately 100 files per hour, to 1,000, or even 10,000 files per hour. These supersonic review speeds are what it takes to handle e-discovery today in an effective and economical manner.

Predictive coding software tracks and learns from human expert decisions in a feedback loop. This increases accuracy, allowing greater reliance and faster culling of irrelevant (or unresponsive) ESI. Only the ESI predicted to be relevant then needs to be reviewed by attorneys for final quality control verification before production. The extent and degree of the final human review for quality control  varies according to two considerations: proportionality constraints related to the value and importance of the case; and, the parties risk tolerance for disclosure of confidential or privileged information.

This kind of software features that I here refer to as predictive coding algorithms use what is known in information science as classification systems based on probability theories. The Sedona Conference Best Practices Commentary on the Use of Search & Information Retrieval Methods in E-Discovery, 8 Sedona Conf. J. 189, 218-221. (2007):

Probability theories are used to make decisions regarding relevant documents. The most prominent of these are so-called “Bayesian” systems or methods, based on Bayes’ Theorem, and the related  “dimension reduction systems.”

With predictive coding human review of all files is eliminated. The operative word here is all. As mentioned, review of some files by attorneys is still required for a variety of reasons. Primary among them are training of the artificial intelligence agents, and quality control verification before information is transferred to the requesting party. But all files will not be reviewed by human eyes. In fact, in some cases only a small minority may be subjected to human review. The percent of files not reviewed by experts, but instead only reviewed by artificial intelligence algorithms, will depend on the costs and time factors in the case — the proportionality analysis under Rule 26(b)(2)(B) — and the producing party’s risk tolerance.

Experienced, Skilled Attorneys Are Key to Successful Use of Predictive Coding Software

The input of human experts to train the artificial intelligence algorithms is essential and has a direct impact upon the quality of performance of the predictive coding. If, for instance, the expert is inconsistent in making initial calls on the relevance of documents, then the computer extrapolations based on these calls will also be inconsistent. It is the perennial computer situation of garbage in, garbage out. The quality of the predictions made by the computer is, in large part, based on the quality of the training, the input, provided by the human experts. Quality is, of course, also impacted by other factors, including the nature of the ESI under review, the number of iterative human expert input cycles, and the quality of the software itself. But any deficiencies in these quality factors can be detected and corrected by statistical random sampling of the final predictions. This is ultimately why the black box itself, i.w. exactly how the software works, is not really that important. The final quality controls of random sampling protect against any errors, including software errors.

Although random sampling protects against errors, and greater reliance is placed on these new technologies, the skills, or lack of skills, of attorneys managing a discovery project are still critical. Unskilled attorneys may, for instance, be inconsistent in coding of relevance over multiple documents. They may also make poor choices as to what documents should be considered as representative of highly relevant documents. Even errors in merely relevant coding can have cascading effects.

Predictive coding software models future choices after the documents submitted to it as representative. If the documents have not been properly evaluated or selected by attorneys to begin with, then the results of the automated analysis will also be flawed. If you show the computer what a smoking gun document actually looks like, then the computer will have a good chance to find it. But if you show it a poor imitation, not a real smoking gun document, then you should not be surprised when the computer only finds duds.

When I speak of skilled attorneys, I am not simply talking about attorneys who know how to operate a particular vendor’s software. I am talking about attorneys who really understand the particular issues of a case and the relative probative value of various documents. I am also talking about attorneys who understand the new iterative processes and legal methods to properly use the software.  The diagram below provides a high-level overview of these new processes that I recently prepared. I will go into greater detail about these new iterative predictive coding methods for ESI productions in a future blog.

Do not simply use in an unthinking manner the standard work flow methods that vendors set up as defaults for software. Also, if a vendor offers advice on legal methods and best practices (and good vendors know better than to do that), take it all with a big grain of salt. Vendors are not lawyers. E-discovery vendors are not allowed to provide legal services or legal advice. They are prohibited from doing so, even if the expert has a law degree and is licensed to practice law. It is even worse when a tech expert, one who has never even gone to law school, provides opinions on legal methods and what is reasonable and what is not. Vendors and techs are permitted to be advisers on technology and software functions, not the law. Legal methods must be determined by lawyers, not techs. Legal advice may only be provided by practicing attorneys in firms, or acting as solo practitioners. Still, unless you are already an expert in a particular kind of software or technology, it is a very good idea to consult with a software expert or bona fide information scientist in order to formulate your legal opinions. The multi-disciplinary team approach is required in e-discovery work.

Bottom line: you must never let the tail (the software) wag the dog (legal search). Attorneys must at all times remain in control and not abdicate their responsibility as legal advisers to non-lawyer tech experts, much less to software. In other words, even the best software is only as good as the lawyers who use it.

Proving a Negative

The inability of attorneys to uncover documents requested does not necessarily mean that the attorneys are unskilled or the software is defective (although it might). The template for the smoking gun document might be accurate, but nothing like it is found for the simple reason that none exist. This is a key point that should never be forgotten. It is also one reason that skill alone is not enough to know whether you have proven a negative, or have made a mistake. Art and experience are important in this area of the law, just like in any other area of the law. If there is no needle in the haystack to begin with, then no amount of skills, quality controls, or repetition in search will find one.

An experienced attorney gets a feel for the data under review and the people involved. After a while of studying a large data set like a multi-custodian collection of email they know when proceeding further is likely futile. The data examined allows for an accurate assessment of the probabilities of the data not yet examined. It looks like this may be a kind of intuitive application of Bayes’ theorem of conditional probabilities.

It is not uncommon for a requesting party to imagine or hope for the existence of emails or other documents that, like a Unicorn, simply do not exist. The requester is on a mere fishing expedition, and often a delusional one at that. If only a few relevant documents are uncovered by a search, that does not necessarily mean garbage in, garbage out. It could be treasure in, treasure out, as far as the responding party in concerned. The search was successful in proving that what was alleged to have happened, in fact did not. It is proof of a negative by failing to discover a positive. This does not require that every stone be turned as some still think. Probability of data is quite sufficient in the law in these circumstances for a number of reasons, including cost proportionality restraints and burden of proof requirements. 

The old adage “where there’s smoke there’s fire” is usually true, but the converse is also true. Moreover, sometimes there is just smoke and mirrors without any fire at all. It is just a conjured wrong that doesn’t really exist. Knowing the difference is where experience and art in law come into play. There is no artificial intelligence for that, although Bayesian probability analysis comes close. This is something that most software designers, vendors, and inexperienced attorneys do not fully appreciate.

Positive Learning Feedback Loop

Even the most highly skilled attorneys will make mistakes from time to time. All experienced attorneys know that. That is why the iterative processes of predictive coding are so valuable. When mistakes in input are made by the subject matter expert attorneys who create the first seed set in the predictive coding process, it is not a one-time, all or nothing procedure.

In predictive coding the linear processes are replaced by cyclic ones. A review by attorneys of the results returned by their initial input creates a positive learning feedback loop for both the software and attorney reviewers. They will quickly detect any errors in their analysis and make corrections accordingly. It is a way of repeatedly checking their work. If they get a return of duds, they know to improve the picture of the smoking guns documents they are looking for, or to look for something else. Proper use of predictive coding tools will in effect make the attorney reviewers smarter and more effective. This in turn allows better input from the attorneys and improves the computer’s efficiency in the next round. The iterative nature of the process allows the creation of powerful positive feedback loops.

The more advanced trans-keyword search methods include the following as described by one of the leading jurists in this area, Judge Paul Grimm:

In addition to keyword searches, other search and information retrieval methodologies include: probabilistic search models, including “Bayesian classifiers” (which searches by creating a formula based on values assigned to particular words based on their interrelationships, proximity, and frequency to establish a relevancy ranking that is applied to each document searched); “Fuzzy Search Models” (which attempt to refine a search beyond specific words, recognizing that words can have multiple forms. By identifying the “core” for a word the fuzzy search can retrieve documents containing all forms of the target word); “Clustering” searches (searches of documents by grouping them by similarity of content, for example, the presence of a series of same or similar words that are found in multiple documents); and “Concept and Categorization Tools” (search systems that rely on a thesaurus to capture documents which use alternative ways to express the same thought).

Victor Stanley, Inc. v. Creative Pipe, Inc., supra at FN 9 of the opinion. Also see videos of my short-hand explanations of predictive coding at LegalTech interviews: My Impromptu Video Interview at NY LegalTech on Predictive Coding and Some Hopeful Thoughts for the Future; and, Interview at Legaltech 2012 on predictive coding and transparency.

The current linear review and culling process derived from paper discovery, where one step follows after the last one is completed, should be replaced by a non-sequential, four-dimensional approach. Here all steps are pursued at once, to some degree, and are repeated over time (the fourth dimension) within predetermined budgetary constraints. This is a use of iterative cycles, where multiple decisions are made at the same time with computer assistance, but later verified and quality controlled by sampling. This is all done within case specific proportionality limits under Rule 26(b)(2)(C), Federal Rules of Civil Procedure and related case law. This  type of iterative project management may be new to the law, but is well established in technology development where diagrams like this are common.

Under the new procedures, the old style ad hoc quality control methods are replaced by judgmental and statistical sampling and other metric based quality control systems. The old school muddling management approach is replaced by proven project management techniques. See The Sedona Conference® Commentary on Achieving Quality in the E-Discovery Process; (identifies seven elements critical for successful project management: 1. Leadership.
 2. Tailoring. 
3. Expertise. 
4. Adaptability. 
5. Measurement.
 6. Documentation. 
7. Transparency.) The new procedures also require and depend on cooperation and disclosure. See eg. Animation Showing How Not To Cooperate In An E-Discovery Conference.

Six Search Ideas

In Secrets of Search Parts One, Two and Three, I outlined the five key characteristics of effective search today, using the rubric of secrets. In Part Three I summarized my ideas on search and review using the symbol of the Pythagoreans, the five-sided polygon, or pentagon:

With my blog, Bottom Line Driven Proportional Review  I added the sixth idea, where the process gets real and takes money into consideration. In that blog I shared my method to use estimation, projections, budget, cooperation, transparency, and the legal doctrine of proportionality to control the costs of search and review. This final piece competed the outline of a new gold standard of search and review:

Predictive coding is just one tool, one part of the six search ideas. It is part of the “hybrid multimodal,” one of many search techniques, including keyword, that make up multimodal, meaning multiple types of search techniques. Predictive coding is currently the best search tool to be sure, but it is just another high-priced hammer without a well-trained carpenter to use it, one who knows the new multidimensional, iterative, largely automated, cooperative methods. (By the way, have I mentioned yet that most software is priced too high? Come on vendors, go for quantity of sales by lowering the prices.)

Conclusion

The number of documents we have to review seems to double every two to three years, so this new six-fold legal method is imperative. New and better software, especially predictive coding type, is also important. The ranking of relevancy and other categories built into the latest algorithms is, under Bottom Line Driven Proportional Review, an especially helpful new capability. You rank the documents that the computer predicts will be the most relevant to your case and still fit within your budget. You plan an open proportional review, not an over-burdensome one.

Predictive coding is a powerful new tool, but technology alone is not enough. We must also have new legal methods. Technology and law have to work together, grounded in science, to create a new gold standard. The law, driven as it is to stop the run-away costs of e-discovery, is now ready to adopt these new ideas and methods. It is ready to change from a linear, confrontative, one-dimensional, mostly-manual, Bates stamp approach to discovery, to a multidimensional, cooperative, iterative, largely-automated, hash value approach.

Change is possible. Just keep working on it, a little bit every day, and then suddenly, a breakthrough. Once you break the ice, and get things started, the pace of change can quicken. New technologies like predictive coding and new legal methods like bottom line driven proportional review can be quickly accepted.

But we need your help to establish and explain these new ideas and methods to the Bench and Bar. We need more lawyers and other experts to implement the new approach. Remember the lessons of Zuckerberg’s success: Impactful, Fast, Bold, Open, Values: Guidance of the “Hacker Way”. Don’t wait any longer. Affirmance is on its way. Be bold, try the new methods, the new software, and don’t be afraid to be open about what you are doing. We need the impact of these values to quickly stem the tide of ESI overload and excessive costs.

As Margaret Mead said:

Never doubt that a small group

of thoughtful, committed, citizens

can change the world.

Indeed,

it is the only thing that ever has.

6 Comments | Evidence, Metadata, New Rules, Review, Search, Technology | Permalink
Posted by Ralph Losey


New Methods for Legal Search and Review

March 18, 2012

New systems of e-discovery are emerging that are designed for today’s digital world. Unlike most existing e-discovery systems, they are not mere adaptations of old paper discovery ways. The new methods use an entirely new collaborative approach and technologies, exemplified by predictive coding software. Although this paradigm shift in discovery is just starting, many of the contours of the new methods are already apparent.

In general terms, we are coming to realize that our current linear, confrontative, one-dimensional, largely manual, Bates stamp approach to discovery must be replaced with a multidimensional, cooperative, iterative, largely automated, hash value approach.

Computers and advanced technologies and communication systems created this problem for the law. They can now provide the solution. As the influential Bar group The Sedona Conference puts it:

A consensus is forming in the legal community that human review of documents in discovery is expensive, time consuming, and error-prone. There is growing consensus that the application of linguistic and mathematic-based content analysis, search and retrieval technologies, and tools, techniques and process in support of the review function can effectively reduce the cost, time, and error rates.

The Sedona Conference Best Practices Commentary on the Use of Search & Information Retrieval Methods in E-Discovery, 8 Sedona Conf. J. 189, 215 (2007).

Attorneys are beginning to embrace new technologies as indispensable tools around which new legal processes and procedures are built. They understand that expensive human review must be significantly curtailed and re-directed to the use of new software tools, not just keyword searching. The new tools include context and conceptual search software, advanced multidimensional indexing, and other new types of artificial intelligence based predictive coding search and review programs. See Eg.: Kevin D. Ashley, Will Bridewell, Emerging AI+Law Approaches to Automating Analysis and Retrieval of ESI in Discovery Proceedings, ICAIL 2009 Global E-Discovery/E-Disclosure Workshop.

Technology should not be mistaken as an end in itself, or a magic panacea. New software and other technology that is just plugged into the old paper search and review methods and processes, and old attitudes, is not the answer. A more fundamental change in legal practice is required. Quoting again from another important Sedona Conference commentary:

The legal profession is at a crossroads: the choice is between continuing to conduct discovery as it has “always been practiced” in a paper world — before the advent of computers, the Internet, and the exponential growth of electronically stored information (ESI) — or, alternatively, embracing new ways of thinking in today’s digital world. Cost-conscious clients and over-burdened judges are demanding that parties now undertake new approaches to solving litigation problems.

Baron and Burke, The Sedona Conference® Commentary on Achieving Quality in the E-Discovery Process; (May 2009) in the Executive Summary.

Many of our leading jurists, information scientists, academics, scholars, writers, and legal practitioners recognize that the old methods and attitudes that worked for paper no longer work for ESI. Victor Stanley, Inc. v. Creative Pipe, Inc., 250 F.R.D. 251 (D. Md. 2008) (Judge Grimm); Securities and Exchange Commission v. Collins & Aikman Corp., 2009 WL 94311 (S.D.N.Y., 2009) (Judge Scheindlin); Disability Rights Council of Greater Wash. v. Wash. Metro. Area Transit Auth., 2007 WL 1585452 (D.D.C. 2007) (Judge Facciola); United States v. O’Keefe, 2008 WL 449729 (D.D.C. 2008) (Judge Facciola); William A. Gross Const. Associates, Inc. v. American Mfrs. Mut. Ins. Co., _F.R.D._, 2009 WL 724954 (S.D.N.Y. 2009) (Judge Peck); Digicel (St. Lucia) Ltd & Ors v. Cable & Wireless & Ors, [2008] EWHC 2522 (Ch) (Justice Morgan) (UK decision). Moreover, scientific research has shown that keyword search alone is ineffective and multi-modal approaches that use keyword and other methods work far better. See:  Child’s Game of “Go Fish” is a Poor Model for e-Discovery Search;  The Multi-Modal “Where’s Waldo?” Approach to Search and My Mock Debate with Jason Baron; Secrets of Search: Parts One, Two, and Three.

The new legal methods for search and review take advantage of the computational properties of ESI. These new methods are not only tolerated by judges, they are encouraged, because judges understand that they allow a way out of the unnecessary disputes and expenses that the old methods perpetuate. The new methods and technologies are also encouraged by a recent Congressional action amending the Federal Rules of Evidence to include new Rule 502 on the protection of privileged information from inadvertent disclosure, a significant problem now because of the high volumes of ESI to review. The rule sometimes requires a producing party to take reasonable steps to protect privilege. The Advisory Committee Notes To Federal Rule Of Evidence 502
 state that:

… a party that uses advanced analytical software applications and linguistic tools in screening for privilege and work product may be found to have taken ‘reasonable steps’ to prevent inadvertent disclosure.

Also see: The Sedona Principles (2007) Principle 11:

A responding party may satisfy its good faith obligation to preserve and produce relevant electronically stored information by using electronic tools and processes, such as data sampling, searching, or the use of selection criteria, to identify data reasonably likely to contain relevant information.

Software has advanced to the point that computer review alone is at least as accurate as manual review, the so-called gold standard, and is getting better all of the time. Maura R. Grossman & Gordon v. Cormack, Technology-Assisted Review in E-Discovery Can Be More Effective and More Efficient Than Exhaustive Manual Review, Rich. J.L. & Tech., Spring 2011, at 48; Herbert L. Roitblat, Anne Kershaw, Patrick Oot, Document Categorization in Legal Electronic Discovery: Computer Classification vs. Manual Review (Electronic Discovery Institute 2009); Also see my prior blogs: Secrets of Search: Parts One, Two, and Three; and, The Legal Implications of What Science Says About Recall.

Furthermore, highly focused, multiple-phase production is becoming an accepted best practice norm, replacing the old paper ways of trying to discover all possible relevant writings in a single, broad, boilerplate production request and production. Covad Communications Co. v. Revonet Inc., 2008 WL 5377698 (D.D.C. 2008) (boilerplate paper forms still used to request ESI); Shannon Capone Kirk, Kristin Ali, The Need for Art and Surgery in Discovery, Chapter 16 of Losey, Electronic Discovery (West 2010).

You can't afford the truth!

The profession is beginning to understand that in today’s world of Too Much Information, the production of all relevant information is a practical impossibility. See eg. Rowe Entm’t, Inc. v. William Morris Agency, Inc., 205 F.R.D. 421, 423 (S.D.N.Y. 2002):

The more information there is to discover, the more expensive it is to discover all the relevant information until, in the end, “discovery is not just about uncovering the truth, but also about how much of the truth the parties can afford to disinter.”

Instead, the goal should be production of as many highly relevant documents as is proportionate to the value and significance of the case. Rule 26(b)(2)(c) Federal Rules of Civil Procedure; Sedona Conference Commentary on Proportionality (2010); The Sedona Principles (2007), 2nd Principle:

When balancing the cost, burden, and need for electronically stored information, courts and parties should apply the proportionality standard embodied in Fed. R. Civ. P. 26(b)(2)(C) and its state equivalents, which require consideration of the technological feasibility and realistic costs of preserving, retrieving, reviewing, and producing electronically stored information, as well as the nature of the litigation and the amount in controversy.

Also see Final Report on the Joint Project of the American College of Trial Lawyers and the Institute for the Advancement of the American Legal System (2009) (At page 7: “Proportionality should be the most important principle applied to all discovery.”) Also see my recent blog: Bottom Line Driven Proportional Review.

The new discovery methods rely heavily upon various types of sampling and other metric based quality control systems. Rule 34(a)(1), Federal Rules of Civil Procedure specifically allows sampling as a form of discovery and the official Rules Commentary confirms that this provision is intended to cover ESI as well as tangible objects. Also see In re Vioxx Products Liability Litigation, No. 06-30378, 06-30379 2006 W.L. 1726675, at *2 n.5 (5th Cir. May 26, 2006) (“By random sampling, we mean adhering to a statistically sound protocol for sampling documents.”); In Re Seroquel Products Liability Litigation, 244 F.R.D. 650 (M.D. Fla. 2007) (“Common sense dictates that sampling and other quality assurance techniques must be employed to meet requirements of completeness.”); Mt. Hawley Ins. Co. v. Felman Production, Inc., 2010 WL 1990555 (S.D. W. Va. May 18, 2010) (plaintiff’s attorney client privilege was lost because the plaintiff “failed to perform critical quality control sampling to determine whether their production was appropriate and neither over-inclusive nor under-inclusive.”); McPeek v. Ashcroft, 202 F.R.D. 31, 34 (D.D.C. 2001) (ordering initial limited search of backup tapes within dates determined to be most relevant, as a “rational starting point” for ESI search). Also consider the opinion of Kipperman v. Onex Corp., 260 F.R.D. 682 (N.D. Ga. 2009). The court required production of “sample” backup tapes to weigh the volume and importance of the information on the tapes against the costs of their restoration and production before deciding whether to permit additional discovery. After reviewing the results of the sample, the court held that the ESI contained on the tapes was highly relevant and so compelled further discovery saying: “I don’t . . . declare these to be smoking guns but they certainly are hot and they certainly do smell like they have been discharged lately.” Id. at 691.

New Cooperative Strategies to Litigation Discovery

We know that the new methods should be based upon open communications and sharing of information of preliminary findings with the requesting party. This is what George Paul and Jason Baron call “virtuous cycle iterative feedback loops.” Information Inflation: Can the Legal System Adapt? Supra at pgs. 32-36.

The parties need to exchange expert views and gain a common understanding of the relevance parameters. Scientific investigations on the effectiveness of search and review processes performed by the TREC Legal Track confirm this. Douglas W. Oard, Bruce Hedin, Stephen Tomlinson and Jason R. Baron, Overview of the TREC 2008 Legal Track. Studies also confirm that at least one meeting between opposing counsel to share initial review results, and consider alterations based on mutual disclosure and input, significantly increases the overall precision and recall of the project. Jason R. Baron, Douglas W. Oard, Feng C. Zhao, Improving Search Effectiveness in the Legal E-Discovery Process Using Relevance Feedback.

For this new system of open, iterative communication processes to work, lawyers must change their attitudes. The total adversarial approach to discovery, which is now prevalent in the United States, must be replaced by strategic cooperation with active judicial management where required. This is critical for the new iterative open process to work at maximum efficiency.

Cooperative discovery processes have long been required by the rules of professional ethics and procedure, but they were frequently misunderstood or ignored by attorneys succumbing to economic pressures to win at all costs. Losey, Lawyers Behaving Badly, 60 Mercer L. Rev. 983 (Spring 2009). They were also not well understood or enforced by the courts. Mancia v. Mayflower Textile Services. Co., 253 F.R.D. 354 (D.Md. Oct. 15, 2008). But this has changed with recent scholarship and experiences showing the importance of cooperation in e-discovery. The Case for Cooperation, 10 Sedona Conf. J. 339 (2009 Supp.) (Preface by Justice Breyer; lead article by Sedona Conference Working Group, edited by Bill Butterfield, Richard Braman, Ken Withers and others); The Bull’s-Eye View of Cooperation in Discovery, 10 Sedona Conf. J. 363 (2009 Supp.) (Professor Steven S. Gensler); Mancia v. Mayflower Begins a Pilgrimage to the New World of Cooperation, 10 Sedona Conf. J. 377 (2009 Supp.) (Ralph Losey).

Experience has shown that an over-contentious approach to discovery is a significant cause of many of the inflated costs of e-discovery. A strong movement is now underway in the U.S. to adopt a cooperative approach to discovery. It is led by The Sedona Conference and over 100 prominent judges and many more legal practitioners. See: Sedona Cooperation Proclamation, Sedona Conf. (2008). As Jason R. Baron, the past Co-Chair of The Sedona Conference® Working Group on Electronic Document Retention and Production, recently explained:

[T]he challenge is how best to reasonably (not perfectly) manage the exponentially growing amount of ESI caught in, and subject to, modern-day discovery practice. The answer lies principally in culture change (i.e., fostering cooperation strategies), combined with savvier exploitation of a range of sophisticated software and analytical techniques.”

Baron, J. Law In The Age of Exabytes, XVII RICH. J.L. & TECH. 9, at pg. 5 (2011).

Conclusion

Discovery of evidence and the legal analysis of relevancy and privilege determinations are at the heart of our legal system. They are essential to the common law evidence based system of justice. The methods and tools used in paper discovery did not work with the vast stores of digital information ubiquitous in the Twenty-First Century. That is why e-discovery became so expensive and riddled with mistakes. That is why completely new methods and tools emerged for digital discovery, which, in 2012, finally caught on. The last impediment of no judicial approval has been destroyed. The way is now clear.

The old linear methods of reading all papers are being replaced by iterative methods that include predictive coding. The new ways involve cooperative, multidimensional, cyclic approaches that focus on proportional, phased productions.

The stated goal of the new legal processes has also changed. The goal is now legally adequate recall, not complete recall. It is discovery of all highly relevant information, but not necessarily all relevant information. Adequacy is determined on a case-by-case basis as necessary to render justice and, at the same time, not unfairly burden the parties to litigation. It is based on reasonability and proportionality, not a delusional notion of perfection that, in fact, has never been required by the Law.

The leaders in e-discovery have already made this change and are embracing and refining these new methods. They are managers of complex technologies and iterative quality control processes. They lead an e-discovery team. This trend will continue because it works. It saves money and mitigates risks.

The old-school, linear, confrontative, one-dimensional, largely manual, Bates stamp approach to e-discovery has already been abandoned by all of the top experts in the field. The old ways have been replaced by a new nine-step process I described in Bottom Line Driven Proportional Review, or something close to that. The new methods are multidimensional, cooperative, iterative, and include predictive coding as part of a multimodal approach. The predictive coding software works, and, if it is properly used with the new legal methods, will be accepted by courts. See eg. Chris Dale’s Judge Peck’s Predictive Coding Opinion – reporting the reaction and the case and articles cited therein. The time to employ these new methods is now. You have nothing to lose but high expense and low recall.

As the famous Indian lawyer turned saint once said:

Be the change that you wish to see in the world.

Mahatma Gandhi

7 Comments | Evidence, Lawyers Duties, Metadata, New Rules, Review, Search | Permalink
Posted by Ralph Losey


Perspective on Legal Search and Document Review

March 11, 2012

For millennium writings were on paper. For centuries the legal profession depended upon writings, referred to in the law as documents, as the key evidence to resolve disputes in a fair and just manner. Losey, Mathematical Formula for Justice Proves the Importance of ESI in Civil Litigation, Chapter 4 of Electronic Discovery (West 2010). Paper documents were well-known and mastered by every lawyer and judge who swore an oath to uphold the law.

This all changed in a historical blink of the eye. In just one generation documents have dematerialized and transformed into a dizzying array of digital media.

New Age of Technology

Many see this as a much more profound cultural revolution than that precipitated by Guttenberg, which took centuries to play out, not decades. George L. Paul and Jason R. Baron, Information Inflation: Can the Legal System Adapt? 13 RICH. J.L. & TECH. 10 (2007). Paul and Baron’s well known article explains how writing co-evolved with civilization over the past 50 centuries or longer, with a slow but steady increase in information as our writing technologies slowly improved. They point out that this all changed about 25 years ago when Mankind invented a totally different form of electronic writing, free from physical confines, that triggered a Big Bang like explosion of a new Universe of virtually unlimited information. Paul and Baron predicted the legal profession will necessarily have to significantly change and adapt new strategies of practice to cope with this information revolution.

Documents originally created on paper still exist in our society, but are rare. Zubulake v. UBS Warburg LLC, 217 F.R.D. 309, (S.D.N.Y. 2003) (FN 5 cites Wendy R. Liebowitz, Digital Discovery Starts to Work, Nat’l L.J., Nov. 4, 2002, at 4 reporting that in 1999, ninety-three percent of all information generated was in digital form).

Most of the paper documents we see are merely printouts of one dimension (the text) of the original electronic information. The law recognizes this transformation and the Federal Rules of Civil Procedure were amended in 2006 to include electronically stored information (ESI) as information that can be discovered and used as evidence in lawsuits. Rule 34(a)(1) Federal Rules of Civil Procedure (2006). ESI is not specifically defined in the rules. The Rules Committee Commentary explained why:

The wide variety of computer systems currently in use, and the rapidity of technological change, counsel against a limiting or precise definition of electronically stored information. Rule 34(a)(1) is expansive and includes any type of information that is stored electronically.

Even without specific amendments to rules, all courts today, state and federal, treat ESI as potentially admissible evidence subject to discovery. The first Sedona Principle is now commonplace:

Electronically stored information is potentially discoverable under Fed. R. Civ. P. 34 or its state equivalents. Organizations must properly preserve electronically stored information that can reasonably be anticipated to be relevant to litigation.

Stressed Out Lawyers

The legal profession has been severely stressed by the rapid, ever-accelerating advances in technology. The changes in writing and resulting information explosion have been the key stressors. ESI is not only changing and evolving new into forms every year, but is now multiplying at an exponential rate that is almost beyond comprehension. See Eg., Rowe Entm’t, Inc. v. William Morris Agency, Inc., 205 F.R.D. 421, 429 (S.D.N.Y. 2002) (explaining that electronic data is so voluminous because, unlike paper documents, “the costs of storage are virtually nil. Information is retained not because it is expected to be used, but because there is no compelling reason to discard it”), aff’d, 2002 WL 975713 (S.D.N.Y. May 9, 2002); Data, Data Everywhere (The Economist, March 2010); Baron and Losey E-Discovery: Did You Know video.

Most lawyers are unfamiliar with ESI and the complex systems that store it. They prefer the familiar paper and alphabetical filing cabinets. They are paper lawyers living in a digital world.

The astronomical volumes and complexity of ESI has made the traditional process of legal discovery very expensive and burdensome. Many are calling this a crisis in our legal system that threatens our system of justice. Final Report on the Joint Project of the American College of Trial Lawyers and the Institute for the Advancement of the American Legal System (2009) at pg. 15:

Although electronic discovery is becoming extraordinarily important in civil litigation, it is proving to be enormously expensive and burdensome.

The old methods to review digital writings that lawyers used for paper are too expensive. Few can afford the time and effort required to locate, review, and produce all relevant evidence using those old methods. The costs and burdens that way often exceeds the value of entire case. There is a real danger that the resolution of disputes in a court of law based on both testimony and writings will be a luxury available only to the wealthiest parties. See eg. Justice Stephen Breyer’s Preface to Sedona Conference Journal, Vol. 10 Supplement, Fall 2009:

[The articles in this Supplement] suggest that if participants in the legal system act cooperatively in the fact-finding process, more cases will be able to be resolved on their merits more efficiently, and this will help ensure that the courts are not open only to the wealthy. I believe this to be a laudable goal, and hope that readers of this Journal will consider the articles carefully in connection with their efforts to try cases.

Even though many scholars, jurists, and practitioners recognize the problems created by the inability of lawyers to keep pace with technology, most law schools still only train students in paper evidence and discovery. Students graduate unprepared to handle the ESI where the truth of past events is now stored. William Hamilton, The E-Discovery Crisis: An Immediate Challenge to our Nation’s Law Schools, Chapter 39 of Electronic Discovery (West 2010); Shannon Capone Kirk, Kristin G. Ali, Teach Your Children Well: A Case for Teaching E-Discovery in Law Schools, Chapter 38 of Electronic Discovery (West 2010); Shira Scheindlin, Ralph Losey, E-Discovery and Education, Chapter 33 of Electronic Discovery (West 2010); Ralph Losey, Plato’s Cave: Why most lawyers love paper and hate e-discovery and what this means to the future of legal education, Chapter 32 of Electronic Discovery (West 2010).

The law remains as dependent as ever upon documents to prove the truth, but the vast majority of lawyers are untrained and unprepared to handle the electronic documents upon which the world is now built. E-Discovery Competence is a Fundamental Ethical Challenge Now Faced by the Legal Profession, Chapter 34 of my book Electronic Discovery (West 2010). In fact, most lawyers, even those who specialize in litigation, dislike e-discovery and try their best to avoid it. See my blogs: Tell Me Why?, and Spilling the Beans on a Dirty Little Secret of Most Trial Lawyers. Lawyers are trained and prepared instead to handle paper documents following systems developed in the 20th Century.

Old 20th Century Methods

These old paper-based legal search and review methods are one dimensional and linear in nature. They typically follow a sequential Bates stamp organizational model created in the 1890s. The simple paper evidence discovery processes worked pretty well for decades before computers. It should be noted, however, that even before technology moved away from  paper typing machines to computers in the 1980s, the discovery processes were already severely taxed by the growing volumes of paper documents generated from the 1960s forward. The increase in paper volume was caused by another technological innovation, the photocopy machine and by ever more complex transactions. Still, the legal profession coped somehow for the rest of the Twentieth Century. Lawyers added more numbers to the Bate stamps and used larger teams of lawyers and paralegals to manage the additional papers. They were still on familiar ground.

The linear systems developed in the 19th and 20th Centuries for the discovery and production of documents continue to be used today by most attorneys for both ESI and paper discovery. See Eg. the D’Onofrio saga of four opinions by Judge Facciola which describes the processes used in this case and his many orders resolving disputes, including an order requiring production of a sample of the 9,413 documents listed on the privilege log. D’onofrio v. SFX Sports Group, Inc. 247 F.R.D. 43 (D.D.C. 2008); D’onofrio v. SFX Sports Group, Inc. 254 F.R.D. 129 (D.D.C. 2008). The cases are described in Chapter 27 of my book Electronic Discovery (West 2010). This use of old paper based systems with today’s ESI is a big mistake and the primary reason e-discovery is so expensive.

The Paper Derived Processes and Methods for Search and Review Did Not Work When Applied to High Volumes of ESI

The old linear review methods involved serial culling of documents down to a final production set. The process generally required multiple reviews of the same document for different purposes. It was inefficient. It was expensive. Moreover, the quality control of human eyes on paper did not work with high volumes of documents. This is shown by the latest scientific experiments where the agreement rate among professional legal reviewers was found to be just less than 50%. Cormack, Grossman, Hedin, Oard, Overview of TREC 2010 Legal Track (February 21, 2012).

This tradition of multiple manual reviews, with only limited computer assistance, typically on a linear based review platform, still continues today. But it is too inexpensive and inefficient with high volumes of ESI. This will only get worse as the amount of information continues to grow exponentially. Jason Baron, the Head of Litigation for the National Archives and Records Administration, which is in charge of all federal records, including white house email, explains this as a problem of scale. He projects the number of White House emails will soon exceed One Billion. He estimates it would cost over $2 Billion to search that many emails. That assumes a team of one hundred full-time lawyers working over fifty-four years and a very low billing rate of $100 per hour. Information Inflation: Can the Legal System Adapt? Supra at pgs. 13-15. It also assumes computer-assisted review tools, but following the old paper-based linear review models. Also see: Baron, Jason, E-discovery and the Problem of Asymmetric Knowledge (Presentation at the Mercer Law School Ethics in the Digital Age Symposium, Nov. 2008).

Moreover, too many mistakes are being made when these traditional linear review methods are applied to the astronomical volumes and new medias of ESI. See Eg.: Diabetes Centers of America, Inc. v. Healthpia America, Inc., 2008 U.S. Dist. LEXIS 8362, 2008 WL 336382 (S.D. Tex. Feb. 5, 2008):

Plaintiff’s counsel conceded at the hearing that the task of searching Plaintiff’s records for relevant emails in response to Defendants’ discovery request was entrusted to a junior associate. It is apparent that the associate worked with little or no direction or supervision. The search terms used by the associate were inadequate — they did not even include the term “phone” — and, as a result, she failed to locate or perceive the significance of the emails about which Defendants now complain.

Also see Danis v. USN Communications, Inc., 2000 WL 1694325 (N.D. Ill. 2000) ($10,000 fine imposed against CEO personally when the young general counsel he hired to supervise ESI preservation was grossly negligent); Mt. Hawley Ins. Co. v. Felman Production, Inc., 2010 WL 1990555 (S.D. W. Va. May 18, 2010) (serious mistake made resulting in waiver of privilege in spite of sophisticated counsel with very elaborate processes and safeguards).

Outsourcing Alone is Not the Answer

Some are looking for an answer to these expense issues by keeping the old processes, but outsourcing the work of manual review to less expensive contract lawyers. They are called “contract lawyers” because the law firm that represents the client does not employ them. Instead, they work for the firm or some other company under a contract to do only review work. These contract lawyers may be located in India or other countries, or may be down the street from your office, or down the hall.

Even if you accept the abilities of contract lawyers to adequately perform the task of the first level relevance review (which I do, with qualifications), this is still just a stopgap measure. The volumes of ESI are increasing exponentially. Outsourcing may help with expenses to a point, but is still futile as a stand-alone long-term strategy, still just treading water in the midst of a flood. Indeed, when this outsourcing strategy was employed by the Department of Justice to try to reduce the costs of a privilege review, it still cost $9.09 per file to do a review at a total cost of $6,000.000. In re Fannie Mae Securities Litigation, 552 F.3d 814, (D.C. App. Jan. 6, 2009). The government reviewed 660,000 files at a cost of $6,000,000 using contract lawyers to respond to a non-party subpoena. The Order denying the government’s motion for cost-shifting was upheld by the appellate court.

The answer does not lie in modifying the system somewhat to employ cheap labor to do manual review. Outsourcing may still be part of the answer, and is likely to remain as a viable long-term strategy. But outsourcing only works when combined with the new systems and technologies like predictive coding, and only when properly supervised and performed by highly skilled professionals. The answer is not outsourcing. The answer is a whole new system for e-discovery.

Conclusion

Our current linear, confrontative, one-dimensional, largely manual, costly, Bates stamp approach to discovery must be replaced with a cooperative, iterative, largely automated, predictive coding based, proportionality cost-controlled, hash value approach.

For more on hash values and e-discovery, see: Losey, R., HASH: The New Bates Stamp, 12 Journal of Technology Law & Policy 1 (June 2007). For more on cost control see my blog Bottom Line Driven Proportional Review. For more on the new approaches in general and predictive coding in particular, see eg. Chris Dale’s Judge Peck’s Predictive Coding Opinion – reporting the reaction and the case and articles cited therein.

4 Comments | Evidence, Lawyers Duties, Metadata, New Rules, Review, Search, Technology | Permalink
Posted by Ralph Losey


LegalTech Interview of Dean Gonsowski on Predictive Coding and My Mission to Make Predictive Coding Software More Affordable

March 9, 2012

I highly recommend you watch this interview of an expert on e-discovery search whom I respect, Dean Gonsowski, Symantec eDiscovery Counsel. Dean is a former practicing attorney and general counsel who is now an expert on technology for Clearwell-Symantec. He has a good handle on the big picture of predictive coding type technologies, not only in litigation, but also in corporate information governance. The interview is well-done by Don McLaughlin, CEO of Falcon Discovery.

Stay to the end of the interview and learn about the possible use of predictive coding as an email management tool to file your email. I can just see the commercials of the near future! Let your computer do your filing for you. That way the computer can more easily retrieve your email when you want it. Manual self-filing will become a thing of the past, freeing up hours of time for the average information worker. I am all for that vision of the future. This is outlined by Dean, and so is the use of predictive coding in litigation. Dean talks at length, as did I at the time, about there being no real need to wait for a big decision approving predictive coding. We all used keyword search without an opinion approving its reasonability, so why should these new search technologies be different? Dean, like everyone at the time, pointed out that judicial approval of predictive coding might never come, or its coming could take years.

Dean is one of the key e-discovery people from Clearwell who are now helping to make Symantec a more dynamic, information-driven company. Their blog, e-discovery 2.0: thoughts about the evolution of e-discovery, is one of my favorites. They have new software called Clearwell Transparent Predictive Coding that will come out in a few months (price to be announced). I saw a private demo of it at LegalTech from the top people of Clearwell-Symantec. I told them then what I tell all of the vendors, bring the prices down! I will keep beating that drum so that predictive coding culling is affordable in all sizes of cases. It is the ideal tool to implement my Bottom Line Driven Proportional Review method. 

In the video interview above Dean Gonsowski gives a new perspective on technologies and legal methods that goes back to Dean’s early days in e-discovery in 2003. Poor Dean never gets much of a chance to talk when I appear on panels with him, which I have done now for Clearwell for the last two years at LegalTech. He is usually graciously allowing his panelists to speak while he asks questions. When the tables are turned he has many interesting things to say, as this You Tube video shows. Frankly, I have never heard Dean speak for so long as in this video.

As a final note you might want to calendar a free webinar I’m doing for Symantec and Inside Counsel on March 22, 2012 at 2:00 ET. It is called Is Keyword Search in eDiscovery Dead? I’ll answer that question, talk about the old days of endless games of Go Fish, some of my new search methods, and the Kleen Products case, but not the other case. Presenting with me and trying to get a word in edgewise is Matt Nelson, eDiscovery Counsel for the Symantec Intelligent Information Group.

I wonder who counsel is for their Dumb Information Group? Glad I’m not presenting with that guy! Anyway, did I mention it’s free? So give it a listen and ask a question or two, such as, what will your predictive coding software cost when it comes out, and can you make it less expensive? I will cheer you on, and even if you don’t ask (or, more like it they don’t read your question), I will keep on trying to influence all of the software leaders.

I want to make predictive coding software an affordable, everyday item. Now that I’ve helped to open the door, I want as many people as possible to be able to walk through. With the right methods to use this tool, a new world of affordable e-discovery awaits.

3 Comments | Evidence, Forensic Exam, Lawyers Duties, Metadata, New Rules, Review, Search, Technology | Permalink
Posted by Ralph Losey


My CLE event this week with the e-Discovery Triumvirate: Judges Facciola, Grimm and Peck; and my upcoming battle at Legal Tech with Craig Ball

January 22, 2012

I took part in a day-long CLE this week in Charlotte, North Carolina, where the big-three of e-discovery all attended, United States Magistrate Judges Paul Grimm, John Facciola and Andrew Peck. It was great to see the Triumvirate in action, even though I did suffer some friendly abuse from their rulings in the mock court proceedings we set up. Of course, the other side’s repeated reference to me as Sanctions Losey did not help. Yes, I got to play the role of a plaintiff’s lawyer in a mock non-compete case. I’m still waiting for my contingency fee.

A CLE All About Rule 26(f) Conferences

The entire one day event was devoted to Rule 26(f) conferences, where we role-played with frequent Office-type out-takes to explain to the 175 in attendance what we were doing and why. We started with a conference where we did it the wrong way, and then we role-played it again, but this time we did it right (or at least better). This is the kind of in-depth, in-person CLE that I like, where a full day is devoted to a single topic. It was also terrific to have instant commentary by some of the best e-discovery judges in the country.

The local judges, attorneys, corporate counsel, vendors, and techs who took part in the event were very good too, and so was my old friend Sonya Sigler, now with SFL Data, who came all the way from San Francisco. The addition of top local Magistrate Judges David Keesler and Shiva Hodges, shown right, made the event even more interesting for the local Bar in attendance. (By the way, can you guess what judge’s learned hand is also shown in this picture?) These are really good judges who took the opportunity to learn from the Three Wise Men from out-of-town. Occasionally, they even seemed to listen to me! Charlotte is in good hands.

The defense team of local lawyers gave me a real run for the money. Of course, that same team got to dream up the convoluted fact scenario they pegged me with. They had us role-playing a law firm crazy enough to sue our small client’s former employer, a corporate giant, to seek a declaratory judgment on the invalidity of the non-compete they signed before leaving with big customers in tow. They also had us suing the Goliath corporation for trade disparagement, which gave me some fun with the scenario, as we looked for the key words “low dirt.” That was a term we learned the bad defendant was using to disparage our clients.

Yeah, I played with the facts too once I saw the set-up, and like any good plaintiff’s lawyer, waived my arms around a lot and demanded preservation of everything. I even tried crying once when describing my client’s persecution, but I’m a little out of practice in doing that. Still, the audience-students seemed to have a good time with the role-playing and interactions. I certainly got more than my usual share of laughs, and questions, which is how I usually judge the success of my performance-teaching. The mock trial type scenario, if done right, can be a lot more entertaining than just listening to a series of talking heads.

Search, ethics, competence, preservation, rules, privileges, claw-backs, and most every other e-discovery topic you can imagine came up in the course of the mock 26(f) conferences and five judge commentary. As you would expect with Judges Peck, Facciola and Grim in attendance, the hot topic of today, predictive coding, was also discussed at some length.

It was also great to hear all of the triumvirate opine on proportionality and how to control costs. As you know from my last blog, Bottom Line Driven Proportional Review, cost control is my current mission, my way of helping Superman in the never-ending battle for truth, justice and the American way.

The CLE Plan

Below is the agenda of the event sponsored by Nova Office, which other vendors might want to look at as an example of the right way to do a CLE. Of course, we deviated somewhat from the script, and went with the flow, but still, the outline may be of some assistance to future CLE organizers. The event itself was called e-discovery judges in charlotte:


Continental Breakfast 7 AM – 8 AM

  • Introduction to panel
  • Introduction to case (10 minutes)
  • Defendant reacts to receiving complaint (15 minutes)
  • Conversations between counsel (40 minutes)
  • Emergency motion (15 minutes) (we skipped this – no time)
  • Judicial reaction (as long as they want!)
  • Key Words (10 minutes)
  • Coffee Break (15 minutes)
  • Panel discussion (45 minute)
  • A better way (60 minutes)
  • Lunch Break (30 minutes)
  • Meet & Confer  FRCP 26(f) (120 minutes)
  • Coffee Break (15 minutes)
  • Judges Conference FRCP 16(b) (45 minutes)
  • Open Discussion with Magistrate Judges (60 minutes)

As an interesting twist, they had a court reporter transcribe the whole event, instead of a more typical video. I’m told it will be posted online in about ten days where it will be freely available to all. Try this address in ten days: www.novaoffice.net/assets/pdf/cle_transcript.pdf

Aside from the Big Three judges, the local judges, Sonya Sigler, and myself, we had a large team of mock 26(f) players, listed below. The opposing counsel who lead the defense team, Jill Griset and Kate Maynard, were worthy adversaries. I hope that my sparring with them helped prepare me for my next debate, which is only against one person. But, that one person is no mere mortal, it is none other than the world-famous Craig Ball, the long reigning champion of e-discovery debates.

The Coming Battle at Legal Tech

Yes. You heard it right, Losey v. Ball. It will be moderated by an esteemed member of the triumvirate, Judge Andrew Peck. I’m told Judge Peck has already used Sherlockian code to predict my loss. Or was it my irrelevance? Not sure. Either way, he seems to think my likely defeat is pretty funny. Judge Peck knows all of our moves, Ball and me. Following the methods of his hero and mine, Sherlock Holmes, he has played out all of the possible permutations in his mind deduced my inevitable defeat. Apparently my only chance of defeating Moriarty requires a suicide jump into the Falls. (Or does it?) Speaking of Sherlock Holmes, have you seen the new movie, A Game of Shadows? Judge Peck and I both say to ignore the critics and check it out.

The debate, some would say sacrifice of the newcomer (me), takes place at Legal Tech on January 31st, 4:00-5:00 pm, at the Sutton Center, 2nd floor. Apparently the very special Master of e-discovery, a/k/a Craig D. Ball, has never been beaten. In fact, most leave the field of law altogether after any attempt, or at least escape to the land of no-debates and no-arguments, just friendly, cooperative dialogue. I’ve heard it’s located somewhere in Arizona.

The legal rumble in New York is sponsored by BIA, Inc. I bet they are still chuckling behind the scenes at finally finding a patsy to take up the challenge of a battle against Ball. I later found out that the first twenty people they asked before me all came up with good excuses to say no, like wanting to save their reputation and avoid public humiliation. Me, I didn’t know any better, and succumbed to their flattery and said yes. (In fairness, BIA is very good at flattery, right Mark?) Only later did I understand the meaning of the evil laughter that followed my assent.

Please show up and cheer me on in the Losey v. Ball debate. I need your moral support as I attempt to do battle with the long-reigning Emperor of e-discovery. At the very least, you can help me pick up the pieces after my thrashing and buy me a drink. If only I knew how to make ESI emit from my hands like Craig does, then I might have a chance. Oh well, at least the defense Bar is with me, even if the force and forensic freaks are not.

The Key Players in Charlotte

In addition to the Big Three of Peck, Facciola and Grimm, here are the key players in my warm up battle in Charlotte last week. You will notice a lot of heavy hitters here from a variety of backgrounds. That is another key ingredient for an event like this.

Corporate e-discovery counsel or IT/Records Managers:

  • Craig Cannon, e-Discovery Counsel, Bank of America
  • Carranza Prior, Associate General Counsel & Director of Litigation TIAA CREF
  • Tim Nohr, Associate General Counsel Rack Room Shoes
  • Sarah Serpico, Manager , Business Solutions and Technology, National Gypsum
  • Terri Alsop, IT Project Manager, IT Dept. Duke Energy
  • Bruce Pfannenstiel, Senior Manager, Global Records, Global Quality and Compliance, Pharmaceutical Product Development (PPD)

E-Discovery litigators participating:

  • Joshua B. Durham, Litigation Co-Chair, Poyner Spruill
  • Ralph Losey, Attorney / Partner at Jackson Lewis LLP, Orlando, Adjunct Professor of Law at University of Florida
  • Jill Crawley Griset, Partner McGuireWoods
  • Christopher A. Hicks, Partner, Katten, Muchin Rosenman LLP
  • Katherine Gordon Maynard, Robinson, Bradshaw & Hinson

E-Discovery consultants:

  • Sonya Sigler, VP of Product Strategy, SFL Data
  • Matt Miller, SVP- Marketplace Leader, Intellectual Property Practice Group Leader, DiscoverReady
  • Joe Bartolo, Director of Corporate Sales and Hosting, SRM Legal

Final Thanks and Restaurant Review

A final special thanks goes out to Doug Moore, of FSO Outsourcing, now hailing from New York City (shown second to left, just behind someone’s big head). Doug did most of the heavy lifting for event planing. He also braved a rainy night in Charlotte to go outside and help me find the restaurant, The King’s Kitchen, at 129 West Trade Street in Charlotte. That is where all of the speakers ate the night before. I highly recommend this restaurant. King’s Kitchen is both a non-profit foundation and restaurant that serves good old-fashioned southern cooking with profits going to help feed the homeless. Can anyone guess Judge Grimm’s favorite food at this restaurant? No, it was not a Happy Meal.

Leave your comments below with guesses on Grimm’s favorite food and the name of the judge whose learned hand graced the second photo from the top. The winners get a free beer or cup of coffee at Legal Tech. (On me, or whatever vendor happens to have an open tab handy, which, of course, is the main reason most of us go to Legal Tech.)

Any debating tips for my battle with Ball would also be appreciated. I already owe him one steak dinner for our prior debate over Phillip M. Adams & Associates, L.L.C., v. Dell, Inc. I said it was a bad decision and would be reversed. It wasn’t. So, I lost the bet. I understand the Legal Tech debate will include the infamous Pippins v. KPG, LLP case. Once again, Master Ball says the decision is sound and will be affirmed. Once again, I predict reversal of this bad opinion. Am I right? What do you think?

7 Comments | Evidence, Forensic Exam, Lawyers Duties, Metadata, New Rules, Related Legal Webs, Review, Search, Spoliation/Sanctions, Technology | Permalink
Posted by Ralph Losey


« Previous Entries
  • You are currently browsing the archives for the Metadata category.

  • Related Legal Webs

    • DISCLAIMER
    • E-Discovery Team Training
    • Favorite E-Discovery Links
    • IT & Law: Ralph’s Website
  • Enter your email address to follow this blog and receive notifications of new posts by email.

  •  Subscribe in a reader

  • About the Blogger

    Ralph Losey is the lawyer, writer, ESI search designer, and teacher behind the e-Discovery Team blog. Ralph has been practicing law since 1980 and playing with computers and online communications since 1978. He is a partner in a major national law firm. He holds the highest AV peer rating by Martindale Hubbell and has 70 published opinions to his credit. Ralph is the proud father of two children, Eva Grossman, and Adam Losey, an e-discovery lawyer, and best of all, husband since 1973 to Molly Friedman Losey, a mental health counselor in Winter Park.

  • Losey’s Recent Books

    Adventures in Electronic Discovery (West Thomson Reuters, 2011).
    Electronic Discovery: New Ideas, Trends, Case Law, and Practices (West Thomson Reuters, 2010).
    Introduction to E-Discovery: New Cases, Ideas, and Techniques (ABA 2009).
    e-Discovery: Current Trends and Cases (ABA 2008).
  • Law Review Articles

    HASH: The New Bates Stamp, 12 Journal of Technology Law & Policy 1 (June 2007).
    Mancia v. Mayflower Begins a Pilgrimage to the New World of Cooperation, 10 Sedona Conf. J. 377 (2009 Supp.)
    Lawyers Behaving Badly, 60 Mercer L. Rev. 983 (Spring 2009).
    Mercer Ethics Symposium Transcript with Judge John Facciola, Judge David Baker, Ralph Losey, Jason Baron, William Hamilton, Professor Monroe Freedman, and Chilton Varner, 60 Mercer L. Rev. 863 (Spring 2009).
  • Ralph’s Twitters on Tech

    • British Barrister, Chris Dale, tells it straight up. is.gd/mxOQO2 1 week ago
    • Craig Ball here creates a short list of rights and duties of a requesting party in e-discovery. is.gd/SrLVJp 1 week ago
    • Part 2 of my interview by Exterro on Rule Changes, Doc Review., & science. Audio and edited transcript. is.gd/Syt3fZ 2 weeks ago
  • Math Tool for Quality Control

    Random Sample Size Calculator
  • Recent Interviews

    1. Ralph Losey and Judge Shira Scheindlin on ESIbytes.
    2. April 2012 interview, part one, audio and edited transcript, on general topics, and part two on rules and best-practices.
    3. Video interview at Legaltech 2012 on predictive coding.
    4. Part One of Interview at Legaltech 2012 by e-Discovery Daily blog. Part Two of Interview on trends.
    5. Sept. 2011 Interview on e-Discovery Education on ESIbytes.
    6. Ralph Losey and Judge David Waxse audio interview on the ESI Report.
    7. Video Interview at LegalTech 2011 on proportionality, cost-controls, and e-discovery training by Browning Marean and Tom O'Connor.
    8. Video Interview at LegalTech 2011 with Jason Baron on education and other topics by Greg Bufithis.
    9. Video Interview at LegalTech 2011 on this blog's training program by Sarah Brown.
    10. Audio Interview in October 2011 on Controversial Issues by Sharon Nelson and John Simek.
    11. Questions about Specialization and my movie with Jason Baron ESIbytes.
    12. FIOS Interview by Mary Mack.
    13. e-Discovery 2.0 Interview by Kurt Leafstrand.
    14. Ethics Interview by Karl Schieneman.
    15. TechLaw Interview by Browning Marean and Tom O'Connor.
  • Recent Blog Comments

    Carly Buntz on Announcing My New Book on e-Di…
    Links (weekly) | The… on Random Sample Calculations And…
    Lit Support Links (w… on Random Sample Calculations And…
    Blogging, Proportion… on Good, Better, Best: a Tale of …
  • Blog Stats

    • 780,856 visits
  • HOW MUCH DATA DO YOU HAVE?

    CD = 650 MB = 50,000 pages. DVD = 4.7 GB = 350,000 pages. DLT Tape = 40/80 GB = 3 to 6 Million pages.
    Super DLT Tape = 60/120 GB = 4 to 9 Million pages.

    ***************************
    Page Estimates:
    1 MB is about 75 pages;
    1 GB is about 75,000 pages (pick-up truck full of documents).

    Aver. pgs. per email: 1.5 (100,099 pages per GB).
    Aver. pgs. per word document: 8 (64,782 pages per GB).
    Aver. pgs. per spreadsheet: 50 (165,791 pages per GB).
    Aver. pgs. per power point: 14 (17,552 pages per GB).

    ***************************
    For the average .PST or .NSF Email File:
    100 MB .PST file is 900 emails and 300 attachments.
    400 MB .PST file is 3,500 emails and 1,200 attachments.
    600 MB .PST file is 5,500 emails and 1,600 attachments.
    A 1.00 GB .NSF file is 9,000 emails and 3,000 attachments.
    A 1.5 GB .NSF file is 13,500 emails and 4,500 attachments.

    ***************************
    Note: Many variables will affect ALL of the actual numbers above, including especially large image and video files, and recursive files.

    ***************************
    Bits and Bytes Sizes:
    •8 bits are equal to 1 byte (one or two words),
    •1,024 bytes are equal to 1 kilobyte (KB).
    •1,024 kilobytes (KB) are equal to 1 megabyte (MB or Meg).
    •1,024 megabytes are equal to 1 gigabyte (GB or Gig) (truck full of paper).
    •1,024 gigabytes are equal to 1 terabyte (TB) (50,000 trees of paper).
    •1,024 terabytes are equal to 1 petabyte (PB) (250 Billion Pgs. of Text).
    •1,024 petabytes are equal to 1 exabytes (EB) (1 000 000 000 000 000 000 bytes).

  • Search for the Golden Needle

    "He who must search a haystack for a needle is likely to end up with the attitude that the needle is not worth the search." Justice Robert H. Jackson (1892-1954) Brown v. Allen, 344 U.S. 443, 537 (1953)
  • Watch videos at Vodpod.
  • Sedona Principles, 2nd Ed.


    1. Electronically stored information is potentially discoverable under Fed. R. Civ. P. 34 or its state equivalents. Organizations must properly preserve electronically stored information that can reasonably be anticipated to be relevant to litigation.

    2. When balancing the cost, burden, and need for electronically stored information, courts and parties should apply the proportionality standard embodied in Fed. R. Civ. P. 26(b)(2)(C) and its state equivalents, which require consideration of the technological feasibility and realistic costs of preserving, retrieving, reviewing, and producing electronically stored information, as well as the nature of the litigation and the amount in controversy.

    3. Parties should confer early in discovery regarding the preservation and production of electronically stored information when these matters are at issue in the litigation and seek to agree on the scope of each party’s rights and responsibilities.

    4. Discovery requests for electronically stored information should be as clear as possible, while responses and objections to discovery should disclose the scope and limits of the production.

    5. The obligation to preserve electronically stored information requires reasonable and good faith efforts to retain information that may be relevant to pending or threatened litigation. However, it is unreasonable to expect parties to take every conceivable step to preserve all potentially relevant electronically stored information.

    6. Responding parties are best situated to evaluate the procedures, methodologies, and technologies appropriate for preserving and producing their own electronically stored information.

    7. The requesting party has the burden on a motion to compel to show that the responding party’s steps to preserve and produce relevant electronically stored information were inadequate.

    8. The primary source of electronically stored information for production should be active data and information. Resort to disaster recovery backup tapes and other sources of electronically stored information that are not reasonably accessible requires the requesting party to demonstrate need and relevance that outweigh the costs and burdens of retrieving and processing the electronically stored information from such sources, including the disruption of business and information management activities.

    9. Absent a showing of special need and relevance, a responding party should not be required to preserve, review, or produce deleted, shadowed, fragmented, or residual electronically stored information.

    10. A responding party should follow reasonable procedures to protect privileges and objections in connection with the production of electronically stored information.

    11. A responding party may satisfy its good faith obligation to preserve and produce relevant electronically stored information by using electronic tools and processes, such as data sampling, searching, or the use of selection criteria, to identify data reasonably likely to contain relevant information.

    12. Absent party agreement or court order specifying the form or forms of production, production should be made in the form or forms in which the information is ordinarily maintained or in a reasonably usable form, taking into account the need to produce reasonably accessible metadata that will enable the receiving party to have the same ability to access, search, and display the information as the producing party where appropriate or necessary in light of the nature of the information and the needs of the case.

    13. Absent a specific objection, party agreement or court order, the reasonable costs of retrieving and reviewing electronically stored information should be borne by the responding party, unless the information sought is not reasonably available to the responding party in the ordinary course of business. If the information sought is not reasonably available to the responding party in the ordinary course of business, then, absent special circumstances, the costs of retrieving and reviewing such electronic information may be shared by or shifted to the requesting party.

    14. Sanctions, including spoliation findings, should be considered by the court only if it finds that there was a clear duty to preserve, a culpable failure to preserve and produce relevant electronically stored information, and a reasonable probability that the loss of the evidence has materially prejudiced the adverse party.

    Copyright © 2007 The Sedona Conference®. All Rights Reserved.

    Reprinted courtesy of The Sedona Conference®.

    Go to www.thesedonaconference.org to download a free copy of the complete document for your personal use only.

  • Permanent Pages

    • About
    • New Rule 16
    • 26
    • 34
    • 37
    • Duties
    • Hash
    • Interviews
      • Interview of Judge Shira Scheindlin and Ralph Losey
      • Questions about Specialization and my movie with Jason Baron
      • 2009 FIOS Interview
      • e-Discovery 2.0 Interview
      • Ethics Interview
      • TechLaw Interview
      • 2010 FIOS Interview
    • Meta
    • School
      • Are Today’s CLE Programs Doomed to Go the Way of the Newspaper?
      • Cyber Law School in e-Discovery is Now Open!
      • Florida Bar Approves the Blog’s Online Education Program
      • Kroll’s Annual Report: Same-Old, Same-Old. So What Are We Going To Do About It?
      • Online e-Discovery Instruction in Law School Is Now a Reality
      • Replead for Participation and Another Case Showing the Need for Better Education
      • The Law Firm Apprenticeship Tradition And Why Most Lawyers Are Still Untrained in e-Discovery
      • Sample Class in Online Training Program
      • Judge Shira Scheindlin and I Speak on e-Discovery and Education
      • Plato’s Cave: why most lawyers love paper and hate e-discovery and what this means to the future of legal education
      • Fresh Perspectives on e-Discovery from Young Minds in the “Academy
      • “Teach Your Children Well” – A Case for Teaching E-Discovery in Law Schools
      • The E-Discovery Crisis: An Immediate Challenge to our Nation’s Law Schools
  • BLOGGING MUSIC

    Bach More Bach Chopin Debussy Glazunov Gershwin Imagine by John Lennon

Theme: Contempt by Vault9.
Blog at WordPress.com.