The term "metadata"
is likely the most commonly used technical term used by attorneys when dealing
with eDiscovery. Speaking as a technologist for more than twelve years,
metadata is a detail of electronic documents that the legal field has been able to gain
more value from than technologists. Backup tapes exist and were
invented to recover systems,
not for discovery; however, with the value that metadata brings to
intricate details of electronic files, I also understand the value of the
historical data in litigation that backup tapes may contain. A basic definition
of metadata is that it is simply "information about information". In
this article, we will cover some of the basics of metadata, the value of the
information, and how the metadata can tell a different story from the contents
of an electronic document by itself.
A striking statement that I make in many of my speaking engagements is that you
are already involved in eDiscovery whether you realize it or not. Ninety-five to ninety-eight percent of all
information is digital. This means that the paper files that you are reviewing
are likely just a printed manifestation of an electronic file. However, the
printed document does not have all of the information that may be relevant to
your case. You guessed it; the metadata is what is missing from a printed
document. The metadata of a digital document provides a wealth of information
that is only available when you have the electronic file. Information such as:
· File creation dates and times
· File modification dates and times
· Authors of a document
· Who last saved a document
Depending on the type of digital file, the metadata may also determine the
computer a file was created, when it was last printed, and what version of the
software was used to create and modify the document. The metadata of an excel
document may uncover hidden rows or columns, in addition to the actual formulas
used to derive the mathematical calculations. Email metadata will also contain
the email "header" which can be best described as the tracking
information for an email: when it was sent, where it was sent from, when it
arrived, etc.
When combining the basics of metadata with "embedded" metadata that
may contain the track changes history of word, the courts had a great deal of
foresight with the issues surrounding metadata and the need for Rule 26(b)(5),
the "clawback". When not properly analyzed, the metadata of a
document could reveal much more information than you intended.
Rather than explaining each and every exciting detail about metadata, let's put
this basic tutorial of metadata to use. You are representing a client in a
workplace discrimination suit. As part of your discovery request, you have
asked for all employment records from 2002-2007 in an electronic format. The
manager who performed the annual
reviews during that the time was named Darlene Johnson, she has since been
replaced by John Smith. On February 1 2008, you receive annual employee reviews
for your client in a native format (Microsoft Word). As you review the
contents, the information in the files does not match the situation that your
client has described. You decide to have the metadata of some the files
analyzed. Below is a screenshot of a utility we use to analyze metadata:

When analyzing the metadata of the
"Employee Review - 2002" document, there seems to be some
inconsistencies.
1. The original author of the "Employee Review -
2002" document was "djohnson", confirmed to be the username for
Darlene Johnson but the document was last saved by "jsmith",
confirmed to be the username for John Smith.
2. The file was last modified and saved on 1/26/2008 by
"jsmith" and was then printed.
As you can see, the metadata tells a different story from the contents of the
document. If only a paper copy of the "Employee Review - 2002"
document was provided, you may not have known that John Smith edited the
document less than a week before it was produced.
The metadata of a document provides a great deal of information not available
with a paper copy only. Information such as author, creations dates, and
modified dates may provide great value to your litigation. Maybe simply knowing
the formulas used in Microsoft Excel or when an email was actually sent could
be a turning point in your case.