The two groups, the National Security Archive and Citizens for Responsibility and Ethics in Washington, sued the Executive Office of the President in 2007 claiming that millions of e-mails were missing, based on information they had received and their own research.
The lawsuit wanted the White House to account for all the missing e-mails , not only because the law required it, but because of their importance to history.
The settlement (PDF document) with the White House, announced today by the groups that brought the suit, sets the terms for hunting for additional White House e-mails. The White House will investigate records from an additional 94 days.
More than 22 million e-mails have been recovered and more are likely waiting to be found, said Meredith Fuchs, the archive's general counsel. But "I don't think we will ever know whether everything was restored," Fuchs said.
The settlement also requires the White House provide a publicly releasable letter "describing in as much detail as possible" the current Executive Office of the President computer systems, including its e-mail, archiving and backup systems. "This document will include a detailed description of the controls in the system that prevents the unauthorized deletion of records," the settlement says.
That letter is due by Jan. 15.
"We actually want them to explain what their systems is," Fuchs said. But she said the White House, because of security concerns, is likely to focus on explaining "certain characteristics" of the systems and not necessarily the technologies underpinning them.
The e-mail problem began in 2002 and 2003 after the White House moved from Lotus Notes to Microsoft Exchange. As it moved to the new platform, the President's IT staff also discontinued use of legacy, circa 1994, electronic management and archiving system, called Automated Records Management Systems (ARMS.) Development began on a new archiving system that ran into its own issues and wasn't implemented.
Without an automated archiving system, the White House relied on manual processes to archive e-mails, and that's when the problems evidently began. Files were mislabeled and commingled on back-up tapes containing all types of information.
Among the techniques used by the White House to determine whether e-mails were missing was modeling of traffic, from which they founds days that had statistically low e-mail volumes.
More from IDG:
- ERP Vendor Offers to Take Over MySQL
- Oracle's Decade of Acquisition: Innovative or Just Well-Financed ...