The data is potentially explosive, but how do you even extract the information tied up in millions of PDF files and emails, much less tease out the complex relationships that stakeholders have intentionally tried to hide?