If you've been watching the last two RTL posts, you are aware of the personally identifiable information (PII) that is contained in the Enron Email Data Set. Responding to a report from BeyondRecognition's CEO John Martin, Amazon Web Services wrote to John yesterday and advised him that the Data Set had been taken down.
Through the prism of today's concerns with privacy, I believe this is the correct result and applaud John for ensuring that people outside the industry were made aware of the PII. It was time to revisit the issue of whether that data should be publicly available. I am glad that EDRM is working with Nuix to remove the PII so that the data set may once again be made public. When that happens, I'll be sure to post the new link to the data set.