So ePadd redacts everything except the named entities, but that still means going through each message by hand to ensure that the entities generated by the NLP software are correct. Plus the time spent fixing ePadd to make the import run correctly with his non-standard email client, the time spent negotiating permissions and restrictions related to the collection, etc.
C.f.: https://github.com/search?q=repo%3AePADD%2Fepadd++knuth&type...