-
MailBox
-
Inbox
-
Outbox
-
Sent
-
Trash
-
Steve, Harvey, and Matt
From: Hessling, Michael Sent: Tue 5/9/2017 11:19:23 AM
All right. This took me about 90 minutes to do 108 pages, and it's not going to get easier, I don't think. I've finally figured out why bulk archive doesn't work 100% ... we get rate limited by Akamai. Bulk archive goes too fast. I resorted to doing the archiving 1-by-1. It's not too slow once you get a rhythm (gonna get you), but tedious since I waited until the page was definitely archived before hitting the next. The bulk archive missed four files, highlighted in yellow in the attached spreadsheet. I found three of them stuck in the queue for archiving state (so at least that fails gracefully). (All three are attached.) Missing: clean-power-plan-and-carbon-pollution-standards-regulatory-actions. I find no evidence of a redirect to some other page (like clean-power-plan-existing-power-plants-regulatory actions??), but I could be wrong. What's the best way to get those three files over to archive? ~Mike
From: Hessling, Michael Sent: Monday, May 8, 2017 4:14:48 PM
Tomorrow morning, I will archive clean-power-plan. Tomorrow during the day, you'll want to rename the ARCHIVED clean-power-plan back to cleanpowerplan using your Perl script.~Mike
From: Freire, JP Sent: Friday, April 28, 2017 5:51 PM
Steve, Harvey and Matt, As discussed with Nancy, we would like the content at the links below removed and archived as soon as possible. The removed climate change pages will be replaced with a custom page that will explain why the change is happening, include a link to the January 19, 2017 snapshot of the EPA website, and a link to the press release we'll be putting out. The clean power plan pages will be redirected to the newly posted energy independence pages. We appreciate your assistance in this time-sensitive matter. Please let us know if you have questions. Thank you
From: Hessling, Michael Sent: Monday, May 8, 2017 2:23:37 PM
I would do this archiving early in the morning and go as fast as I can. Wouldn't that prevent the indexing? The URL pattern would be "clean-power-plan" if that helps (changed from "cleanpowerplan"). ~Mike