How To Back Up An Adult Tumblr (2018 Edition)
Wednesday, December 5th, 2018 -- by Bacchus
OK, Monday was our day to smack foreheads and react to the Tumblr porn ban. Tuesday was our day to mourn the loss of communities. For people who need a new home, yesterday began the migration process, or at least the search for a destination. Today? For me it’s a day to act. I’m a porn curator, and I’m aghast at the collections that will burn on December 17. What little I can do to fight those fires starts today.
Five years ago when I first saw this shitshow coming, I researched and advised the community on backup options for adult tumblrs. But since then, Tumblr has taken several technical steps to make porn tumblrs harder to view or copy. And fundamentally, I’m a technical idiot. I gloomily thought all my old advice was obsolete, which is why I wrote this fundamentally wrong depressed Eeyore shit on Monday:
I haven’t tested, but I don’t think the advice in that post would work any more, now that you have to be logged in to Tumblr even to view your own porn blog. (I could be wrong.)
Reader, I was wrong. After consulting with my good friend Dr. Faustus at Erotic Mad Science, who is technically sharper than me and who has spent much of the past five years diligently rescuing material of interest from Tumblr and turning it into a deep and intricate set of self-hosted blogs, he pointed me at some instructional material that set me straight. It turns out that I can pick the best backup solution from my old article, add one screenshot with a single new setting (the password/login you need to be able to see porn on Tumblr), and we are back in backup business! (Product best when used before December 17, 2018, contents may settle in shipping.)
How To Back Up An Adult Tumblr
These are instructions for using HTTrack/WinHTTrack Website Copier to make a local mirror of any Tumblr in a folder on your hard drive. HTTrack is a free-software GPL general utility for copying and mirroring websites, available for most current versions of Windows as well as for a wide variety of Linux/Unix flavors. The Windows version presents a fairly old-fashioned interface with a bunch of cryptic options, but most of them come pre-set with sensible defaults that you actually don’t need to mess with. Plus, there’s good documentation.
Download the software and install it. Windows will probably try to scare you out of completing the install. Be bold, be brave. Trust your open source software developers.
Now run the the software. You’ll be presented with a welcome screen where you need to click “Next”. Then, this screen:
The arrows show you the two fields that need your attention. All you really need to do is give this backup project a name and tell the software where to save the backup. Then hit “Next”:
The two red arrows above screen point to mandatory buttons that need our attention. The first one opens the screen where we tell Tumblr the URL of the porn tumblr you want to download. It will be something like: http://yourtumblr.tumblr.com — this doesn’t have to be your tumblr URL though, it can be any tumblr you want to save, like one that nobody has updated in five years that you know will be going dark in two weeks. This is also where you input your Tumblr login information, to prove to Tumblr that you’re special enough to view porn. This does have to be your login, no matter whose blog you are downloading.
The next screen is the one you see when you push the “Set options” button, which is absolutely vital. When you do, you’ll see this:
I’ve pointed arrows at three optional settings tabs that you may want to adjust, and at the one mandatory options tab where you must change a setting. I’m going to ignore the optional ones for now, except to say that you would tinker with these if you wanted to change the sorts of media files you’re saving beyond the basic .gif, .jpg, and .png (you’d need to do this if you were saving a Tumblr that had .wav files or .zip files or .mp3s), or if you need to limit this program from slamming your internet connection too hard. It’s the mandatory “Spider” tab you really need to click:
See the box where it says “follow robots.txt rules”? Change it using the drop-down menu to say “no robots.txt rules” so that this software (your robot) will know to ignore Tumblr’s robot-hostile electronic “keep out” signs.
Yay! We’re almost done. Hit the “OK” button, hit “Next”, hit “Finish”, and your site copying should begin.
How long it will take to finish depends on the available bandwidth of your net connection, the memory and processing speed of your computer, and on whether you tweaked any of the options that control things like how many simultaneous connections your computer is making and how many files it’s trying to download in parallel. It also depends on how many pages there are on the Tumblr blog you are backing up, and on how big the images are. The default settings seem to be fairly gentle about not maxing out your internet connection or putting an unruly amount of strain on the server at the site you are trying to copy. Using default settings and a fairly crappy internet connection, I downloaded an ancient porn blog overnight last night. It had 3,300 posts and took up 1.8GB on my hard drive.
What does success look like? You’ll have a folder on your hard drive with the name you provided on the first options screen. If you open it, you will find many sub-folders, and much that may seem mysterious. You should also find a file called “index.html” — and if you click on it, it should open in a new browser window where you’ll be looking at your backed up Tumblr site, using nothing but the files on your hard drive.
What have we not accomplished? Well, you’ve made what should be a full and true copy, but it’s not a nice clean export in some standard format that you could use to easily import all your posts into another content management system or blogging tool. HTML files and related images are scattered through a system of directories and subdirectories that, while logical, may not be the simplest thing to work with. Using the data you’ve got, a clever computer person could generate an XHTML document (or something similar) that could be semi-automatically imported into (say) WordPress. But it would take parsing; it would take work. Figuring out how to take the copy you just made and turn it back into a non-Tumblr website is a solvable problem, but how easy or hard it might be to actually do it depends on your access to computer expertise and tools. For now, you’re safe in the knowledge that you’ve got all the posts you’ve made this past however-many years. You’ve got the images, you’ve got their metadata (any tags you set for them and any credits you may have reblogged or included) and you’ve got the clever things you said about them, all, safe on your hard drive.
Now would be a good time to back up your hard drive. I’m just sayin’.
Additional caveats and warnings and gotchas and thoughts:
1) This might or might not work on your own porn tumblr after December 17. Tumblr hasn’t said it will delete porn, just that it will make it invisible to everyone but the owner. But I don’t trust them not to continue deleting stuff. I wouldn’t wait. And for any blog you don’t own, this will almost certainly stop working. Every old porn tumblr that is no longer being updated? This is a way to make a copy on your hard drive. Hurry! Don’t wait. Hell is coming, but so is December 17. December 17 will get here first.
2) You have to type your Tumblr login and password into HTTrack to use this method. There’s a good chance that login information gets recorded in the many complex files HTTrack writes to your hard drive. If you ever turn those files over to someone who is, say, helping you convert those files into a WordPress blog — or if, say, you send them off to the Internet Archive for posterity — your password could be exposed. It might be smart to set up a “burner password” on Tumblr that you never used anywhere else and will never use again before firing up HTTrack.
3) There is a method offered by Tumblr for doing a fancier and more complete export of the Tumblr blogs that belong to you, including some of the social interaction metadata. I have not covered it here because it appears not to be very reliable; in every case available to me for testing, it failed permanently with a “processing backup” message that never ends or goes away. So I decided not to waste your time with the instructions for making that attempt. I have heard precisely one account on Twitter of a successful official export.
I hope this helps!