Fix tag indexing #144

adevyish · 2018-12-06T07:15:22Z

Fix exception if tag has / in it
Fix tag links not working

- Fix exception if tag has `/` in it - Fix tag links not working

bbolli · 2018-12-06T08:48:09Z

That's great, thanks! Does slugify() also work with Windows?
And, seeing that you don't need the first tuple element, it should be removed completely.

adevyish · 2018-12-06T10:14:01Z

I think it should but I don’t have access to a testing environment. Can fix up the tuple thing tomorrow but I was trying to get it working quickly 😅

mcscope · 2018-12-06T23:19:28Z

tumblr_backup.py

@@ -414,7 +429,8 @@ def save_tag_index(self):
        mkdir(path_to(tag_index_dir))
        self.fixup_media_links()
        tag_index = [self.blog.header('Tag index', 'tag-index', self.blog.title, True), '<ul>']
-        for tag, index in sorted(self.tags.items(), key=lambda kv: kv[1].name):
+        for _, index in sorted(self.tags.items(), key=lambda kv: kv[1].name):


use tags.values() to get just the values instead of tuples

Yes, I’m aware of this.

adevyish · 2018-12-07T04:24:20Z

As a sidenote, I didn't want to add an additional dependency but unicode-slugify also handles if you want to slug with non-ascii characters (which I'm doing for my own archive, since I have plenty of CJK tags)

aspensmonster · 2018-12-07T19:15:50Z

It seems like tumblr tags are a wonderful example of diverse user input that's always trying to outsmart the slug code. CJK tags, tags with slashes, tags with all kinds of odd unicode, emoji, multiple tags condensing down to one slug (whose sets aren't identical, so the rendered HTML is incomplete)...

Assuming you don't mind too much what the folder name is --if your use of the backup is mostly using the rendered HTML-- this approach seems to work for me for various weird tags (haven't found a broken or empty link yet, though there are thousands of tags in my backup):

import hashlib
...
tag_index = [self.blog.header('Tag index', 'tag-index', self.blog.title, True), '<ul>']
for index in sorted(self.tags.values(), key=lambda v: v.name):
    tag = hashlib.sha256(index.name.encode('utf-8')).hexdigest()
    etc etc etc

I'm also pretty sure hashlib is part of the standard library, so no additional module install is needed.

Fix tags

7327c59

- Fix exception if tag has `/` in it - Fix tag links not working

mcscope reviewed Dec 6, 2018

View reviewed changes

remove unused variable

1f43dbe

fix the sort key too

897967e

aspensmonster mentioned this pull request Dec 11, 2018

Hacky fixes to archive likes #114

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix tag indexing #144

Fix tag indexing #144

adevyish commented Dec 6, 2018

bbolli commented Dec 6, 2018

adevyish commented Dec 6, 2018

mcscope Dec 6, 2018

adevyish Dec 6, 2018

adevyish commented Dec 7, 2018 •

edited

aspensmonster commented Dec 7, 2018

Fix tag indexing #144

Are you sure you want to change the base?

Fix tag indexing #144

Conversation

adevyish commented Dec 6, 2018

bbolli commented Dec 6, 2018

adevyish commented Dec 6, 2018

mcscope Dec 6, 2018

Choose a reason for hiding this comment

adevyish Dec 6, 2018

Choose a reason for hiding this comment

adevyish commented Dec 7, 2018 • edited

aspensmonster commented Dec 7, 2018

adevyish commented Dec 7, 2018 •

edited