
Task queue metrics helper methods #108

Open · wants to merge 6 commits into master
Conversation

@alanhamlett (Contributor) commented Apr 24, 2018

Adds new helper classmethods:

  • Task.task_count_from_queue
  • Task.queue_metrics

Also fixes violations of these pep8 linter rules:

E116 unexpected indentation (comment)
E127 continuation line over-indented for visual indent
E222 multiple spaces after operator
E226 missing whitespace around arithmetic operator
E231 missing whitespace after ','
E261 at least two spaces before inline comment
E301 expected 1 blank line, found 0
E302 expected 2 blank lines, found 1
E303 too many blank lines (2)
E306 expected 1 blank line before a nested definition, found 0
E502 the backslash is redundant between brackets
F403 'from ._internal import *' used; unable to detect undefined names
F841 local variable 'e' is assigned to but never used
W503 line break before binary operator

Related to #107.

@alanhamlett alanhamlett changed the title Task count from queue helper method Task queue metrics helper methods Apr 24, 2018
@alanhamlett (Contributor Author)

@thomasst ready for code review.

@thomasst (Member) left a review comment


Thanks! See comment.

README.rst Outdated
keyword argument, which accepts an integer indicating how many executions
should be loaded.
To get a count of the number of tasks for a given queue and state, use
``Task.count_tasks_from_queue``. To get a list of all tasks for a given queue
Member:

This is actually called task_count_from_queue in the code.

Also, another approach would be to let you pass 0 into tasks_from_queue (currently it's undefined). That way we wouldn't need to add a new function.

Contributor Author:

tasks_from_queue uses mget and then loops over all the returned tasks. It's much faster to use zcount to get only the count of tasks from Redis.
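For illustration, a minimal sketch of the counting approach being described (assuming a `tiger` object exposing `_key()` and a redis-py-style `connection`, as in the snippets under review):

```python
def task_count_from_queue(tiger, queue, state):
    """Count tasks in a given queue and state with ZCOUNT.

    ZCOUNT only consults the sorted set's index, so Redis never has to
    return task bodies, unlike an MGET-based loop that fetches and
    deserializes every task just to count them.
    """
    key = tiger._key(state, queue)
    return tiger.connection.zcount(key, '-inf', '+inf')
```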

Contributor Author:

Fixed the readme with 843e016.

Member:

I meant that passing limit=0 should make it just return the count and an empty list.

Contributor Author:

Oh, got it. I prefer methods to not change behavior. Would you be ok with leaving them separate methods?

@alanhamlett (Contributor Author)

Should we leave the methods separate?

@alanhamlett (Contributor Author)

Ready for re-review 😄

@alanhamlett (Contributor Author)

Anything else needed for this PR?

@alanhamlett (Contributor Author)

Bringing this PR to attention again. It should be ready to merge pending the comment above.

@thomasst (Member)

Thanks! It's on my list to review.

@alanhamlett (Contributor Author)

Let me know if I can do anything to help with your review! 😄

@alanhamlett (Contributor Author)

Happy Friday! The best day for merging PRs 😉

    prefix = tiger.config['REDIS_PREFIX'] + ':'

    for state in metrics.keys():
        queues = tiger.connection.smembers(prefix + state)
Member:

I'd use tiger._key(state).

    queues = tiger.connection.smembers(prefix + state)
    for queue in queues:
        metrics[state][queue] = {
            'total': self.task_count_from_queue(tiger, queue, state),
Member:

This should use a pipeline to avoid round trips.
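A sketch of how the inner loop could be pipelined (assuming redis-py's pipeline API and the `tiger._key()` helper used elsewhere in this thread; the `smembers` calls per state could be batched similarly):

```python
def queue_metrics(tiger, states=('active', 'queued', 'scheduled', 'error')):
    """Collect per-queue task counts for each state.

    All ZCARD calls are queued on a single pipeline, so the counts are
    fetched in one round trip instead of one per queue.
    """
    pipeline = tiger.connection.pipeline()
    seen = []
    for state in states:
        for queue in tiger.connection.smembers(tiger._key(state)):
            seen.append((state, queue))
            pipeline.zcard(tiger._key(state, queue))
    metrics = {}
    for (state, queue), count in zip(seen, pipeline.execute()):
        metrics.setdefault(state, {})[queue] = {'total': count}
    return metrics
```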

"""

key = tiger._key(state, queue)
count = tiger.connection.zcount(key, '-inf', '+inf')
Member:

zcard() does the same.

    def task_count_from_queue(self, tiger, queue, state):
        """
        Returns the number of tasks in a given queue and task state.
        """
Member:

Implementation details aside, I'm still unsure on whether this could be solved by just using tasks_from_queue(limit=0)[0]. I'm aware that currently passing a 0 limit to tasks_from_queue returns all the tasks but this behavior is not documented, and a more Pythonic way would be to say limit=None if we actually wanted all the tasks. We should at least define how tasks_from_queue should behave before we decide to add a new function here. @jkemp101 curious if you have any thoughts on this?

@jkemp101 (Member)

Are these stats different compared to what get_queue_stats returns? I didn't compare the output of both functions. Our Prometheus exporter uses those stats with the following code for our metrics. It just summarizes some subqueues.

        stats = defaultdict(lambda: defaultdict(int))
        for queue, depths in self._tiger.get_queue_stats().items():
            if queue not in self._isolated:
                queue = queue.split('.', 1)[0]
            for state in ('active', 'queued', 'scheduled', 'error'):
                stats[queue][state] += depths.get(state, 0)
        return stats

@alanhamlett (Contributor Author)

@jkemp101 I wasn't aware of the get_queue_stats method! I could change this PR to just document and test that existing method instead of adding a new one, since they're the same?

@alanhamlett (Contributor Author)

One thing I wish get_queue_stats could do is only return stats for top-level queues instead of including subqueue stats individually. Any ideas how to do this efficiently in a redis pipeline so it could be an option to get_queue_stats, or does it need to be done in Python space?

@jkemp101 (Member)

I can't think of an easy way to do this in Redis without some Lua scripting. And I kind of feel that computing stats should take second priority over the regular task processing so it makes sense to me to just offload this to Python code instead of putting any extra burden on Redis.
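Sketching that Python-side offload, building on the Prometheus-exporter snippet earlier in the thread (the function name and `isolated` parameter are illustrative, not part of tasktiger):

```python
from collections import defaultdict

def aggregate_top_level(stats, isolated=()):
    """Collapse subqueue stats (e.g. 'emails.bulk') into their top-level
    queue ('emails'), summing counts per state.

    `stats` is a mapping of queue name to per-state counts, the shape
    produced by get_queue_stats. Queues listed in `isolated` keep their
    full name instead of being folded into a parent.
    """
    out = defaultdict(lambda: defaultdict(int))
    for queue, depths in stats.items():
        if queue not in isolated:
            queue = queue.split('.', 1)[0]
        for state in ('active', 'queued', 'scheduled', 'error'):
            out[queue][state] += depths.get(state, 0)
    return out
```

This keeps Redis out of the aggregation entirely, in line with the point above that stats collection should not add load to task processing.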
