
function to emit metrics #1649

Merged
technillogue merged 2 commits into async from syl/emit-metric on May 17, 2024
Conversation

@technillogue (Contributor) commented on May 6, 2024

We need the model code to emit accurate input token counts, using the correct prompt formatting and tokenizer for the model in question. After some internal discussion, `from cog import emit_metric` seems like the best approach for this. Presently these metrics are ignored for non-official models and shouldn't really matter anywhere else, so this shouldn't be a concern for billing. This particular approach is janky, but it should unblock us for now.

Usage:

        # assumes `import cog` at module level
        async def predict(self, ...) -> ...:
            ...
            formatted_prompt = prompt_template.format(prompt=prompt, system_prompt=system_prompt)
            input_token_count = len(self.tokenizer(formatted_prompt)["input_ids"])
            cog.emit_metric(name="input_token_count", value=input_token_count)
            ...
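
For illustration, the same pattern could cover output-side metrics too; the sketch below is hypothetical and not part of this PR (`self.generate` and `output_text` are made-up names):

            # hypothetical sketch: emit an output token count using the same tokenizer
            output_text = await self.generate(formatted_prompt)
            output_token_count = len(self.tokenizer(output_text)["input_ids"])
            cog.emit_metric(name="output_token_count", value=output_token_count)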

@technillogue technillogue requested review from bfirsh and a team May 6, 2024 22:48
@technillogue technillogue force-pushed the syl/more-refactor branch 5 times, most recently from 53a5318 to 84cece7 on May 14, 2024 05:31
@technillogue technillogue force-pushed the syl/more-refactor branch 5 times, most recently from 7098fde to 40bdb54 on May 16, 2024 20:18
Base automatically changed from syl/more-refactor to async May 16, 2024 21:08
@technillogue technillogue force-pushed the syl/emit-metric branch 2 times, most recently from 5a0d81d to 56aa9d8 on May 17, 2024 00:13
@technillogue (Contributor, Author) commented:

TODO new doc file

@zeke (Member) left a comment:

I can't speak to the technical implementation of this, but I think it needs docs.

I would suggest creating a docs/metrics.md file to document why these exist and the API for sending metrics. Maybe also some recommendations like "Use snake case for metric names" or "Put an x_ prefix on your custom metric names" or similar.
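
For instance, the doc could show the convention with a one-liner like this (the metric name and value here are made up purely to illustrate the prefix and snake case):

    cog.emit_metric(name="x_watermark_detect_count", value=3)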

docs/metrics.md Outdated
@@ -0,0 +1,14 @@
# Metrics

Prediction objects have a `metrics` field. This normally includes `predict_time` and `total_time`. Official language models have metrics like `input_token_count`, `output_token_count`, `tokens_per_second`, and `time_to_first_token`. Currently, custom metrics from Cog are ignored when running on Replicate. Official Replicate-published models are the only exception to this. When running outside of Cog, you can emit custom metrics like this:
@zeke (Member) commented:

What does "When running outside of Cog" mean? Should this be "outside of Replicate"?

@technillogue (Contributor, Author) replied:

fixed; we'll also have another chance to review this when merging into main

Signed-off-by: technillogue <technillogue@gmail.com>
Signed-off-by: technillogue <technillogue@gmail.com>
@technillogue technillogue merged commit b35d1f7 into async May 17, 2024
10 checks passed
@technillogue technillogue deleted the syl/emit-metric branch May 17, 2024 21:45

3 participants