Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use AppInsigths to capture Cosmos Benchmark metrics #3892

Closed
7 of 8 tasks
M-Lipin opened this issue Jun 4, 2023 · 4 comments · May be fixed by #3897
Closed
7 of 8 tasks

Use AppInsigths to capture Cosmos Benchmark metrics #3892

M-Lipin opened this issue Jun 4, 2023 · 4 comments · May be fixed by #3897
Assignees

Comments

@M-Lipin
Copy link
Contributor

M-Lipin commented Jun 4, 2023

Subtasks:

  • Illustrate in the diagram how metrics are collected (detailed building blocks)
  • Capture Realtime P90, P99, P999, P9999
  • Explore other ways to capture metrics (AppMetrics and OpenTelemetry)
  • Investigate integration with AppInsights
  • Collect metrics with OpenTelemetry and send to AppInsights
  • Collect metrics from multiple VMs and aggregate these in AppInsights
  • Add dashboard using percentiles query
  • Availability visualization and 10 seconds granularity for latencies
sequenceDiagram
    participant A as Azure VM
    participant B as Benchmark Tool
    participant C as CosmosDB
    participant D as OpenTelemetry
    participant E as Application Insights

    A->>B: Launch Benchmark Tool
        
        B->>C: Request
        activate C
        
        C->>B: Response 
        deactivate C
        B->>D: Put metrics on each operation completion
        activate D
        
        par Every 5 sec (flush)
            D->>E: Send successes and failures, latencies and RPS
            Note right of C: collect and aggregate raw data
        end
        deactivate D
        B->>B: Benchmark completed
@M-Lipin
Copy link
Contributor Author

M-Lipin commented Jun 7, 2023

  • Right way to capture.
  • Need to fix the issue with point percentiles to Application Insights.
  • Distinguish latencies by read latencies, write latencies, queries and so on.
  • Fix the issue with window latencies (10 seconds or configured).

@M-Lipin
Copy link
Contributor Author

M-Lipin commented Jun 14, 2023

  • Need to collect metrics using OpenTelemetry
  • Need to aggreagate metrics from multiple VMs
  • Need to illustrate the process in details how metrics are collected and sent to AppInsights

@M-Lipin
Copy link
Contributor Author

M-Lipin commented Jun 22, 2023

#3932 - Dashboard template

@philipthomas-MSFT
Copy link
Contributor

Please add project type for associated PRs so that it appears on board.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

4 participants