Homer 10 - Call Flow - Sorting Issues/Wrong Order #625

Open
tony1661 opened this issue Dec 29, 2023 · 19 comments · Fixed by metrico/grafana-flow#47 or metrico/grafana-flow#48

@tony1661

tony1661 commented Dec 29, 2023

I am noticing an issue in Homer 10's Call Flow dashboard. The messages in the Call Flow are not shown in the correct order, which makes troubleshooting difficult.

I have a FreeSWITCH server (FusionPBX) with the portable heplify agent running: heplify -hs homer-stack-ip:9060

If I search by the Call-ID in the SIP headers, I see all SIP messages associated with the call. See below:
image

Obviously the INVITE must have occurred before the 407; however, that is not what Grafana shows.

Based on the HEP Flow panel, the messages should be sorted oldest to newest.

If I look into each message I see that the INVITE has a later date than the 407.

INVITE Date
image

407 Date
image

The issue is similar to what I see in Homer 7's web-ui, which leads me to think it may be related to heplify. In Homer 7, when searching for a call, the results are also in the wrong order; however, when I click on the Session ID to view the ladder, it is in the correct order.

Is it possible that the ladder in Homer 7 is referencing a different timestamp that is also being referenced in Homer 10?

@tony1661
Author

I've tested using captagent instead of heplify and the issue persists in Homer 10.

@tony1661
Author

To add to this issue, I took a packet capture of a call that has the SIP messages displayed in the wrong order in Homer 10.

It seems that in the HEP packets, the Unix Timestamp doesn't change. See below:
image

The last two messages (BYE and 200 OK) have a different timestamp. In Homer they do appear at the bottom of the ladder, but in a different order than in the packet capture.

It may be worth including the Timestamp μs field when sorting.

image
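
A minimal sketch of that suggestion, assuming each message exposes the two HEP timestamp fields seen in the capture. The `HepMessage` class, its field names, and the sample values are hypothetical and only illustrate the idea; this is not the grafana-flow implementation.

```python
from dataclasses import dataclass


@dataclass
class HepMessage:
    """Hypothetical stand-in for one captured SIP message and its HEP timestamps."""
    summary: str
    ts_sec: int   # HEP "Unix Timestamp" (whole seconds)
    ts_usec: int  # HEP "Timestamp μs" (microsecond offset within that second)


def capture_time_us(msg: HepMessage) -> int:
    """Combine both fields into a single microsecond-resolution sort key."""
    return msg.ts_sec * 1_000_000 + msg.ts_usec


def sort_oldest_first(messages):
    """Return the messages ordered by full capture time, oldest first."""
    return sorted(messages, key=capture_time_us)


# Made-up sample: all three messages share the same second, so sorting on
# ts_sec alone cannot tell them apart.
messages = [
    HepMessage("407 Proxy Authentication Required", 1703870000, 250_000),
    HepMessage("INVITE", 1703870000, 120_000),
    HepMessage("ACK", 1703870000, 310_000),
]

ordered = [m.summary for m in sort_oldest_first(messages)]
print(ordered)
```

With second-level resolution only, messages that share a second keep whatever order they arrived in, which may be why the ladder and the packet capture disagree.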

@lmangani
Member

lmangani commented Dec 31, 2023

Thanks for the report, @tony1661. We're investigating and will make sure this is part of the next grafana-flow release.
@AlexeyOplachko, could you check this after the holidays?

@tony1661
Author

tony1661 commented Jan 4, 2024

@AlexeyOplachko, let me know if I can help in any way. I can provide logs, pcaps, etc.

I have this on a production FreeSWITCH server with heplify and captagent both running.

There are hundreds of calls a day that we can look at.

@lmangani
Member

lmangani commented Jan 6, 2024

Hi @tony1661, we're back next week and we'll most definitely address this.

@RFbkak37y3kIY
Member

Pushed a fix for the Grafana plugin: metrico/grafana-flow#47

It uses the [tsNs] field for more accurate sorting of SIP messages.

RFbkak37y3kIY added a commit to metrico/grafana-flow that referenced this issue Jan 9, 2024
fix: sipcapture/homer#625 used tsNs field for sorting of messages
@tony1661
Author

Hi all,

I saw that some code was merged. If I pull the latest docker images, will I be able to test this?

@lmangani
Member

As long as it's using plugin version 10.0.10, you can also update an existing setup.

@lmangani
Member

@tony1661 here's how

@tony1661
Author

@lmangani Thanks for your quick response. I tested and the issue still seems to be there. Is there anything I can provide to help (logs, etc.)?

@AlexeyOplachko
Collaborator

For starters, can you please verify that your Grafana instance actually has the new plugin version?
your_grafana_url/plugins/qxip-flow-panel
image

On our side, we'll try to replicate this issue today and see if we need anything else from you.

@tony1661
Author

Hi @AlexeyOplachko ,

I have verified that I have 10.0.10 installed. See below:
image

@tony1661
Author

Anything I can help with?

@lmangani
Member

@AlexeyOplachko please provide an update

@AlexeyOplachko
Collaborator

AlexeyOplachko commented Mar 14, 2024

Quoting @tony1661: "Anything I can help with?"

@tony1661
Can you please provide screenshots of the Message details, with all the info in them, for two messages that are in the incorrect order?
image
image
Also, can you please check whether Sort Items is set?
image

@tony1661
Author

Hi @AlexeyOplachko, sorry for the delay on this.

Sort Items is set to "Sort by Time: Oldest first".

Here is what the call flow looks like:

image

Here is the first message (INVITE):

image

Here is the second message (200 OK):

image

Here is the fourth message (that is supposed to be second - 100 Trying):

image

@AlexeyOplachko
Collaborator

Hi @tony1661, thanks for the reply. It seems this is not a sorting issue, but an issue with the data.

If you look closely, the 100 Trying message has a timestamp almost 4 minutes later than the 200 OK.
All three timestamps (the one in the labels, the one in the Time field, and the nanosecond one) agree with each other and support this.
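
A rough way to spot this kind of data problem, as opposed to a sorting problem, is to look for a provisional (1xx) response that is stamped later than a final response. The sketch below assumes all responses belong to the same SIP transaction; the dictionary keys `status` and `ts_ns` and the sample values are hypothetical, not Homer's actual schema.

```python
def provisional_after_final(responses):
    """Return provisional (1xx) responses stamped later than the first final response.

    Assumes every entry belongs to the same SIP transaction. A 100 Trying
    stamped after a 200 OK points at the capture timestamps, not the sort.
    """
    final_times = [r["ts_ns"] for r in responses if r["status"] >= 200]
    if not final_times:
        return []
    first_final = min(final_times)
    return [r for r in responses
            if r["status"] < 200 and r["ts_ns"] > first_final]


# Made-up sample: the 100 Trying is stamped almost 4 minutes after the 200 OK.
base_ns = 1_700_000_000 * 1_000_000_000
responses = [
    {"status": 200, "ts_ns": base_ns + 1_000_000_000},
    {"status": 100, "ts_ns": base_ns + 235_000_000_000},
]

suspects = provisional_after_final(responses)
print([r["status"] for r in suspects])
```

If a check like this flags messages, the displayed order is faithful to the stored timestamps and the investigation moves to the capture agents or the host clock.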

@tony1661
Author

tony1661 commented Apr 25, 2024

@AlexeyOplachko Yeah, something seems off with the data. The issue happens with multiple HEP clients (heplify and captagent).

I have some screenshots from pcaps above that may assist.

I am running FreeSWITCH (via FusionPBX).
