Thread Names not updating with collapsed output #511

SakaeKac · 2021-11-26T01:15:11Z

I have an Java application that uses an Executor to parallelize work. Before each chunk of work, the Runnable renames the worker thread based on the task that it is accomplishing and then resets the name at the end of its work. When I use async-profiler externally, it looks like it connects via MXBeans and takes thread dumps, which end up having the renamed thread names. This is very useful as I often want to focus in on one specific task. But when I run the profiler as an agent using the command and dumping collapsed output

start,event=itimer,threads

The thread names all seem to be the original ThreadFactory-assigned thread names. I tried to look through the code and I think it's caching the thread names based on the OS thread id, so it's only getting the first name that it sees and then that's sticky forever after. I'm wondering if there is any sort of flag, configuration or other method of dumping that would preserve the JVM thread name at the point that the sample is taken and work when running as an agent.

The text was updated successfully, but these errors were encountered:

apangin · 2021-11-27T16:45:27Z

One Java thread has one name in async-profiler reports.
The profiler takes care of thread renaming, but keeps only the last name.

SakaeKac · 2021-11-28T23:43:37Z

Is there any way to get the thread name assigned to the sample when the sample is taken rather than when the profiling is completed?

After reading a bit about this: #106

It looks like JFR was introduced to be able to keep the timestamps of individual samples. If I switch JFR will it also maintain the thread name from the point of the sample? Or is does it just maintain a reference to the thread and I will get whatever name happens to exist at the time that I collect the sample?

apangin · 2021-11-29T00:34:01Z

In JFR format, thread names are obtained in the same way.

SakaeKac · 2021-11-29T01:34:28Z

Any ideas or hints on where to adjust things to try to maintain the thread name that existed at the point of sample? I believe I understand that it's impossible to eliminate races and it's clear that it would add to the overhead in terms of CPU and memory, but wondering what part of the code would need to be adjusted to be able to even support this?

apangin · 2021-12-03T00:39:20Z

Current data model of the profiler implies at most one instance of a given thread (i.e. Java thread can't have more than one identifier).
Changing this would require notable architectural modifications, which are not on my list right now, unfortunately.

SakaeKac · 2021-12-06T03:40:48Z

I was looking through the code a bit more to try to figure out where this would happen. It looks like the threadId is pushed onto the stack at

https://github.com/jvm-profiling-tools/async-profiler/blob/170451990ba6b1c16ea54491a90d0ca3d81dee48/src/profiler.cpp#L653-L655

And then read back out at

https://github.com/jvm-profiling-tools/async-profiler/blob/170451990ba6b1c16ea54491a90d0ca3d81dee48/src/frameName.cpp#L245-L257

I understand now that ThreadMap is the map of thread id to the name. So, it's clear that in the profiler you are just grabbing the thread id and then using that to get the name of the thread in the frameName code. The actual updating of the _thread_names appears to happen asynchronously.

It seems like it should be possible to implement something that keeps track of the thread name at time of sample by

Adjusting the code at https://github.com/jvm-profiling-tools/async-profiler/blob/170451990ba6b1c16ea54491a90d0ca3d81dee48/src/profiler.cpp#L653-L655 to lookup the thread name with a jvmti VM::getThreadInfo call, then that name could be pushed into a dictionary.
Adjust the code at https://github.com/jvm-profiling-tools/async-profiler/blob/170451990ba6b1c16ea54491a90d0ca3d81dee48/src/frameName.cpp#L245-L257 to know about the dictionary of names and lookup with that instead. Maybe introduce a new type BCI_THREAD_NAME instead of BCI_THREAD_ID and use that to differentiate?

I'm not well versed in C nor in jvmti so not really sure how easy the above things are. The open questions in my mind are

How expensive is the VM::getThreadInfo call, if that's called for all of the threads at the point of generating the event and it is expensive, this could be a non-starter.
Is there a good object that can be used as a dictionary? I had assumed that there would be something already in the code and looked around, but it looks like the current profiling code always assumes that it can get the strings from the VM or some other part of the system and doesn't actually maintain its own dictionaries. This makes a lot of sense as it minimizes memory usage, but also didn't give me any hints for how easy/hard it is to create a dictionary?

Does this cover the architectural modifications you were talking about? Or is there something I'm missing?

apangin · 2021-12-11T15:47:16Z

GetThreadInfo is not safe inside a signal handler.
In general, any function that may lock or allocate, is prohibited within Profiler::recordSample.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thread Names not updating with collapsed output #511

Thread Names not updating with collapsed output #511

SakaeKac commented Nov 26, 2021

apangin commented Nov 27, 2021

SakaeKac commented Nov 28, 2021

apangin commented Nov 29, 2021

SakaeKac commented Nov 29, 2021

apangin commented Dec 3, 2021

SakaeKac commented Dec 6, 2021

apangin commented Dec 11, 2021

Thread Names not updating with collapsed output #511

Thread Names not updating with collapsed output #511

Comments

SakaeKac commented Nov 26, 2021

apangin commented Nov 27, 2021

SakaeKac commented Nov 28, 2021

apangin commented Nov 29, 2021

SakaeKac commented Nov 29, 2021

apangin commented Dec 3, 2021

SakaeKac commented Dec 6, 2021

apangin commented Dec 11, 2021