New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PSDK crashes when trying to load a neural network #145
Comments
Here are more error messages that I obtained in another crash which caused the jetson to shut down when the model is loaded: |
Agent comment from Leon in Zendesk ticket #101560: Hello, thank you for contacting DJI. Thank you for your patience. I have read your description and suspect that the abnormality is caused by excessive load on your serial link. You can do this and adjust the baud rate to the highest. If you have subscription behavior, please try to reduce the The number of TOPIC subscribed, or reduce the TOPIC frequency of subscription. Thank you for your support of DJI products. Best Regards, °°° |
Hello! Thank you for your reply. I did increase the baud rate to 1000000. I think there are some positive effects. But I would like to know if I can reduce the frequency of the image subscription too. The liveview sample works slightly different than the dji subscription topics. If I can subscribe that images at a slower rate, that could help reduce the load as well. We are planning to subscribe both RGB and thermal images for our multimodal neural network which would fuse together both the images for inference. I would like to know if that is possible as well. |
Agent comment from Leon in Zendesk ticket #101560: Hello, thank you for contacting DJI. Sorry, you are referring to reducing the frequency of image subscription. Can you provide the specific interface function? So that there is no deviation in our communication, thank you very much ~ Thank you for your support of DJI products. Best Regards, °°° |
Hello! There are more updates regarding this issue. After running the code several times, it has been found that increasing baud rate still doesn't solve the issue. But there are some interesting insights:
Given the fact that the psdk's .c files have been compiled into binary libraries, it is a bit hard to debug the underlying issues. |
Agent comment from Leon in Zendesk ticket #101560: Hello, thank you for contacting DJI. Regarding your previous question, the frequency of receiving pictures, this cannot be changed. Thank you for your support of DJI products. Best Regards, °°° |
Hello, |
Agent comment from Leon in Zendesk ticket #101560: Hello, thank you for contacting DJI. Thank you for your patience. I was on vacation yesterday, so the response time was slowed down. Thank you for your support of DJI products. Best Regards, °°° |
Hello, here are some updates.
|
I would like to add one thing: 1.147][linker]-[Warn]-[DjiCommand_SendAsyncHandle:894) Command async send retry: index = 0, retryTimes = 3, cmdSet = 0, cmdId = 135 When we start the NN program first and then run the PSDK code, we notice that when the following error message appears in the PSDK code, the PSDK will crash: [1.147][linker]-[Warn]-[DjiCommand_SendAsyncHandle:894) Command async send retry: index = 0, retryTimes = 3, cmdSet = 0, cmdId = 135 I would like to know if there is any function that we can call using the PSDK that would indicate if Command_SendAsynchandle is sending a retry message or not or something similar so that we could automatically terminate the program with a return code -1 and retry the code so that the issue is solved. |
Agent comment from Leon in Zendesk ticket #101560: Hello, thank you for contacting DJI. Could you please attach the complete PSDK log? This will allow us to better confirm your problem, and we will confirm for you whether there is any way to know if the serial port communication is abnormal. Thank you for your support of DJI products. Best Regards, °°° |
Hello! I am attaching the following three files and I hope that would help you debug the issue.
|
Agent comment from Leon in Zendesk ticket #101560: Hello, thank you for contacting DJI. Thank you for the information you provided. We will take a look at your log first. If there is any progress, we will synchronize it with you again. Thank you for your support of DJI products. Best Regards, °°° |
Hello! |
Agent comment from Leon in Zendesk ticket #101560: Hello, thank you for contacting DJI. Regarding your question, our internal team is still analyzing it and has not yet made a final conclusion. You can try and see if it helps you:
Thank you for your support of DJI products. Best Regards, °°° |
Hello! We are using a single board computer to execute the PSDK scripts. This one to be precise. This is an Nvidia Jetson Xavier board and I believe it uses the SDK round ribbon cable to communicate with the Matrice M30T's PSDK port. So I am not sure if the points you mentioned above would be relevant for our situation. One thing I believe that can be useful for us is if there is a timeout in the PSDK's serial communication, some way to re-establish communication would be great. Everytime the PSDK crashes, I have to restart the PSDK and this would be hard to do mid air. And even when I restart it does take a while for the PSDK to reliably execute without showing the "Command async send retry: index = 0, retryTimes = 3, cmdSet = 0" warning message. This is affecting our application a lot. We are also optimising on our end to ensure the serial communication is never blocked. But the code base is increasing in size as the project grows and it would be beneficial to have a redundancy on PSDK's side of things. Some kind of a function that I can call in the PSDK to restart the connection with the drone or some self correcting mechanism would be very useful. |
Hello!
In my code I have PSDK subscriptions callbacks which store images, GPS, velocity, altitude, etc from the matrice m30t drone.
I am using a jetson xavier based payload computer to run the PSDK. When I open another process in parallel which loads a yolo object detection network and starts performing inferences with it, there is a chance that the neural network crashes the Payload SDK and the callbacks stop receiving the latest image, widget and other values. These are the error messages that I receive before the PSDK crashes:
[87.903][linker]-[Error]-[DjiCommand_SendAsyncHandle:906) Command async send error 0
[87.903][infor]-[Warn]-[DjiAircraftInfo_CheckPingStatusAsyncCallback:265) connect status async timeout
I am not sure why this occurs but it seems that this error occurs when there is a high computational load. Alos, the PSDK crash is random. Sometimes the neural network is succesfully loaded and starts performing the inferences and other times, it crashes. I tried to look into the source code of the payload SDK but I could not find much. Any help would be appreciated.
The text was updated successfully, but these errors were encountered: