
Failure to reproduce the hover result #169

Open
zhixiongzh opened this issue Aug 24, 2023 · 7 comments
Labels
question Further information is requested

Comments

@zhixiongzh

Hi,

I am running the provided example gym_pybullet_drones/examples/learn.py, but the agent fails to hover at the desired position.

I then noticed that the provided script differs from the one used in the paper, so I checked out the master branch and ran experiments/learning/singleagent.py, but I hit many errors due to version conflicts.

Is there any suggestion for training a good agent with gym_pybullet_drones/examples/learn.py on the current main branch?

@RibhavOjha

RibhavOjha commented Aug 28, 2023

@zhixiongzh I am running into exactly the same problem: learn.py doesn't hover properly and just crashes. When I run singleagent.py, it gives this error:

 File "C:\Users\Username\AppData\Roaming\Python\Python311\site-packages\torch\nn\modules\linear.py", line 96, in __init__
    self.weight = Parameter(torch.empty((out_features, in_features), **factory_kwargs))
                            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: empty() received an invalid combination of arguments - got (tuple, dtype=NoneType, device=NoneType), but expected one of:
 * (tuple of ints size, *, tuple of names names, torch.memory_format memory_format, torch.dtype dtype, torch.layout layout, torch.device device, bool pin_memory, bool requires_grad)
 * (tuple of ints size, *, torch.memory_format memory_format, Tensor out, torch.dtype dtype, torch.layout layout, torch.device device, bool pin_memory, bool requires_grad)
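
For context (my own reading of the traceback, not something confirmed here): this failure mode matches nn.Linear being handed something other than an int as a layer size, so torch.empty() receives a tuple containing a non-int and no overload matches. A minimal sketch that reproduces the same error:

    import torch.nn as nn

    # Hypothetical repro: passing a dict where a layer width (int) is
    # expected raises the same torch.empty() overload TypeError as above.
    layer_spec = dict(vf=[256, 128], pi=[256, 128])
    nn.Linear(512, layer_spec)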

Any updates?

@zhixiongzh
Author

@RibhavOjha I only ran the code in learn.py, not singleagent.py. I have not yet reproduced the result using that example either.

@RibhavOjha

@zhixiongzh are you also getting the same error as me for singleagent.py?

@JacopoPan
Member

Hi @zhixiongzh and @RibhavOjha,

To reproduce the work in the paper, you should check out the master or paper branch (and I am aware that some of the third-party dependencies are poorly maintained and not backward compatible, unfortunately).

The main/default branch is currently being reworked to support SITL simulation; once that is done, I will re-introduce the learning examples (this time based on gymnasium and stable-baselines3 2.0).
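
In the meantime, a minimal sketch of what such an example could look like with stable-baselines3 2.x, which targets the gymnasium API; the HoverAviary import path and constructor arguments below are assumptions and may differ between branches:

    # Sketch only, not the official example: PPO on HoverAviary with
    # stable-baselines3 2.x (gymnasium API). Import path is an assumption.
    from stable_baselines3 import PPO
    from gym_pybullet_drones.envs.HoverAviary import HoverAviary

    env = HoverAviary(gui=False)
    model = PPO("MlpPolicy", env, verbose=1)
    model.learn(total_timesteps=100_000)  # hovering likely needs far more steps
    model.save("ppo_hover")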

@cyril-data

I think you can get rid of this error:

 File "C:\Users\Username\AppData\Roaming\Python\Python311\site-packages\torch\nn\modules\linear.py", line 96, in __init__
    self.weight = Parameter(torch.empty((out_features, in_features), **factory_kwargs))
                            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: empty() received an invalid combination of arguments - got (tuple, dtype=NoneType, device=NoneType), but expected one of:
 * (tuple of ints size, *, tuple of names names, torch.memory_format memory_format, torch.dtype dtype, torch.layout layout, torch.device device, bool pin_memory, bool requires_grad)
 * (tuple of ints size, *, torch.memory_format memory_format, Tensor out, torch.dtype dtype, torch.layout layout, torch.device device, bool pin_memory, bool requires_grad)

by changing onpolicy_kwargs in singleagent.py to this:

    # Old format (shared trunk plus a dict of separate value/policy heads
    # inside the net_arch list), which newer stable-baselines3 rejects:
    # onpolicy_kwargs = dict(activation_fn=torch.nn.ReLU,
    #                        net_arch=[512, 512, dict(vf=[256, 128], pi=[256, 128])]
    #                        ) # or None
    # Flat list of shared layer sizes, accepted by current versions:
    onpolicy_kwargs = dict(activation_fn=torch.nn.ReLU,
                           net_arch=[512, 512, 256, 128]
                           )  # or None
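
For what it's worth, my understanding is that stable-baselines3 1.8+ dropped support for a dict entry inside the net_arch list, so the dict ends up passed to nn.Linear as a layer size, which is exactly what produces the torch.empty() error above. If you want to keep separate policy/value heads instead of a fully shared network, a sketch of the rough equivalent in the newer format, assuming stable-baselines3 >= 1.8:

    # Assumed equivalent for stable-baselines3 >= 1.8: net_arch as a dict
    # keeps separate policy (pi) and value (vf) heads.
    onpolicy_kwargs = dict(activation_fn=torch.nn.ReLU,
                           net_arch=dict(pi=[512, 512, 256, 128],
                                         vf=[512, 512, 256, 128])
                           )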

But I face the same difficulty reproducing the results with PPO and the 1D action space (identical input to all motors) with RPMs ("one_d_rpm")...

@abdul-mannan-khan

Same here. I kept the training running all the way to 1e10 steps, and it still did not work.

@JacopoPan
Member

See #180 for the current status.
