Adding heterogeneous observation social dilemma environment #4

marimeireles · 2024-03-15T13:59:09Z

No description provided.

… opacity to keep consistency among the code

…observations = 0.5 for partial observable agents

…ints

…mples set up

review-notebook-app · 2024-03-15T13:59:14Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

marimeireles · 2024-03-15T15:05:31Z

pyCRLD/Environments/HistoryEmbedding.py

@@ -346,10 +347,64 @@ def TransitionTensor(self):

    def RewardTensor(self):
        return histSjA_RewardTensor(self.baseenv, self.h)
-
+
    def ObservationTensor(self):


I've changed this function to be able to generate different observation tensors for each agent.

marimeireles · 2024-03-15T15:06:41Z

pyCRLD/Environments/HeterogeneousObservationsEnv.py

@@ -0,0 +1,170 @@
+# AUTOGENERATED! DO NOT EDIT! File to edit: ../../nbs/Environments/02_HeterogeneousObservationsEnv.ipynb.


Most of the changes are within this file. It's largely adapting the ebase file to deal with multiple observations.

marimeireles · 2024-03-15T15:09:06Z

pyCRLD/Environments/MultipleObsSocialDilemma.py

@@ -0,0 +1,127 @@
+# AUTOGENERATED! DO NOT EDIT! File to edit: ../../nbs/Environments/12_MultipleObsSocialDilemma.ipynb.


This file simply implements the social dilemma layer into the heterogeneous observation env. file.
I've initially tried to incorporate the "contract" idea because I saw this in the Uncertain Environment file, however, I don't really understand the dynamics of contract and I don't think it's fully functional, I need to work on it.
I thought it wasn't relevant for our project as IPD only has the one state .. Please let me know if I misunderstood this.

marimeireles · 2024-03-15T15:18:42Z

I'm also a bit confused on whether it's possible to have observation tensors summing for numbers > 1 or < 1. I guess the only reason why we cannot is because of the generate_stochastic_observations... But we could change that.
I'm not sure if it makes sense, but I thought it could be possible for an agent to have for example, 0% chance of observing something. Or having tensors looking like [0, 0.8, 0.6, 0.4] or like [0.7,0.,0.,0.]. If this is not possible are there other reasons why it is not possible other than using generate_stochastic_observations in the step function?

marimeireles added 30 commits February 20, 2024 20:55

add base env for multi obs and socialdill with multi obs

4e65617

change names to MultipleObsSocialDilemma from socialdilemma

5a21dac

Work feb21

0860304

add hetero

5416ebb

Modify multi obs social dilemma

51b9adf

dont remember wt i changed hetero and mul obs env

cc44813

fix observabiity reward and transition for heterogeneuous environments

6dea33c

Fix the observation gen tension function and fix from transparency to…

fa3668d

… opacity to keep consistency among the code

Last version of hetero env before making it compat with history

85f10e5

Change Observation type to be explicitely string

3ab50a5

Change hetero env so its env = . .

26f086f

Change set in the correct place

b827a82

Remove this nb bc it's confusing me

e45d1b6

Add the states to the right class finally

e96586d

Update py files for heteroobs and multiobsdilemma

5e05c86

Stringfication of history into py file

006d119

Call Tss function with an explicity object

b415071

Organize and add some tests to the multi obs env

a558dca

Observations dont have to sum up[ to 1 anymore because we might have …

f960634

…observations = 0.5 for partial observable agents

Fix the label generation for Osets

c297264

Testing a bunch of stuff remove null chars so i can compile?

ef85f5b

Remove null chars from ebase

627a9c4

Change the if conditions so it accepts any kinds of numbers not only …

81deb30

…ints

Make call for parent class explicit on APOBase

aaf561c

Make call for parent class explicit on APOBase

3c70da3

Notebooks with corrected observations and different kinds + a few exa…

dffeeb6

…mples set up

Finish 12multipleobssoc and rm explorations

ce6b25b

Add the ability to have different types of obs within the same game

6d47d3d

Remove content added by me on hist env embedding nb

8ed5f37

Adding screenshot from tests ran in h222 env

fa8f6dc

marimeireles added 4 commits March 12, 2024 18:06

Explorations with het obs AC

c96083c

Delete irrelevant nb

4573ee6

Final adjustments initialization and exs and comments

c21b7e1

Add generated py files

31f9157

marimeireles commented Mar 15, 2024

View reviewed changes

marimeireles marked this pull request as draft March 15, 2024 15:09

marimeireles added 4 commits March 22, 2024 10:14

Introduces incomplete SARSA POS strategy

d46c1dc

Comment hOset function

01dd1dd

Up to date v of heterogeneous obs env

7050cd2

Fix missing self in obs assert

ac8ee53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding heterogeneous observation social dilemma environment #4

Adding heterogeneous observation social dilemma environment #4

marimeireles commented Mar 15, 2024

review-notebook-app bot commented Mar 15, 2024

marimeireles Mar 15, 2024

marimeireles Mar 15, 2024

marimeireles Mar 15, 2024

marimeireles commented Mar 15, 2024 •

edited

		@@ -0,0 +1,170 @@
		# AUTOGENERATED! DO NOT EDIT! File to edit: ../../nbs/Environments/02_HeterogeneousObservationsEnv.ipynb.

		@@ -0,0 +1,127 @@
		# AUTOGENERATED! DO NOT EDIT! File to edit: ../../nbs/Environments/12_MultipleObsSocialDilemma.ipynb.

Adding heterogeneous observation social dilemma environment #4

Are you sure you want to change the base?

Adding heterogeneous observation social dilemma environment #4

Conversation

marimeireles commented Mar 15, 2024

review-notebook-app bot commented Mar 15, 2024

marimeireles Mar 15, 2024

Choose a reason for hiding this comment

marimeireles Mar 15, 2024

Choose a reason for hiding this comment

marimeireles Mar 15, 2024

Choose a reason for hiding this comment

marimeireles commented Mar 15, 2024 • edited

marimeireles commented Mar 15, 2024 •

edited