New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Cervical cancer #1287

Draft

andrew-phillips-1 wants to merge 49 commits into master from cervical_cancer_

Collaborator

andrew-phillips-1 commented Mar 4, 2024

Here's a first draft of this module. I need to do another search to try to find additional data for calibration.

I couldn't upload the draft write as it was asking me to use the command line and Git LFS.

andrew-phillips-1 added 30 commits

October 16, 2023 16:26

d6bdece

533357a

116f241

4bc722b

9a3b48a

0393e17

bc1ac59


          first pass at cervical cancer module based on editing breast cancer m…

5a66e5d

…odule


          first pass at cervical cancer module based on editing breast cancer m…

b24c6bd

…odule


          first pass at cervical cancer module based on editing breast cancer m…

cc488bd

…odule


          first pass at cervical cancer module based on editing breast cancer m…

144644a

…odule


          first pass at cervical cancer module based on editing breast cancer m…

f1015b5

…odule


          first pass at cervical cancer module based on editing breast cancer m…

f2b44b0

…odule


          first pass at cervical cancer module based on editing breast cancer m…

0d06e44

…odule


          first pass at cervical cancer module based on editing breast cancer m…

c964058

…odule


          first pass at cervical cancer module based on editing breast cancer m…

1b0226b

…odule


          first pass at cervical cancer module based on editing breast cancer m…

fdcea86

…odule


          first pass at cervical cancer module based on editing breast cancer m…

7f13653

…odule


          first pass at cervical cancer module based on editing breast cancer m…

356973c

…odule


          first pass at cervical cancer module based on editing breast cancer m…

91efced

…odule


          first pass at cervical cancer module based on editing breast cancer m…

443401b

…odule


          first pass at cervical cancer module based on editing breast cancer m…

9e60e5c

…odule


          first pass at cervical cancer module based on editing breast cancer m…

8f5e8f0

…odule


          first pass at cervical cancer module based on editing breast cancer m…

…odule


          first pass at cervical cancer module based on editing breast cancer m…

86a503f

…odule


          first pass at cervical cancer module based on editing breast cancer m…

242de2c

…odule


          first pass at cervical cancer module based on editing breast cancer m…

77a2808

…odule


          HSIs

41b9743


          HSIs

0fe0ee1

tbhallett added this to In progress in PR priorities via automation

tbhallett moved this from In progress to Ready for EM review in PR priorities

Collaborator

mnjowe commented Apr 2, 2024

Thanks @andrew-phillips-1 for this first draft. I think it looks good. I will be adding some few comments/suggestions for your consideration.

Collaborator Author

andrew-phillips-1 commented Apr 2, 2024

Thanks @mnjowe

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

+                      # ----- SCHEDULE LOGGING EVENTS -----
+                      # Schedule logging event to happen immediately
+                      sim.schedule_event(CervicalCancerLoggingEvent(self), sim.date + DateOffset(months=0))

Collaborator

mnjowe Apr 2, 2024

Why schedule logging event immediately yet polling event is starting a month after? Are we interested in logging defaults also?

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

+                      sim.schedule_event(CervicalCancerLoggingEvent(self), sim.date + DateOffset(months=0))
+                      # ----- SCHEDULE MAIN POLLING EVENTS -----
+                      # Schedule main polling event to happen immediately

Collaborator

mnjowe Apr 2, 2024

Suggested change

      
                    # Schedule main polling event to happen immediately
          
                    # Schedule main polling event to happen after a month

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +88 to +89

		Types.REAL,
		"probabilty per month of oncogenic hpv infection",

Collaborator

mnjowe Apr 2, 2024

Suggested change

      
                        Types.REAL,
          
                        "probabilty per month of oncogenic hpv infection",
          
                        Types.REAL,
          
                        "probability per month of oncogenic hpv infection",

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +92 to +93

		Types.REAL,
		"probabilty per month of incident cin1 amongst people with hpv",

Collaborator

mnjowe Apr 2, 2024

Suggested change

      
                        Types.REAL,
          
                        "probabilty per month of incident cin1 amongst people with hpv",
          
                        Types.REAL,
          
                        "probability per month of incident cin1 amongst people with hpv",

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +96 to +97

		Types.REAL,
		"probabilty per month of incident cin2 amongst people with cin1",

Collaborator

mnjowe Apr 2, 2024

Suggested change

      
                        Types.REAL,
          
                        "probabilty per month of incident cin2 amongst people with cin1",
          
                        Types.REAL,
          
                        "probability per month of incident cin2 amongst people with cin1",

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +100 to +101

		Types.REAL,
		"probabilty per month of incident cin3 amongst people with cin2",

Collaborator

mnjowe Apr 2, 2024

Suggested change

      
                        Types.REAL,
          
                        "probabilty per month of incident cin3 amongst people with cin2",
          
                        Types.REAL,
          
                        "probability per month of incident cin3 amongst people with cin2",

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +104 to +105

		Types.REAL,
		"probabilty per month of incident stage1 cervical cancer amongst people with cin3",

Collaborator

mnjowe Apr 2, 2024

typo. change probabilty to probability for Ln 105, 109, 113, 117 and 121.

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +596 to +597

		def on_hsi_alert(self, person_id, treatment_id):
		pass

Collaborator

mnjowe Apr 2, 2024

are you planning on doing something here in the next draft? if not I think we can remove the function.

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +331 to +332

		# this was not assigned here at outset because baseline value of hv_inf was not accessible - it is assigned
		# st start of main polling event below

Collaborator

mnjowe Apr 2, 2024

why is baseline value for hv_inf not accessible yet HIV module is included in the dependencies section and hv_inf has been initialised here in Hiv module? what was the error?

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +669 to +687

+                      # this was done here and not at outset because baseline value of hv_inf was not accessible
+                      given_date = pd.to_datetime('2010-02-03')
+                      if self.sim.date < given_date:
+                          women_over_15_nhiv_idx = df.index[(df["age_years"] > 15) & (df["sex"] == 'F') & ~df["hv_inf"]]
+                          df.loc[women_over_15_nhiv_idx, 'ce_hpv_cc_status'] = rng.choice(
+                              ['none', 'hpv', 'cin1', 'cin2', 'cin3', 'stage1', 'stage2a', 'stage2b', 'stage3', 'stage4'],
+                              size=len(women_over_15_nhiv_idx), p=p['init_prev_cin_hpv_cc_stage_nhiv']
+                          )
+                          women_over_15_hiv_idx = df.index[(df["age_years"] > 15) & (df["sex"] == 'F') & df["hv_inf"]]
+                          df.loc[women_over_15_hiv_idx, 'ce_hpv_cc_status'] = rng.choice(
+                              ['none', 'hpv', 'cin1', 'cin2', 'cin3', 'stage1', 'stage2a', 'stage2b', 'stage3', 'stage4'],
+                              size=len(women_over_15_hiv_idx), p=p['init_prev_cin_hpv_cc_stage_hiv']
+                          )

Collaborator

mnjowe Apr 2, 2024

I think we can do this in initialise population? I'm interested to know why the value for hv_inf is not accessible at initialise population yet we have included Hiv in the list of dependencies

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py


		df.ce_selected_for_via_this_month = False

		eligible_population = df.is_alive & (df.sex == 'F') & (df.age_years > 30) & (df.age_years < 50) & \

Collaborator

mnjowe Apr 2, 2024

Suggested change

      
                    eligible_population = df.is_alive & (df.sex == 'F') & (df.age_years > 30) & (df.age_years < 50) & \
          
                    eligible_population = df.is_alive & (df.sex == 'F') & (df.age_years.between(30, 50, inclusive="neither") & \

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

+                                          | df.ce_ever_treated)
+                      # -------------------------------- SCREENING FOR CERVICAL CANCER USING XPERT HPV TESTING AND VIA---------------
+                      # A subset of women aged 30-50 will receive a screening test

Collaborator

mnjowe Apr 2, 2024

should the boundaries be included(30yrs and 50yrs) selection on Ln 720 is excluding them

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

+                          self.sim.schedule_event(
+                              InstantaneousDeath(self.module, person_id, "CervicalCancer"), self.sim.date
+                          )
+                          df.loc[selected_to_die, 'ce_date_death'] = self.sim.date

Collaborator

mnjowe Apr 2, 2024

is date of death not recorded already in demography?

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +991 to +993

+                      # Ignore this event if the person is no longer alive:
+                      if not df.at[person_id, 'is_alive']:
+                          return hs.get_blank_appt_footprint()

Collaborator

mnjowe Apr 2, 2024

@tbhallett is this not being handled already by the Healthsystem? If yes then I think we will save some computational time by removing it here and in all other HSI's.

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +1128 to +1150

+                      if random_value <= p['prob_cure_stage1'] and df.at[person_id, "ce_date_treatment"] == self.sim.date:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'none'
+                          df.at[person_id, 'ce_current_cc_diagnosed'] = False
+                      else:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'stage1'
+                      if random_value <= p['prob_cure_stage2a'] and df.at[person_id, "ce_date_treatment"] == self.sim.date:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'none'
+                          df.at[person_id, 'ce_current_cc_diagnosed'] = False
+                      else:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'stage2a'
+                      if random_value <= p['prob_cure_stage2b'] and df.at[person_id, "ce_date_treatment"] == self.sim.date:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'none'
+                          df.at[person_id, 'ce_current_cc_diagnosed'] = False
+                      else:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'stage2b'
+                      if random_value <= p['prob_cure_stage3'] and df.at[person_id, "ce_date_treatment"] == self.sim.date:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'none'
+                          df.at[person_id, 'ce_current_cc_diagnosed'] = False
+                      else:
+                          df.at[person_id, "ce_hpv_cc_status"] = 'stage3'

Collaborator

mnjowe Apr 2, 2024

Are we assuming that patients do recover from cervical cancer same day they receive treatment?

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +1273 to +1282

+                      # Schedule another instance of the event for one month
+                      hs.schedule_hsi_event(
+                          hsi_event=HSI_CervicalCancer_PalliativeCare(
+                              module=self.module,
+                              person_id=person_id
+                          ),
+                          topen=self.sim.date + DateOffset(months=1),
+                          tclose=None,
+                          priority=0
+                      )

Collaborator

mnjowe Apr 2, 2024

@tbhallett don't we have frequency argument in HSI events? could be useful here

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

Comment on lines +1295 to +1296

		self.repeat = 30
		super().__init__(module, frequency=DateOffset(days=self.repeat))

Collaborator

mnjowe Apr 2, 2024

how about changing days to months i.e.

Suggested change

      
                    self.repeat = 30
          
                    super().__init__(module, frequency=DateOffset(days=self.repeat))
          
                    self.repeat = 1
          
                    super().__init__(module, frequency=DateOffset(months=self.repeat))

mnjowe reviewed

View reviewed changes

src/tlo/methods/cervical_cancer.py

+                      self.repeat = 30
+                      super().__init__(module, frequency=DateOffset(days=self.repeat))
+                  def apply(self, population):

Collaborator

mnjowe Apr 2, 2024

I think the use of groupby can be more efficient in computing the statistics below?

mnjowe reviewed

View reviewed changes

src/tlo/methods/healthsystem.py

Comment on lines +1475 to +1476

		# warnings.warn(UserWarning(f"Couldn't find priority ranking for TREATMENT_ID \n"
		# f"{hsi_event.TREATMENT_ID}"))

Collaborator

mnjowe Apr 2, 2024

I think this should be uncommented

mnjowe reviewed

View reviewed changes

src/tlo/methods/hiv.py

@@ @@ -40,7 +40,7 @@ @@
               from tlo.util import create_age_range_lookup
               logger = logging.getLogger(__name__)
-              logger.setLevel(logging.INFO)
+              logger.setLevel(logging.CRITICAL )

Collaborator

mnjowe Apr 2, 2024 •

edited

Any reason you are setting this to critical? I think if we don't want .INFO logs from Hiv module, we can configure cervical cancer analyses to only allow logging.INFO from cervical cancer module

mnjowe reviewed

View reviewed changes

src/tlo/methods/tb.py

@@ @@ -20,7 +20,7 @@ @@
               from tlo.util import random_date
               logger = logging.getLogger(__name__)
-              logger.setLevel(logging.INFO)
+              logger.setLevel(logging.CRITICAL)

Collaborator

mnjowe Apr 2, 2024

same here, we can configure this in cervical cancer analyses. This should be as it was

Suggested change

      
            logger.setLevel(logging.CRITICAL)
          
            logger.setLevel(logging.INFO)

mnjowe reviewed

View reviewed changes

src/tlo/simulation.py

@@ @@ -16,7 +16,7 @@ @@
               from tlo.progressbar import ProgressBar
               logger = logging.getLogger(__name__)
-              logger.setLevel(logging.INFO)
+              logger.setLevel(logging.CRITICAL)

Collaborator

mnjowe Apr 2, 2024

same here.

Suggested change

      
            logger.setLevel(logging.CRITICAL)
          
            logger.setLevel(logging.INFO)

mnjowe reviewed

View reviewed changes

src/tlo/simulation.py

                       self.rng = np.random.RandomState(np.random.MT19937(self._seed_seq))
                   def configure_logging(self, filename: str = None, directory: Union[Path, str] = "./outputs",
-                                        custom_levels: Dict[str, int] = None, suppress_stdout: bool = False):
+                                        custom_levels: Dict[str, int] = None, suppress_stdout: bool = True):

Collaborator

mnjowe Apr 2, 2024

I think we can also do this in analyses file

Suggested change

      
                                      custom_levels: Dict[str, int] = None, suppress_stdout: bool = True):
          
                                      custom_levels: Dict[str, int] = None, suppress_stdout: bool = False):

mnjowe reviewed

View reviewed changes

src/tlo/simulation.py

Comment on lines +231 to +232

		# print(stats_dict)

Collaborator

mnjowe Apr 2, 2024

Suggested change

# print(stats_dict)

mnjowe reviewed

View reviewed changes

tests/test_cervical_cancer.py

Comment on lines +177 to +180

+              # todo: not sure what is wrong with this assert as I am fairly certain the intended assert is true
+              #   assert set(sim.modules['SymptomManager'].who_has('vaginal_bleeding')).issubset(
+              #       df.index[df.ce_cc_ever])

Collaborator

mnjowe Apr 2, 2024

This test is just okay. It is failing because of how test test_check_progression_through_stages_is_blocked_by_treatment has been configured.

mnjowe reviewed

View reviewed changes

tests/test_cervical_cancer.py

Comment on lines +360 to +368

+                  sim.population.props.loc[population_of_interest, "ce_hpv_cc_status"] = 'stage1'
+                  # force that they are all symptomatic
+                  sim.modules['SymptomManager'].change_symptom(
+                      person_id=population_of_interest.index[population_of_interest].tolist(),
+                      symptom_string='vaginal_bleeding',
+                      add_or_remove='+',
+                      disease_module=sim.modules['CervicalCancer']
+                  )

Collaborator

mnjowe Apr 2, 2024

This will make all >15 yrs females be on stage 1 and have cancer symptoms yes BUT it will not automatically make everyone deemed as ever had cervical cancer in the code Hence check
assert set(sim.modules['SymptomManager'].who_has('vaginal_bleeding')).issubset( df.index[df.ce_cc_ever]) is likely to fail

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment