2024 Feb 25 (Sun)

Went over the ms; need to fill in the new windows stuff. Removed all the Bayesian little-r estimation using Stan. Need to have the program print samples/estimates directly instead of writing them out. Need to switch out the serial interval from the Mexico plot with the constructed GI from Hampson2009 (proxy serial).

2024 Jan 17 (Wed)

Making a document specifically about the plots; it also contains our current thoughts

mm_plot.md

Based on data quality, we are thinking of changing the default value of minPeak to 15 (or possibly 16 specifically to exclude Perak‽)

2024 Jan 03 (Wed)

What do we want to do about random effects, etc?

Doing REs at the country level feels like a nightmare: possibly to fit, but also to interpret. Doing REs only at the phase level seems impossible (not enough info, and the additivity assumption is confusing).

So we are thinking for now of estimating r0 separately for each time series.

Another question is whether we want to estimate r0 at t=-∞, or at a time when the cumulative cases are estimated at 1. How much difference does it make? JD thinks it probably matters for some formulations but not others. Logistic seems stable-ish, so maybe stick with that. Meaning also: stick with egf, which does t=-∞.
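A quick way to get a feel for the size of the difference, as a sketch only: assume cumulative cases follow a logistic curve c(t) = K / (1 + exp(-r (t - t_mid))). The parametrization, the names `K` and `t_mid`, and the numbers below are illustrative assumptions, not taken from egf or from our fits. Under this curve the instantaneous growth rate is r (1 - c/K), so it tends to r as t goes to -∞ and equals r (1 - 1/K) at the time when c(t) = 1.

```python
import math

def logistic_cum(t, r, K, t_mid):
    """Cumulative cases under a logistic curve (illustrative parametrization)."""
    return K / (1.0 + math.exp(-r * (t - t_mid)))

def log_growth_rate(t, r, K, t_mid):
    """Instantaneous growth rate d/dt log c(t) = r * (1 - c(t)/K)."""
    return r * (1.0 - logistic_cum(t, r, K, t_mid) / K)

def time_at_cum(c, r, K, t_mid):
    """Time at which the curve reaches cumulative count c (needs 0 < c < K)."""
    return t_mid - math.log(K / c - 1.0) / r

# Made-up parameter values, for illustration only
r, K, t_mid = 0.5, 200.0, 20.0

t1 = time_at_cum(1.0, r, K, t_mid)             # time when cumulative cases = 1
rate_at_one = log_growth_rate(t1, r, K, t_mid)  # growth rate at that time
rate_early = log_growth_rate(-100.0, r, K, t_mid)  # stand-in for t = -infinity

print(f"rate at c=1: {rate_at_one:.4f}, rate far in the past: {rate_early:.4f}")
```

For the logistic the two definitions differ only by the factor 1 - 1/K, which is small whenever the final size is large; this says nothing about the other formulations.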

2023 Dec 25 (Mon)

Our new pipeline has three parameters that are used in window selection. The first set we looked at was minPeak=12, ratThresh=0.25, and minLength=6.

We decided quickly that minLength=6 was too big a departure from past work, and switched to minLength=5.
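To make the roles of the three parameters concrete, here is a rough sketch of one possible reading. These definitions are assumptions for illustration, not the pipeline's actual logic: a phase ends when the count falls to ratThresh times the running peak or lower, a phase is kept only if its peak reaches minPeak, and the window from phase start to peak must have at least minLength points. The function name `select_windows` and the series are made up.

```python
def select_windows(cases, min_peak=12, rat_thresh=0.25, min_length=5):
    """Return (start, peak) index pairs for candidate growth windows.

    Assumed semantics (hypothetical, not the real pipeline):
      - a phase ends once a count drops to <= rat_thresh * running peak;
      - a phase is kept only if its peak count is >= min_peak;
      - the start-to-peak window must contain >= min_length points.
    """
    windows = []
    start, n = 0, len(cases)
    while start < n:
        peak, end = start, n
        for i in range(start + 1, n):
            if cases[i] > cases[peak]:
                peak = i                      # new running peak
            elif cases[i] <= rat_thresh * cases[peak]:
                end = i                       # deep enough drop: phase ends here
                break
        if cases[peak] >= min_peak and peak - start + 1 >= min_length:
            windows.append((start, peak))
        start = end                           # next phase starts at the drop
    return windows

# Made-up monthly counts with two waves
cases = [1, 2, 4, 8, 16, 30, 12, 5, 6, 9, 14, 20, 14]
print(select_windows(cases))                  # both waves kept
print(select_windows(cases, min_length=6))    # second wave is too short
```

In this reading a low ratThresh makes it harder to end a phase, so a shallow dip between two waves leaves them in one phase, and raising minLength from 5 to 6 is enough to drop a short second wave.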

First set of observations:

We don't like the second phase of Kanagawa (it goes down almost as much as up); we could consider tweaking ratThresh (or adding something else) to trim the beginning. We could also consider increasing minPeak a little bit to drop it.
We think we're OK with losing Memphis, but should talk to Katie
Perak is another argument for increasing minPeak
Perak also points to a potential flaw in the algorithm: what should we do when the point-after-peak (which we want for egf) is classified as a separate phase? For now, we are dropping it, which doesn't seem stupid, but we should definitely note that we are doing that if we are.
Serengeti phase 2 is keeping a double wave as one outbreak (because ratThresh is low; is that what we want?)
Tokyo1 looks like a big mess to JD (both outbreaks are long and complicated)

2023 Dec 25 (Mon)

Maybe we should put this on hold. What will it take to finish it?

Finish windowing (see Oct notes)
Apply epigrowthfit
re-assess

JD rebuilt a data-processing pipeline; there are some sharp questions in monthly.md. We need to talk to Katie about this.

ML tried some egf experiments, but JD is pretty unhappy about the window interface and the window logic; we should talk to Mikael.

… We have hacked around the window stuff, but there is too much we don't understand about egf; should try to meet with Mikael instead of making ourselves crazy.

2023 Oct 27 (Fri)

We want the comparison with old window choices to be a side branch

JD wants to pre-screen series and break things with two “obvious” peaks

Mike wants to drop NYstate because it's not in the paper

Can we cite it? Is the data set public?
Mike: We can cite it, it is here. Note, KH did not have NYstate in the 2009 paper. The citation includes both Central NY (in KH's paper) and NYstate (not in KH's paper).
https://ajph.aphapublications.org/doi/epdf/10.2105/AJPH.38.1_Pt_1.50

Set minmax (variable minPeak) to 12, which excludes HK, but maybe it should be a bit higher (see below)

Some thoughts:

Break things that look like they have two peaks into two series
Consider a slightly higher break either in general or for split series (worried about two fits for Serengeti)
Should we start the second TS one step before the trough to avoid a bias that comes from starting from the very lowest point – we think yes

Should there be a criterion for window length? Yes; TODO when we do the splitting
Note that we are taking old windows and splitting them. This means that we've thrown away anything after the global peak before we split.
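A minimal sketch of the splitting idea, under stated assumptions: the two peak indices are supplied by hand, the trough is taken as the lowest point between them, and the second series starts one step before the trough. The function name `split_at_trough`, the `lead` argument, and the counts are all hypothetical.

```python
def split_at_trough(cases, peak1, peak2, lead=1):
    """Split a two-peak series into two series at the trough between the peaks.

    The first series runs through the trough; the second starts `lead`
    steps before the trough, so it does not begin at the very lowest point.
    """
    if not 0 <= peak1 < peak2 < len(cases):
        raise ValueError("need 0 <= peak1 < peak2 < len(cases)")
    # Index of the (first) lowest count between the two peaks
    trough = min(range(peak1, peak2 + 1), key=lambda i: cases[i])
    # Never start the second series at or before the first peak
    second_start = max(trough - lead, peak1 + 1)
    first = cases[: trough + 1]
    second = cases[second_start:]
    return first, second

# Made-up counts with peaks at indices 3 and 9
cases = [2, 5, 11, 20, 9, 4, 3, 7, 15, 26, 18]
first, second = split_at_trough(cases, peak1=3, peak2=9)
print(first)
print(second)
```

Because the old windows already stop at the global peak, a split applied to them would only ever see the part of the series up to that peak.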

2023 Oct 18 (Wed)

We are revisiting data series and window choices.

There are at least 3 data series that never have >10 cases per month. Can we just throw them out?? Yes.

Multi-peak

Kanagawa looks OK (we pick the first peak automatically)
Tokyo (Tokyo1 is the new name) wants to pick a two-peak window, so we need to do something
NYstatecounties seems similar to Tokyo

Let's take a look first at the old window selections

Still happy with throwing out 3 series that have <10 cases in each month
Serengeti does not peak; will change the code to use it up until the end
Tokyo2 also fluctuates wildly within window, maybe drop it manually?
NYstate used to use the small first peak.

Conclusion for the day:

auto-drop 3 small series (<10 cases as the max)
Fix Serengeti limit
worry about Tokyo1, Tokyo2, NYstate
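The auto-drop step could look something like the sketch below. It assumes the series are held as a mapping from a name to a list of monthly counts; the function name `drop_small_series`, the labels, and the counts are made up, and the cutoff is left as a parameter.

```python
def drop_small_series(series_by_name, min_peak=10):
    """Keep only series whose largest monthly count is at least min_peak."""
    return {
        name: counts
        for name, counts in series_by_name.items()
        if max(counts, default=0) >= min_peak
    }

# Made-up series, for illustration only
series = {
    "A": [0, 1, 3, 2, 1],        # never reaches 10: dropped
    "B": [2, 6, 14, 25, 9],      # kept
    "C": [1, 4, 9, 9, 3],        # peaks at 9: dropped
}
kept = drop_small_series(series)
print(sorted(kept))
```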
