Velocity estimation by piecewise linear regression #569

rpauszek · 2023-09-01T12:20:26Z

work in progress

Start with simple OLS linear regression if no breakpoints requested.
Set up model dataclass.

JoepVanlier · 2023-09-01T12:36:44Z

lumicks/pylake/kymotracker/detail/velocity.py

+        diff_term = x - breakpoints[:, np.newaxis]
+        heavi = np.vstack([np.heaviside(d, -1) for d in diff_term])
+        u = diff_term * heavi
+        v = -heavi
+        design_matrix = np.vstack((np.ones(x.size), x, u, v)).T
+
+        xtx = np.linalg.pinv(np.matmul(design_matrix.T, design_matrix))
+        coeffs = np.dot(
+            xtx,
+            np.dot(design_matrix.T, y),
+        )
+
+        fit = np.dot(design_matrix, coeffs)


I wonder if moving the linear fit part and the parameter remapping inside PiecewiseModel would make sense. Since it's a linear fit, the fit is fully determined from just the breakpoints and the data (which could be its constructor arguments).

The benefit would be that you could add a simple .plot() function to that model to see it (since you'd have all the pieces there to plot it already).

Then you have a self-contained PiecewiseModel with everything needed to fit (which can just happen on construction, making it immutable) and to assess its quality for a given set of breakpoints, all in a single dataclass.

bic, prediction, rss, breaks_in_range and external parameters could be properties.

The only thing that seems like it wouldn't fit quite as nicely is the exitflag; but that feels a bit like a property of the overarching algorithm rather than the PiecewiseLinearModel itself.
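To make the suggestion concrete, here is a minimal sketch of what such a self-contained model could look like. It is not the code in this PR: the class name `PiecewiseModelSketch` is hypothetical, the design matrix uses plain hinge terms `max(x - b, 0)` for fixed breakpoints rather than the `u`/`v` columns from the diff, and the BIC is the usual Gaussian form up to a constant. The parameter count `2 + 2 * n_breakpoints` is taken from the diff.

```python
from dataclasses import dataclass, field

import numpy as np


@dataclass(frozen=True, eq=False)
class PiecewiseModelSketch:
    """Hypothetical sketch: continuous piecewise linear fit for fixed breakpoints."""

    x: np.ndarray
    y: np.ndarray
    breakpoints: np.ndarray
    coeffs: np.ndarray = field(init=False)

    def __post_init__(self):
        # The fit is linear, so it is fully determined by the data and breakpoints.
        # A frozen dataclass needs object.__setattr__ to store the derived field.
        coeffs, *_ = np.linalg.lstsq(self._design_matrix(), self.y, rcond=None)
        object.__setattr__(self, "coeffs", coeffs)

    def _design_matrix(self):
        # Columns: intercept, x, and one hinge term max(x - b, 0) per breakpoint.
        hinges = [np.maximum(self.x - b, 0.0) for b in self.breakpoints]
        return np.column_stack([np.ones_like(self.x), self.x, *hinges])

    @property
    def prediction(self):
        return self._design_matrix() @ self.coeffs

    @property
    def rss(self):
        return float(np.sum((self.y - self.prediction) ** 2))

    @property
    def bic(self):
        # Same parameter count as n_coeffs in the diff: 2 + 2 * n_breakpoints.
        n_samples = self.x.size
        n_coeffs = 2 + 2 * len(self.breakpoints)
        return n_samples * np.log(self.rss / n_samples) + n_coeffs * np.log(n_samples)


# Noise-free example: slope 2 up to x = 4, then slope 2 - 3 = -1.
x = np.linspace(0.0, 10.0, 50)
y = 1.0 + 2.0 * x - 3.0 * np.maximum(x - 4.0, 0.0)
model = PiecewiseModelSketch(x, y, np.array([4.0]))
```

A `.plot()` method would fit in the same way, since `x`, `y`, and `prediction` are all available on the instance. The exit flag is deliberately left out, in line with the point above that it belongs to the surrounding optimization loop.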

JoepVanlier · 2023-09-01T13:08:43Z

lumicks/pylake/kymotracker/detail/velocity.py

+        from a uniform distribution. Ignored if `n_breakpoints == 0`.
+    """
+    n_samples = len(x)
+    n_coeffs = 2 + 2 * n_breakpoints


I do wonder whether BIC, treating the breakpoint as just another parameter, imposes a sufficient penalty. In a sense, it is a very different type of parameter. See, for example, the answer here: https://stats.stackexchange.com/questions/337852/aic-bic-for-a-segmented-regression-model

When you run this with many simulations, how consistently does the BIC selection procedure recover the correct model?
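One way to check this is a small Monte Carlo experiment. The sketch below is an assumption-laden stand-in, not the procedure in this PR: it only compares zero against one breakpoint, finds the single breakpoint by grid search over the sample positions instead of the iterative update, uses Gaussian noise, and computes BIC as `n * log(rss / n) + k * log(n)` with `k = 2 + 2 * n_breakpoints` as in the diff. All function names are hypothetical.

```python
import numpy as np


def fit_rss(x, y, breakpoints):
    """RSS of a continuous piecewise linear least-squares fit with fixed breakpoints."""
    hinges = [np.maximum(x - b, 0.0) for b in breakpoints]
    design = np.column_stack([np.ones_like(x), x, *hinges])
    coeffs, *_ = np.linalg.lstsq(design, y, rcond=None)
    return float(np.sum((y - design @ coeffs) ** 2))


def bic(rss, n_samples, n_breakpoints):
    n_coeffs = 2 + 2 * n_breakpoints  # same count as in the diff
    return n_samples * np.log(rss / n_samples) + n_coeffs * np.log(n_samples)


def select_n_breakpoints(x, y):
    """Choose 0 or 1 breakpoints by BIC; the breakpoint comes from a grid search."""
    n_samples = x.size
    bic_0 = bic(fit_rss(x, y, []), n_samples, 0)
    best_rss_1 = min(fit_rss(x, y, [b]) for b in x[2:-2])
    bic_1 = bic(best_rss_1, n_samples, 1)
    return 0 if bic_0 <= bic_1 else 1


def recovery_rate(true_n_breakpoints, n_trials=100, noise=0.5, seed=0):
    """Fraction of simulated traces for which BIC selects the true model."""
    rng = np.random.default_rng(seed)
    x = np.linspace(0.0, 10.0, 60)
    truth = 1.0 + 0.5 * x
    if true_n_breakpoints == 1:
        truth = truth + 1.5 * np.maximum(x - 5.0, 0.0)  # slope change at x = 5
    hits = 0
    for _ in range(n_trials):
        y = truth + rng.normal(scale=noise, size=x.size)
        hits += select_n_breakpoints(x, y) == true_n_breakpoints
    return hits / n_trials
```

Calling `recovery_rate(0)` gives the fraction of straight-line traces that are correctly left unsegmented, and `recovery_rate(1)` the fraction of single-break traces that are detected. Sweeping `noise` and the size of the slope change would show where the penalty starts to fail in either direction.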

JoepVanlier · 2023-09-01T13:09:29Z

lumicks/pylake/kymotracker/detail/velocity.py

+        v = -heavi
+        design_matrix = np.vstack((np.ones(x.size), x, u, v)).T
+
+        xtx = np.linalg.pinv(np.matmul(design_matrix.T, design_matrix))


There's a shorthand for matmul.

Suggested change

-        xtx = np.linalg.pinv(np.matmul(design_matrix.T, design_matrix))
+        xtx = np.linalg.pinv(design_matrix.T @ design_matrix)
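As a quick check that the two spellings are interchangeable, the sketch below builds a two-column design matrix on made-up data and compares both forms. It also shows `np.linalg.lstsq` as an alternative that avoids forming the normal equations at all; that alternative is an illustration here, not something proposed in this thread.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 20)
y = 2.0 + 3.0 * x + rng.normal(scale=0.01, size=x.size)
design_matrix = np.vstack((np.ones(x.size), x)).T

# Both spellings call the same matrix product and give the same Gram matrix.
gram_matmul = np.matmul(design_matrix.T, design_matrix)
gram_operator = design_matrix.T @ design_matrix
assert np.array_equal(gram_matmul, gram_operator)

# Normal equations as in the diff, written with @ throughout.
coeffs = np.linalg.pinv(design_matrix.T @ design_matrix) @ design_matrix.T @ y

# Alternative: solve the least-squares problem directly.
coeffs_lstsq, *_ = np.linalg.lstsq(design_matrix, y, rcond=None)
```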

JoepVanlier · 2023-09-01T13:11:33Z

lumicks/pylake/kymotracker/detail/velocity.py

+    slope_terms = np.hstack((alpha, beta))
+    slope_block = cov[1 : n_terms + 1, 1 : n_terms + 1]
+    slopes = np.array([np.sum(slope_terms[:j]) for j in range(1, n_terms + 1)])
+    slopes_std = np.array([np.sqrt(np.sum(slope_block[:j, :j])) for j in range(1, n_terms + 1)])


How close are these estimates to, say, a bootstrap with different realizations?
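A comparison along these lines could be set up as follows. This is a simplified sketch on a single straight segment, not the multi-segment case in the diff: the analytic standard error comes from the usual OLS covariance `sigma^2 * inv(X.T @ X)`, and the bootstrap resamples `(x, y)` pairs with replacement. The function names and the simulated data are assumptions made for illustration.

```python
import numpy as np


def ols(x, y):
    """Return the design matrix and least-squares coefficients for y = a + b * x."""
    design = np.column_stack([np.ones_like(x), x])
    coeffs, *_ = np.linalg.lstsq(design, y, rcond=None)
    return design, coeffs


def analytic_slope_std(x, y):
    design, coeffs = ols(x, y)
    residuals = y - design @ coeffs
    # Unbiased noise variance estimate: RSS / (n_samples - n_coeffs).
    sigma2 = residuals @ residuals / (x.size - design.shape[1])
    cov = sigma2 * np.linalg.inv(design.T @ design)
    return float(np.sqrt(cov[1, 1]))


def bootstrap_slope_std(x, y, n_boot=1000, seed=0):
    rng = np.random.default_rng(seed)
    slopes = np.empty(n_boot)
    for i in range(n_boot):
        # Resample (x, y) pairs with replacement and refit.
        idx = rng.integers(0, x.size, size=x.size)
        slopes[i] = ols(x[idx], y[idx])[1][1]
    return float(np.std(slopes, ddof=1))


rng = np.random.default_rng(1)
x = np.linspace(0.0, 10.0, 100)
y = 1.0 + 0.5 * x + rng.normal(scale=0.3, size=x.size)
analytic = analytic_slope_std(x, y)
bootstrap = bootstrap_slope_std(x, y)
```

For the piecewise case, the same resampling loop would wrap the full fit including the breakpoint search, so the bootstrap spread would also include breakpoint uncertainty. The summed covariance block in `slopes_std` may not capture that part.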

JoepVanlier · 2023-09-01T13:34:28Z

lumicks/pylake/kymotracker/detail/velocity.py

+            break
+
+        # update breakpoints and check in bounds
+        breakpoints = gamma / beta + breakpoints


Any reason why we're not just doing:

Suggested change

-        breakpoints = gamma / beta + breakpoints
+        breakpoints += gamma / beta
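The two forms are not strictly equivalent for NumPy arrays, which may or may not matter here depending on how `breakpoints` is created (the excerpt does not show that). The sketch below uses made-up values to show the two differences: the in-place form writes into the array that was passed in, and it cannot change the dtype.

```python
import numpy as np

gamma = np.array([0.2, -0.1])
beta = np.array([2.0, 1.0])

# Rebinding: the right-hand side is a new array, so the original is untouched.
initial = np.array([3.0, 7.0])
breakpoints = initial
breakpoints = gamma / beta + breakpoints
assert np.array_equal(initial, [3.0, 7.0])

# In-place: the update writes into the same buffer the caller still holds.
initial_inplace = np.array([3.0, 7.0])
breakpoints_inplace = initial_inplace
breakpoints_inplace += gamma / beta
assert breakpoints_inplace is initial_inplace

# In-place addition cannot change dtype: adding floats to an int array raises.
int_breakpoints = np.array([3, 7])
try:
    int_breakpoints += gamma / beta
    raised = False
except TypeError:
    raised = True
```

If the initial guesses come from the caller, copying them once before the loop (for example with `np.array(..., dtype=float)`) would make the in-place update safe on both counts.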

rpauszek added 4 commits August 31, 2023 13:35

velocity: add fit_piecewise_continuous (no breakpoints)

a1a45cc

Start with simple OLS linear regression if no breakpoints requested.
Set up model dataclass.

velocity: add _optimize_breakpoints

b37c824

restarts and docstrings

b43c088

pull out coefficient conversions and error estimates

233bca1

JoepVanlier reviewed Sep 1, 2023
