
Reducing refractory period violation for tetrode data #311

Open
s-steinberg opened this issue Nov 7, 2020 · 15 comments

@s-steinberg

I am new to SpyKING CIRCUS and I much appreciate the software. I have a question about sorting quality.
I have 8-tetrode / 32-channel recordings, and the raw data are already band-pass filtered for spike analysis (1-6 kHz). Some channels have a good S/N ratio. Here are example raw traces from one tetrode.
[image: raw traces from one tetrode]

These traces are the best quality I can get with my recording setup, and I am testing the algorithm on these data.

With the default settings, I got relatively few clusters (<3 clusters per channel), and none of the clusters were well separated, i.e. there was contamination in the refractory period. I thought the algorithm was merging too much, so I played around with parameters to increase the number of resulting clusters, because manual merging is easier than splitting.
I found that lowering [clustering] sensitivity increases the number of clusters most dramatically. Using [clustering] sensitivity = 0.8 gives about 5 or more clusters per channel. However, most clusters still have refractory period violations.
[image: correlograms of the resulting clusters]

In the raw data trace, I can visually tell that there are a few distinct populations of spike amplitudes. However, I noticed that the algorithm puts spikes with apparent differences in amplitude into one cluster. In some cases, one cluster has two distinct amplitude peaks:
[image: amplitude distribution of one cluster]

So far I have played around with the following parameters (a .params excerpt with these values follows the list):

  • Skipping filtering and merging
  • Increasing [clustering] nb_elts to 0.95 takes longer to compute but does not noticeably improve isolation
  • Increasing [clustering] cc_merge to 1 does not noticeably improve isolation
  • Decreasing [clustering] sensitivity to 0.8 or even 0.5 increases computation time but clearly increases the number of clusters
  • Decreasing [clustering] dispersion to (2, 2) does not noticeably improve isolation
  • Decreasing [detection] N_t to 2 ms slightly improves isolation
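
For reference, the relevant sections of my .params file with the values I tried look roughly like this (only the parameters mentioned above are shown; everything else is left at the defaults):

```ini
[detection]
N_t         = 2       # template width in ms; 2 gave a slight improvement in isolation

[clustering]
nb_elts     = 0.95    # longer computation, isolation not much better
cc_merge    = 1       # isolation not much better
sensitivity = 0.8     # longer computation, clearly more clusters
dispersion  = (2, 2)  # isolation not much better
```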

I would be grateful for any advice on how to improve the sorting quality. Thank you very much in advance.

@yger
Member

yger commented Nov 7, 2020

What version of the software are you using? If you want quick feedback, the best would be to share your data. Would that be possible for you? So far, what you did with the parameters seems legitimate, except that in recent versions the dispersion parameter is not used anymore.

@s-steinberg
Author

Thank you for the quick reply. I'm using version 1.0.1.
Here is the link to my .dat, .params and .prb files.
https://1drv.ms/u/s!AixpdTh0kn8rgas2KKR4780GSUU_dg?e=wuOkdY

@yger
Member

yger commented Nov 8, 2020

I had a look at your data and the .params file you sent me. First, N_t = 1 is way too small, and we can see your templates are cropped. Cropped templates mean suboptimal clustering/fitting, so bring the value back to 2 or 3. Secondly, to better understand what the software is doing, set make_plots = png in the [clustering] section. It will generate plots of the local clustering in the plot subfolder of the results folder. Using the master branch, everything looks rather fine to me, so which version of the software are you using? Please update to master, so that we are sure to be in sync.

From the debug plots, we can see that the clustering is performing rather decently, with no "obvious" errors. If you want to get more clusters, I would not play with sensitivity but rather with the merging_param value. You currently have 16 local merges; lowering merging_param will reduce that number, and you'll have more clusters (maybe at the cost of some over-clustering), so set it to 1 or even less. Regarding the final templates, you do have some without RPV and some with. This is also because quite a lot of them are defined on only a single channel. But I do not see the behavior that you are showing.
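
Concretely, in your .params file that would look something like this (exact values are up to you):

```ini
[detection]
N_t           = 3     # bring the template width back up so templates are not cropped

[clustering]
make_plots    = png   # write debug plots of the local clustering into the results folder
merging_param = 1     # fewer local merges -> more clusters
```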

@s-steinberg
Author

That sounds a little promising. I now use the latest master branch and tried the default parameters while changing merging_param. I have 25 active channels (7 dead channels) and got 29 clusters with the default parameters and spike_thresh = 4.25. All clusters have RPV (2 ms), and firing rates range from 5.26 Hz to 162.97 Hz.
Increasing spike_thresh to 6 resulted in 27 clusters with firing rates in the range [1.72, 17.98] Hz, which is reasonable for my recording, but still all clusters have RPV. I wonder what makes the difference.
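
As a side note, this is roughly how I count RPV per cluster from the sorted spike times (a minimal numpy sketch I use for my own checks; spike_times_s is a hypothetical array of one cluster's spike times in seconds, not an output of SpyKING CIRCUS itself):

```python
import numpy as np

def rpv_fraction(spike_times_s, refractory_ms=2.0):
    """Fraction of inter-spike intervals shorter than the refractory period."""
    isi_ms = np.diff(np.sort(np.asarray(spike_times_s))) * 1000.0
    if isi_ms.size == 0:
        return 0.0
    return float(np.mean(isi_ms < refractory_ms))

# Toy example: a ~10 Hz unit with a few injected 1 ms doublets (contamination).
rng = np.random.default_rng(0)
times = np.cumsum(rng.exponential(0.1, size=1000))
times = np.concatenate([times, times[:20] + 0.001])
print(f"RPV fraction: {rpv_fraction(times):.3f}")
```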

Here is my .params file
https://1drv.ms/u/s!AixpdTh0kn8rgas8S1oqHOGwW0kXtw?e=DJ3kAv

Thank you for your time.

@yger
Member

yger commented Nov 9, 2020

Decreasing spike_thresh to a very low value such as 4.25 does not mean that you'll get more spikes, nor that you'll get the clusters you had with a higher spike_thresh (at 6) plus some new ones. This might be counter-intuitive, but let me explain. For the clustering, we select 10000 spikes per electrode to cluster from. If spike_thresh is too low, these selected spikes might be just noise, and the large spikes might be under-represented. We have ways to somewhat compensate for that, but this is not perfect. This is why I would not advise decreasing spike_thresh too much, unless your data have a very high SNR. I'll look again at the data regarding the RPV.
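
To illustrate the sampling argument with made-up numbers (a toy sketch only, not the actual SpyKING CIRCUS selection code): suppose an electrode yields many near-threshold noise crossings and only a couple of thousand genuine large spikes, and a random subset of 10000 detected events is kept for clustering.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical peak amplitudes, in multiples of the noise MAD.
noise_events = rng.normal(4.5, 0.5, size=200_000)  # near-threshold noise crossings
true_spikes  = rng.normal(9.0, 1.0, size=2_000)    # genuine, large spikes

for thresh in (4.25, 6.0):
    detected = np.concatenate([noise_events[noise_events > thresh],
                               true_spikes[true_spikes > thresh]])
    # Random subset of 10000 detections used for the clustering step.
    subset = rng.choice(detected, size=min(10_000, detected.size), replace=False)
    frac_large = np.mean(subset > 7.0)  # crude proxy for "real spike"
    print(f"thresh={thresh}: {detected.size} detections, "
          f"{frac_large:.1%} of the clustering subset are large spikes")
```

With the low threshold, the 10000-event subset is dominated by noise and the large spikes are under-represented; with the higher threshold, almost every selected event is a real spike.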

@s-steinberg
Author

I have tried several other recordings, but all clusters had the RPV issue. Could you possibly share the parameter file that worked well for you? Thank you very much.

@yger
Member

yger commented Nov 16, 2020

Use the master branch (1.0.2) and the attached parameter file (renamed as .txt)

data.txt

@s-steinberg
Author

Thanks a lot for the parameter file. It resulted in an error during 'estimating the templates'. Sorry if I am making a stupid mistake.

log_1.0.2.txt
errorMessage_1.0.2.txt

@yger
Member

yger commented Nov 16, 2020

Yes, I noticed the bug; it has been patched. Sorry for that. Please upgrade to master, and I'll make a release ASAP.

@s-steinberg
Author

Thanks for the update. In my environment, all resulting clusters still have RPV...
I can see that spikes with quite different waveforms end up in the same cluster.
[image: RPV_1.0.2]

I am not sure what goes wrong in my environment. I attach the log file. The log file is not complete, as it had tens of thousands of lines with DEBUG [matplotlib.font_manager] and DEBUG [matplotlib.ticker], which I assume are not relevant here, and I deleted them manually (but why are there so many?). I may have accidentally deleted lines with DEBUG [circus.clustering].
log.txt
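
(For what it's worth, next time I will strip those matplotlib DEBUG lines with a small script instead of deleting them by hand, something like the following, where full.log is just a placeholder name for the original log:)

```python
# Drop the noisy matplotlib DEBUG lines from the log before attaching it.
with open("full.log") as src, open("log.txt", "w") as dst:
    dst.writelines(line for line in src
                   if "DEBUG [matplotlib.font_manager]" not in line
                   and "DEBUG [matplotlib.ticker]" not in line)
```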

Thanks a lot for your time.

@yger
Member

yger commented Dec 4, 2020

If you are a Windows user, a silent bug recently spotted on Windows only might be the reason for the problem... Sorry for that; there was an int overflow that did not trigger errors. Try upgrading to master, and hopefully it will be better.

@s-steinberg
Author

Thank you for the update. Meanwhile, I tried more systematic testing of the parameters, i.e. changing one parameter at a time, occasionally to an extreme value.
No single parameter change led to an obvious improvement, but here are some observations.

  • [whitening] max_elts = 1000. This does not change the number of clusters, but one cluster has relatively few RPV. In turn, there are many false positives (small-amplitude noise detected as spikes) and false negatives (large-amplitude spikes missed).

  • [whitening] output_dim = 3. This resulted in more clusters, but all with RPV. There are still many false positives. Changing this value to 1 leads to an error (phy does not open).

  • [clustering] smart_search = False. Although not quantified, there seem to be fewer false negatives (missed spikes).

  • [clustering] merging_param = 0.1. Although not quantified, there seem to be fewer false negatives.

  • [clustering] sensitivity = 1. This triples the number of clusters, but all have RPV. Changing this parameter to 0.1 causes an error in fitting. Changing it to 0.5 resulted in fewer clusters, two of which were sparse.

  • [clustering] fine_amplitude = False. This detects more spikes, with more RPV but fewer false positives.

  • [fitting] amp_limits = (0.8, 1.2) (with amp_auto = False). This resulted in the same number of clusters, with fewer spikes of similar amplitudes.

Changing other parameters alone did not give an obvious improvement.
My impression is that narrowing amp_limits helps the most to exclude contamination by spikes with very different amplitudes. However, there need to be many more clusters to prevent false negatives, which can be achieved by lowering merging_param and/or sensitivity.
In addition, I found that RPV is often due to artifacts detected next to each other. Currently I do not have sophisticated artifact detection beyond [triggers] clean_artefact, but I tried a quick-and-dirty method and used the artifact times as a .dead file. This dramatically reduced RPV.
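
For reference, the quick-and-dirty artifact step looks roughly like this (a rough sketch: artifact detection here is just a crude absolute-amplitude threshold on one channel, and the exact format and units expected for the dead-periods file should be double-checked against the SpyKING CIRCUS documentation):

```python
import numpy as np

def artifact_dead_periods(trace, sampling_rate, thresh, pad_ms=2.0):
    """Return (start, stop) times in ms around samples exceeding an absolute threshold."""
    idx = np.flatnonzero(np.abs(trace) > thresh)
    if idx.size == 0:
        return np.empty((0, 2))
    # Group consecutive threshold crossings into single artifact events.
    breaks = np.flatnonzero(np.diff(idx) > 1)
    starts = idx[np.concatenate(([0], breaks + 1))]
    stops = idx[np.concatenate((breaks, [idx.size - 1]))]
    pad = pad_ms / 1000.0 * sampling_rate
    periods = np.column_stack((starts - pad, stops + pad)) / sampling_rate * 1000.0
    return np.clip(periods, 0, None)

# Hypothetical usage: one "start stop" pair (in ms) per line.
# trace = ...  # one channel of the band-passed recording
# np.savetxt("artifacts.dead", artifact_dead_periods(trace, 30000, thresh=2000), fmt="%.3f")
```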

Is it feasible and does it make sense to have an option to assign spikes in the refractory period to separate clusters?

@yger
Member

yger commented Dec 6, 2020

But are you using Windows or not? Because, as I said, if you are, there was a bug in the amplitudes that could lead to noise contamination...

@s-steinberg
Author

Yes, I am using Windows. I will try with the latest commit again. Thank you.

@yger
Member

yger commented Dec 6, 2020

In the master branch, yes. Amplitudes should be much better (with the default parameter, i.e. fine_amplitude = True).
