Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Failure of export_to_phy from existing mountainsort cluster data

See original GitHub issue

I have a use case where I take already data already sorted and curated with MountainSort, and funnel through SpikeInterface API to generate phy files…

import json, os, spikeinterface
import spikeinterface.core.waveform_extractor as wave
import spikeinterface.exporters.to_phy as phy
import spikeinterface.extractors.mdaextractors as mda

# Point to a prv recording and the params file
local_path = '/Volumes/GenuDrive/RY16_direct/MountainSort/RY16_36.mountain/RY16_36.nt1.mountain'
prv_file = os.path.join(local_path, 'raw.mda.prv')
params_file = os.path.join(local_path, 'params.json')

# Derive the properties in those json dicts
with open(prv_file, 'r') as F:
    raw_dict = json.load(F)
with open(params_file, 'r') as F:
    params_dict = json.load(F)

# Create the mda object
mda_reecording = mda.read_mda_recording(local_path,
                                   raw_fname=raw_dict['original_path'])
mda_reecording.annotate(is_filtered=True) # we've already filtered it, so mark it thus

# Derive the spikes file
firings_file = os.path.join(local_path, 'firings_raw.mda')
sorted_spikes = mda.read_mda_sorting(firings_file, sampling_frequency=params_dict['samplerate'])

# Create waveforms per spike
waveform_file = os.path.join(local_path, 'waveforms')
waveforms = wave.extract_waveforms(mda_reecording, sorted_spikes, waveform_file)

# Export to phy
phyplace = os.path.join(local_path, "phy")
phy.export_to_phy(waveforms, phyplace)

Which is great: I like phy a bit more than qt-mountainview curation. But for some of my data, this triggers an error in the export_phy step. Namely, recording chunk end points can overshoot the mda N1 x N2 size bounds, raising an error.

Disclaimer: I didn’t fully read the code in the execution stack to ensure this is kosher. But bounding the end to the true sample here fixes it. In other words, it works if I alter line 55 in job_tools.py …

def devide_recording_into_chunks(recording, chunk_size):
 51     all_chunks = []
 52     for segment_index in range(recording.get_num_segments()):
 53         num_frames = recording.get_num_samples(segment_index)
 54         chunks = divide_segment_into_chunks(num_frames, chunk_size)
 55         #all_chunks.extend([(segment_index, frame_start, frame_stop) for frame_start, frame_stop in chunks])
 57         # modification in the the next two lines 
 56         sample_max = recording.get_num_samples()
 57         all_chunks.extend([(segment_index, frame_start, np.min((frame_stop, sample_max)) for frame_start, frame_stop in chunks])
 58     return all_chunks

Issue Analytics

State:
Created 2 years ago
Comments:21 (16 by maintainers)

Top GitHub Comments

1reaction

samuelgarciacommented, Sep 16, 2021

Did you try my working branch ?

pip uninstall spikeinterface
git clone https://github.com/samuelgarcia/spikeinterface.git
cd spikeinterface
git checkout some_fixes
python setup.py install

On my side there is no bug anymore.

1reaction

alejoe91commented, Sep 12, 2021

I think this could be fixed by setting end_frame=num_samples if end_frame>num_samples in the get_traces function of the MdaRecordingSegment 😃

Top Results From Across the Web

Sorting your own data — MountainSort 1.0.0 documentation

The first step of spike sorting using MountainSort is to prepare a raw M×N timeseries dataset in .mda format. Here M is the...

Seg fault when computing cluster isolation metrics #190 - GitHub

I'm running ms3 on a large dataset (385 channels, 60GB raw data file). ... mountainsort.isolation_metrics:: Computing cluster metrics...

A fully automated approach to spike sorting - PMC - NCBI

In this work, we set out to develop a fully automated spike sorting algorithm having error rates that are comparable to or lower...

Tools for the next generation of extracellular physiology

numbers of simultaneously recorded neurons, though current approaches have ... Figure 2: Fully-automated MountainSort produces clusters comparable to manual.

Evaluation and resolution of many challenges of ... - bioRxiv

We cluster the spike clips in PCA space, following a modified version of the MountainSort “isocut” clustering algorithm (Chung et al. 2017).