Chapter 4. File Formats

Table of Contents

Data Files
The Cluster File
The Feature File
The Spike File
Parameter Files
The Parameter File
Obsolete Parameter Files

Klusters reads electrophysiological data from a set of files. Each of these files has a name of the form base.ext.n, where base stands for the common base name of all files belonging to a single recording session, ext is a three-letter extension describing the type of data stored in the file, and n stands for the ID of the particular group of electrodes from which the data was recorded. Thus, using this convention the cluster file for tetrode 2 of session CS150612 would be named CS150612.clu.2.

More specifically, the data files are organized as described in the following sections.

Note that the general and specific parameter files will likely be simplified and consolidated in the future.

Data Files

The Cluster File

Generic file name: base.clu.n

Format: ASCII

The cluster file lists the cluster ID of each spike, one per line. The first line contains the total number of clusters. Assuming N spikes (spike1...spikeN), this file looks like:

nbClusters
clusterID_spike1
clusterID_spike2
clusterID_spike3
...
clusterID_spikeN

Notice that the last line must end with a newline or carriage return.

The Feature File

Generic file name: base.fet.n

Format: ASCII, integer values

The feature file lists for each spike the PCA coefficients for each electrode, followed by the timestamp of the spike (more features can be inserted between the PCA coefficients and the timestamp). The first line contains the number of dimensions. Assuming N1 spikes (spike1...spikeN1), N2 electrodes (e1...eN2) and N3 coefficients (c1...cN3), this file looks like:

nbDimensions
c1_e1_spike1   c2_e1_spike1  ... cN3_e1_spike1   c1_e2_spike1  ... cN3_eN2_spike1   timestamp_spike1
c1_e1_spike2   c2_e1_spike2  ... cN3_e1_spike2   c1_e2_spike2  ... cN3_eN2_spike2   timestamp_spike2
...
c1_e1_spikeN1  c2_e1_spikeN1 ... cN3_e1_spikeN1  c1_e2_spikeN1 ... cN3_eN2_spikeN1  timestamp_spikeN1
The timestamp is expressed in multiples of the sampling interval. For instance, for a 20kHz recording (50 microsecond sampling interval), a timestamp of 200 corresponds to 200x0.000050s=0.01s from the beginning of the recording session.

Notice that the last line must end with a newline or carriage return.

The Spike File

Generic file name: base.spk.n

Format: 16 or 32 bit binary

The spike file describes the waveform of each spike. Assuming N1 spikes (spike1...spikeN1), N2 electrodes (electrode1...electrodeN2) and N3 samples per spike (sample1...sampleN3), the data are stored as follows:

sample1_electrode1_spike1
sample1_electrode2_spike1
...
sample1_electrodeN2_spike1
sample2_electrode1_spike1
sample2_electrode2_spike1
...
sample2_electrodeN2_spike1
...
sampleN3_electrodeN2_spike1
...
sample1_electrode1_spikeN1
...
sampleN3_electrodeN2_spikeN1