Export

Intro

Developer Note: if you plan to make a PR in the future, copy this notebook and use the gitignore prefix temp to avoid future conflicts.

This is one notebook in a multi-part series on Spyglass.

To set up your Spyglass environment and database, see the Setup notebook
To insert data, see the Insert Data notebook
For additional info on DataJoint syntax, including table definitions and inserts, see these additional tutorials
For information on what's going on behind the scenes of an export, see the documentation

In short, Spyglass can export one or more subsets of the database required for a specific analysis, as long as you do the following:

Inherit SpyglassMixin for all custom tables.
Run only one export at a time.
Start and stop each export logging process.

NOTE: For demonstration purposes, this notebook relies on a more populated database to highlight restriction merging capabilities of the export process. Adjust the restrictions to suit your own dataset.

Imports

Let's start by connecting to the database and importing some tables that might be used in an analysis.

In [1]:

import os
import datajoint as dj

# change to the upper level folder to detect dj_local_conf.json
if os.path.basename(os.getcwd()) == "notebooks":
    os.chdir("..")
dj.config.load("dj_local_conf.json")  # load config for database connection info

from spyglass.common.common_usage import Export, ExportSelection
from spyglass.lfp.analysis.v1 import LFPBandV1
from spyglass.position.v1 import TrodesPosV1
from spyglass.spikesorting.v1.curation import CurationV1

[2024-03-28 16:32:49,766][INFO]: Connecting root@localhost:3309
[2024-03-28 16:32:49,773][INFO]: Connected root@localhost:3309
/home/cb/miniconda3/envs/spy/lib/python3.9/site-packages/torch/cuda/__init__.py:83: UserWarning: CUDA initialization: CUDA unknown error - this may be due to an incorrectly set up environment, e.g. changing env variable CUDA_VISIBLE_DEVICES after program start. Setting the available devices to be zero. (Triggered internally at  ../c10/cuda/CUDAFunctions.cpp:109.)
  return torch._C._cuda_getDeviceCount() > 0

Export Tables

The ExportSelection table will populate while we conduct the analysis. For each file opened and each fetch call, an entry will be logged in one of its part tables.

In [2]:

ExportSelection()

Out[2]:

export_id	paper_id	analysis_id	spyglass_version	time

Total: 0

In [3]:

ExportSelection.Table()

Out[3]:

export_id	table_id	table_name	restriction

Total: 0

In [4]:

ExportSelection.File()

Out[4]:

export_id	analysis_file_name

Total: 0

Exports are organized around paper and analysis IDs. A single export will be generated for each paper, but we can delete/revise logs for each analysis before running the export. When we're ready, we can run the populate_paper method of the Export table. By default, export logs will ignore all tables in this common_usage schema.

Logging

There are a few restrictions to keep in mind when export logging:

You can only run ONE export at a time.
All tables must inherit SpyglassMixin.

How to inherit SpyglassMixin

DataJoint tables all inherit from one of the built-in table types.

class MyTable(dj.Manual):
    ...

To inherit the mixin, add it inside the class's parentheses, before the DataJoint class. This can be done for existing tables without dropping them, as long as the change is made before export logging starts.

import datajoint as dj
from spyglass.utils import SpyglassMixin


class MyTable(SpyglassMixin, dj.Manual):
    ...
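The order matters because Python looks up methods left to right across base classes, so a mixin listed first can intercept a call such as fetch and then hand off to the DataJoint class. The sketch below illustrates that mechanism with plain Python stand-ins. `BaseTable` and `LoggingMixin` are hypothetical names, and this is not how SpyglassMixin is actually implemented.

```python
class BaseTable:
    """Hypothetical stand-in for a DataJoint table class."""

    def fetch(self):
        return ["row1", "row2"]


class LoggingMixin:
    """Hypothetical mixin that records each fetch before delegating."""

    log = []  # shared record of which classes were fetched

    def fetch(self):
        LoggingMixin.log.append(type(self).__name__)
        return super().fetch()  # continue to the next class in the MRO


class MyTable(LoggingMixin, BaseTable):  # mixin first, base class second
    pass


rows = MyTable().fetch()
```

If the bases were swapped, `BaseTable.fetch` would be found first and the mixin's logging would never run.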

Let's start logging for 'paper1'.

In [5]:

paper_key = {"paper_id": "paper1"}

ExportSelection().start_export(**paper_key, analysis_id="analysis1")
my_lfp_data = (
    LFPBandV1  # Logging this table
    & "nwb_file_name LIKE 'med%'"  # using a string restriction
    & {"filter_name": "Theta 5-11 Hz"}  # and a dictionary restriction
).fetch()

[16:32:51][INFO] Spyglass: Starting {'export_id': 1}

We can check that it was logged. The logged restriction is stored as SQL, so it will look different from what we wrote in Python, but the output of preview_tables will look familiar.

In [6]:

ExportSelection.Table()

Out[6]:

export_id	table_id	table_name	restriction
1	1	`lfp_band_v1`.`__l_f_p_band_v1`	(( ((nwb_file_name LIKE 'med%%%%%%%%')))AND( ((`filter_name`="Theta 5-11 Hz"))))

Total: 1
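To see where the logged form comes from, it can help to render a dictionary restriction as SQL by hand. The helper below, `dict_to_sql`, is a hypothetical illustration that mirrors the backtick-and-quote format in the output above. It treats every value as a string and does no escaping, so it is not a substitute for DataJoint's own conversion.

```python
def dict_to_sql(restriction):
    """Render {"attr": "value"} pairs as an AND-joined SQL condition."""
    clauses = [f'(`{key}`="{value}")' for key, value in restriction.items()]
    return "(" + " AND ".join(clauses) + ")"


sql = dict_to_sql({"filter_name": "Theta 5-11 Hz"})
```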

And log more under the same analysis ...

In [7]:

my_other_lfp_data = (
    LFPBandV1
    & {
        "nwb_file_name": "mediumnwb20230802_.nwb",
        "filter_name": "Theta 5-10 Hz",
    }
).fetch()

Since these restrictions are mutually exclusive, we can check that they will be combined appropriately by previewing the logged tables...

In [8]:

ExportSelection().preview_tables(**paper_key)

Out[8]:

[FreeTable(`lfp_band_v1`.`__l_f_p_band_v1`)
 *lfp_merge_id  *filter_name   *filter_sampli *nwb_file_name *target_interv *lfp_band_samp analysis_file_ interval_list_ lfp_band_objec
 +------------+ +------------+ +------------+ +------------+ +------------+ +------------+ +------------+ +------------+ +------------+
 0f3bb01e-0ef6- Theta 5-10 Hz  1000           mediumnwb20230 pos 0 valid ti 100            mediumnwb20230 pos 0 valid ti 44e38dc1-3779-
 0f3bb01e-0ef6- Theta 5-11 Hz  1000           mediumnwb20230 pos 0 valid ti 100            mediumnwb20230 pos 0 valid ti c9b93111-decb-
  (Total: 2)]
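One way to picture the merge is as a logical OR over every restriction logged for the same table, so that rows matching either fetch appear in the preview. The sketch below assumes the restrictions are already SQL strings. `merge_restrictions` is a hypothetical helper, not the function Spyglass uses.

```python
def merge_restrictions(restrictions):
    """Join SQL restriction strings with OR, dropping exact duplicates."""
    unique = list(dict.fromkeys(restrictions))  # keeps first-seen order
    return " OR ".join(f"({r})" for r in unique)


logged = [
    '`filter_name`="Theta 5-11 Hz"',
    '`filter_name`="Theta 5-10 Hz"',
    '`filter_name`="Theta 5-11 Hz"',  # same fetch repeated, logged twice
]
merged = merge_restrictions(logged)
```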

Let's try adding a new analysis with a fetched nwb file. Starting a new export will stop the previous one.

In [9]:

ExportSelection().start_export(**paper_key, analysis_id="analysis2")
curation_nwb = (CurationV1 & "curation_id = 1").fetch_nwb()
trodes_data = (TrodesPosV1 & 'trodes_pos_params_name = "single_led"').fetch()

[16:32:51][INFO] Spyglass: Export 1 in progress. Starting new.
[16:32:51][INFO] Spyglass: Starting {'export_id': 2}

We can check that the right files were logged with the following...

In [10]:

ExportSelection().list_file_paths(paper_key)

Out[10]:

[{'file_path': '/home/cb/wrk/alt/data/raw/mediumnwb20230802_.nwb'},
 {'file_path': '/home/cb/wrk/alt/data/analysis/mediumnwb20230802/mediumnwb20230802_ALNN6TZ4L7.nwb'}]
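Before running an export, it may be worth confirming that every logged file is still present on disk. The sketch below assumes the list-of-dicts format shown in the output above. `missing_files` is a hypothetical helper, not part of Spyglass.

```python
from pathlib import Path


def missing_files(entries):
    """Return the logged paths that do not exist as files on this machine."""
    return [
        entry["file_path"]
        for entry in entries
        if not Path(entry["file_path"]).is_file()
    ]


# Possible usage with the call shown above (requires a database connection):
# missing_files(ExportSelection().list_file_paths(paper_key))
```

An empty list means every logged file was found.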

And stop the export with ...

In [11]:

ExportSelection().stop_export()
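Because every export must be both started and stopped, one option is to wrap the pair in a context manager so logging stops even if the analysis raises an error. This is a minimal sketch: `export_session` and `FakeSelection` are hypothetical names, and the keyword arguments mirror the start_export calls used earlier in this notebook.

```python
from contextlib import contextmanager


@contextmanager
def export_session(selection, paper_id, analysis_id):
    """Start export logging on entry and always stop it on exit."""
    selection.start_export(paper_id=paper_id, analysis_id=analysis_id)
    try:
        yield selection
    finally:
        selection.stop_export()


class FakeSelection:
    """Stand-in that only records calls; swap in ExportSelection() for real use."""

    def __init__(self):
        self.calls = []

    def start_export(self, paper_id, analysis_id):
        self.calls.append(("start", paper_id, analysis_id))

    def stop_export(self):
        self.calls.append(("stop",))


fake = FakeSelection()
with export_session(fake, "paper1", "analysis1"):
    pass  # fetch calls for the analysis would go here
```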

Populate

The Export table has a populate_paper method that will generate an export bash script for the tables required by your analysis, including all the upstream tables you didn't directly need, like Subject and Session.

NOTE: Populating the export for a given paper will overwrite any previous runs. For example, if you ran an export, and then added a third analysis for the same paper, generating another export will delete any existing bash script and Export table entries for the previous run.

In [12]:

Export().populate_paper(**paper_key)

[16:32:51][INFO] Spyglass: Export script written to /home/cb/wrk/alt/data/export/paper1/_ExportSQL_paper1.sh

By default the export script will be located in an export folder within your SPYGLASS_BASE_DIR. This default can be changed by adjusting your dj.config.
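To locate the script programmatically, we can rebuild the path from the pattern in the log message above. This sketch infers that pattern from a single log line and assumes the default export folder, so treat `expected_script_path` as a hypothetical helper rather than a Spyglass function.

```python
from pathlib import PurePosixPath


def expected_script_path(base_dir, paper_id):
    """Build <base_dir>/export/<paper_id>/_ExportSQL_<paper_id>.sh."""
    return (
        PurePosixPath(base_dir)
        / "export"
        / paper_id
        / f"_ExportSQL_{paper_id}.sh"
    )


# The base directory here is the one visible in the log message above.
path = expected_script_path("/home/cb/wrk/alt/data", "paper1")
```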

Frank Lab members will need the help of a database admin (e.g., Chris) to run the resulting bash script. The result will be a .sql file that anyone can use to replicate the database entries you used in your analysis.

Up Next

In the next notebook, we'll start working with ephys data with spike sorting.