Setup¶

Welcome to Spyglass, a DataJoint pipeline maintained by the Frank Lab at UCSF.

Spyglass will help you take an NWB file from raw data to analysis-ready preprocessed formats using DataJoint to (a) connect to a relational database (here, MySQL), and (b) automate processing steps. To use Spyglass, you'll need to ...

Set up your local environment
Connect to a database

Local environment¶

Codespace users can skip this step. Frank Lab members should first follow 'rec to nwb overview' steps on Google Drive to set up an ssh connection.

For local use, download and install ...

Python 3.9.

mamba as a replacement for conda. Spyglass installation is significantly faster with mamba.

wget "https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-$(uname)-$(uname -m).sh"
bash Miniforge3-$(uname)-$(uname -m).sh

VS Code with relevant python extensions, including Jupyter. Hold off on selecting your interpreter until after you make the environment with mamba.
git for downloading the repository, including notebooks.

In a terminal, ...

navigate to your project directory.
use git to download the Spyglass repository.
navigate to the newly downloaded directory.
create a mamba environment with either the standard environment.yml or the environment_position.yml, if you intend to use the full position pipeline. The latter will take longer to install.
open this notebook with VSCode

Commands for the steps above ...

cd /your/project/directory/ # 1
git clone https://github.com/LorenFrankLab/spyglass/ # 2
cd spyglass # 3
mamba env create -f environment.yml # 4
code notebooks/00_Setup.ipynb # 5

Note: Spyglass is also installable via pip and pypi with pip install spyglass-neuro, but downloading from GitHub will also other files accessible.

Next, within VSCode, select the kernel that matches your spyglass environment created with mamba. To use other Python interfaces, be sure to activate the environment: conda activate spyglass

See this guide for additional details on each of these programs and the role they play in using the pipeline.

Database Connection¶

You have a few options for databases.

Connect to an existing database.
Use GitHub Codespaces (coming soon...)
Run your own database with Docker

Once your database is set up, be sure to configure the connection with your dj_local_conf.json file.

Existing Database¶

Members of the Frank Lab will need to use DataJoint 0.14.2 (currently in pre-release) in order to change their password on the MySQL 8 server. DataJoint 0.14.2

git clone https://github.com/datajoint/datajoint-python
pip install ./datajoint-python

Members of the lab can run the dj_config.py helper script to generate a config like the one below.

cd spyglass
python config/dj_config.py <username> <base_path> <output_filename>

Outside users should copy/paste dj_local_conf_example and adjust values accordingly.

The base path (formerly SPYGLASS_BASE_DIR) is the directory where all data will be saved. See also docs for more information on subdirectories.

A different output_filename will save different files:

dj_local_conf.json: Recommended. Used for tutorials. A file in the current directory DataJoint will automatically recognize when a Python session is launched from this directory.
.datajoint_config.json or no input: A file in the user's home directory that will be referenced whenever no local version (see above) is present.
Anything else: A custom name that will need to be loaded (e.g., dj.load('x')) for each python session.

The config will be a json file like the following.

{
    "database.host": "lmf-db.cin.ucsf.edu",
    "database.user": "<username>",
    "database.password": "Not recommended for shared machines",
    "database.port": 3306,
    "database.use_tls": true,
    "enable_python_native_blobs": true,
    "filepath_checksum_size_limit": 1 * 1024**3,
    "loglevel": "INFO",
    "stores": {
        "raw": {
            "protocol": "file",
            "location": "/stelmo/nwb/raw",
            "stage": "/stelmo/nwb/raw"
        },
        "analysis": {
            "protocol": "file",
            "location": "/stelmo/nwb/analysis",
            "stage": "/stelmo/nwb/analysis"
        }
    },
    "custom": {
        "spyglass_dirs": {
            "base": "/stelmo/nwb/"
        }
    }
}

Spyglass will use the log level present in your DataJoint config to decide the appropriate logging level for this session. To change the messages you see, select from one of these options.

If you see an error saying Could not find SPYGLASS_BASE_DIR, try loading your config before importing Spyglass.

import datajoint as dj
dj.load('/path/to/config')
import spyglass

Running your own database¶

Setup Docker¶

First, install Docker.
Add yourself to the docker group so that you don't have to be sudo to run docker.
Download the docker image for datajoint/mysql:8.0.
```
docker pull datajoint/mysql:8.0
```

Note: For this demo, MySQL version won't matter. Some database management features of Spyglass, however, expect MySQL >= 8.

When run, this is referred to as a 'Docker container'
Next start the container with a couple additional pieces of info...
- Root password. We use tutorial.
- Database name. Here, we use spyglass-db.
- Port mapping. Here, we map 3306 across the local machine and container.
```
docker run --name spyglass-db -p 3306:3306 -e MYSQL_ROOT_PASSWORD=tutorial datajoint/mysql:8.0
```

For data to persist after terminating the container, attach a volume when running:

docker volume create dj-vol
docker run --name spyglass-db -v dj-vol:/var/lib/mysql -p 3306:3306 -e MYSQL_ROOT_PASSWORD=tutorial datajoint/mysql

Configure¶

The dj_local_conf_example.json contains all the defaults for a Docker connection. Simply rename to dj_local_conf.json and modify the contents accordingly. This includes the host, password and user. For Spyglass, you'll want to set your base path under custom:

{
  "database.host": "localhost",
  "database.password": "tutorial",
  "database.user": "root",
  "custom": {
    "database.prefix": "username_",
    "spyglass_dirs": {
      "base": "/your/base/path"
    }
  }
}

Loading the config¶

We can check that the paths are correctly set up by loading the config from the main Spyglass directory.

In [1]:

Copied!

import os
import datajoint as dj

if os.path.basename(os.getcwd()) == "notebooks":
    os.chdir("..")
dj.config.load("dj_local_conf.json")

from spyglass.settings import config

config
import os
import datajoint as dj

if os.path.basename(os.getcwd()) == "notebooks":
    os.chdir("..")
dj.config.load("dj_local_conf.json")

from spyglass.settings import config

config

Out[1]:

{'debug_mode': True,
 'prepopulate': True,
 'SPYGLASS_BASE_DIR': '/stelmo/nwb',
 'SPYGLASS_RAW_DIR': '/stelmo/nwb/raw',
 'SPYGLASS_ANALYSIS_DIR': '/stelmo/nwb/analysis',
 'SPYGLASS_RECORDING_DIR': '/stelmo/nwb/recording',
 'SPYGLASS_SORTING_DIR': '/stelmo/nwb/spikesorting',
 'SPYGLASS_WAVEFORMS_DIR': '/stelmo/nwb/waveforms',
 'SPYGLASS_TEMP_DIR': '/stelmo/nwb/tmp',
 'SPYGLASS_VIDEO_DIR': '/stelmo/nwb/video',
 'KACHERY_CLOUD_DIR': '/stelmo/nwb/kachery_storage',
 'KACHERY_STORAGE_DIR': '/stelmo/nwb/kachery_storage',
 'KACHERY_TEMP_DIR': '/stelmo/nwb/tmp',
 'KACHERY_ZONE': 'franklab.default',
 'FIGURL_CHANNEL': 'franklab2',
 'DJ_SUPPORT_FILEPATH_MANAGEMENT': 'TRUE',
 'KACHERY_CLOUD_EPHEMERAL': 'TRUE'}

Connect¶

Now, you should be able to connect to the database you set up.

Let's demonstrate with an example table:

In [2]:

Copied!

from spyglass.common import Nwbfile

Nwbfile()
from spyglass.common import Nwbfile

Nwbfile()

[2023-09-28 08:07:06,176][INFO]: Connecting root@localhost:3307

[2023-09-28 08:07:06,254][INFO]: Connected root@localhost:3307

Populate: Populating table DataAcquisitionDeviceSystem with data {'data_acquisition_device_system': 'SpikeGadgets'} using insert1.
Populate: Populating table DataAcquisitionDeviceAmplifier with data {'data_acquisition_device_amplifier': 'Intan'} using insert1.

Out[2]:

Table for holding the NWB files.

nwb_file_name name of the NWB file	nwb_file_abs_path
CH101_20210711_.nwb	=BLOB=
CH73_20211206_.nwb	=BLOB=
CH65_20211212_.nwb	=BLOB=
J1620210620_.nwb	=BLOB=
montague20200802_.nwb	=BLOB=
chimi20200304_.nwb	=BLOB=
Wallie20220913_.nwb	=BLOB=
mango20211203_.nwb	=BLOB=
peanut20201108_.nwb	=BLOB=
wilbur20210406_.nwb	=BLOB=
eliot20221022_.nwb	=BLOB=
Dan20211109_.nwb	=BLOB=

...

Total: 817

Up Next¶

Next, we'll try inserting data