Tutorial: EMPIAR-10499 70S ribosomes#
Here we present tutorials for processing heterogeneous ribosome data from cryo-ET benchmark dataset EMPIAR-10499, as described in our tomoDRGN manuscript. The tutorials cover the following stages of processing:
upstream processing and obtaining input data for tomoDRGN
validating that particles and metadata were extracted correctly:
tomodrgn backproject_voxel
, ortomodrgn train_nn
withtomodrgn convergence_nn
learning structural heterogeneity within the dataset:
tomodrgn train_vae
withtomodrgn convergence_vae
analyzing structural heterogeneity within the dataset:
tomodrgn analyze
,tomodrgn eval_vol
andtomodrgn analyze_volumes
, and external tools including SIREn and MAVEnvisualizing structural heterogeneity patterns in the tomogram’s spatial context: tomoDRGN’s interactive visualization jupyter notebook and
tomodrgn subtomo2chimerax
isolating particle subsets of interest: tomoDRGN’s interactive filtering jupyter notebook and
tomodrgn filter_star
taking homogeneous particle subsets back into external STA tools for further refinement
With these steps as building blocks, many additional types of analyses are possible.
Note
These tutorials were originally written using data processed using the Warp v1 -> RELION v3 -> M STA pipeline. Therefore, the command syntax used in the tutorials is specific to “Warp v1 style inputs”. Alternative syntax for other STA pipelines, including “WarpTools style inputs”, are given in the command usage section.